DSpace Repository

Visual Object Recognition from Videos ( VORV )

Show simple item record

dc.contributor.author Aima Zahoor, 01-134151-009
dc.contributor.author M.USMAN KHAN, 01-134151-061
dc.date.accessioned 2019-03-11T11:02:31Z
dc.date.available 2019-03-11T11:02:31Z
dc.date.issued 2018
dc.identifier.uri http://hdl.handle.net/123456789/8340
dc.description Supervised By Dr. Imran Siddiqi en_US
dc.description.abstract Object detection has many applications in computer vision, including image retrieval and video surveillance. The idea of object detection was presented with feature description and it outperformed existed algorithms and methodologies in pedestrian detection and it was a huge breakthrough in image processing field. In the past when it was required to find out a specific object from a video, one has to manually sit and watch out the complete video to take out the required part of that video manually after a lot of effort and time. As we have huge amount of data in the form of videos and to search something from videos require lots of time i.e. to go through the whole video, therefore we need to have an automated system to search specific content from videos. Videos are basically sequence of frames and frames are sequence of images. Therefore, we need to extract frames to recognize the objects in videos. Using the transfer learning technique on S SD and inception-v 2 and using Constitutional Neural Network we have trained our system to detect and recognize buildings. We have developed a system named as “Visual Object Recognition from Videos”, which recognize buildings from videos. We have fine-tuned CNN model on our dataset. As we know for Deep learning techniques, we need a huge amount of data for training to get more accurate results, therefore initially we collected more than 120 unique images of each class. Front-end is based on .NET framework C# to make it easy to use system for users. For recognition of object, we have used object detection model with Tensor Flow and also the inception-v 2 classifier, which recognize buildings in videos. For fast processing system, we have used GPU because training on images and then processing videos is very slow on CPU. The system proved precision nearly 93.26%. This is a problem solving desktop application to give ease to users of any regulatory body. en_US
dc.language.iso en_US en_US
dc.publisher Bahria University Islamabad Campus en_US
dc.relation.ispartofseries BS (CS);P-8023
dc.subject Computer science en_US
dc.title Visual Object Recognition from Videos ( VORV ) en_US
dc.type Project Reports en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account