Visual Object Recognition from Videos ( VORV )

Aima Zahoor, 01-134151-009; M.USMAN KHAN, 01-134151-061

DSpace Home
→
Final Year Project Report (BUIC)
→
Department of Computer Science and IT (BUIC-E-8)
→
BS (CS) (BUIC-FYP-E8)
→
View Item

dc.contributor.author	Aima Zahoor, 01-134151-009
dc.contributor.author	M.USMAN KHAN, 01-134151-061
dc.date.accessioned	2019-03-11T11:02:31Z
dc.date.available	2019-03-11T11:02:31Z
dc.date.issued	2018
dc.identifier.uri	http://hdl.handle.net/123456789/8340
dc.description	Supervised By Dr. Imran Siddiqi	en_US
dc.description.abstract	Object detection has many applications in computer vision, including image retrieval and video surveillance. The idea of object detection was presented with feature description and it outperformed existed algorithms and methodologies in pedestrian detection and it was a huge breakthrough in image processing field. In the past when it was required to find out a specific object from a video, one has to manually sit and watch out the complete video to take out the required part of that video manually after a lot of effort and time. As we have huge amount of data in the form of videos and to search something from videos require lots of time i.e. to go through the whole video, therefore we need to have an automated system to search specific content from videos. Videos are basically sequence of frames and frames are sequence of images. Therefore, we need to extract frames to recognize the objects in videos. Using the transfer learning technique on S SD and inception-v 2 and using Constitutional Neural Network we have trained our system to detect and recognize buildings. We have developed a system named as “Visual Object Recognition from Videos”, which recognize buildings from videos. We have fine-tuned CNN model on our dataset. As we know for Deep learning techniques, we need a huge amount of data for training to get more accurate results, therefore initially we collected more than 120 unique images of each class. Front-end is based on .NET framework C# to make it easy to use system for users. For recognition of object, we have used object detection model with Tensor Flow and also the inception-v 2 classifier, which recognize buildings in videos. For fast processing system, we have used GPU because training on images and then processing videos is very slow on CPU. The system proved precision nearly 93.26%. This is a problem solving desktop application to give ease to users of any regulatory body.	en_US
dc.language.iso	en_US	en_US
dc.publisher	Bahria University Islamabad Campus	en_US
dc.relation.ispartofseries	BS (CS);P-8023
dc.subject	Computer science	en_US
dc.title	Visual Object Recognition from Videos ( VORV )	en_US
dc.type	Project Reports	en_US