DSpace Repository

Semantic video retrieval using natural language queries

Show simple item record

dc.contributor.author Ahmed Hassan, 0 l -249 182-002
dc.date.accessioned 2020-12-15T01:18:42Z
dc.date.available 2020-12-15T01:18:42Z
dc.date.issued 2020
dc.identifier.uri http://hdl.handle.net/123456789/10541
dc.description Supervised by Dr. lmran Ahmed Siddiqi en_US
dc.description.abstract Video retrieval is searching and retrieving videos that are relevant to user-defined query. This is one one the most challenging and novel issue in multimedia search as well as in real life This research work is focused on employing the concepts of deep learning and natural language processing to solve the video retrieval problem, Thanks to Deep learning which enables us to make an end-to-end trainable system and avoiding the complexity of image and video processing techniques present in traditional systems. We are proposing a semantic-based video retrieval system in which the actual content of the video will be explored, persons in the video will be recognized, and description of the frames of video will be generated using image caption technique, it will help to understand the contents of the video. so combining both person recognition, and captioning models we will be able to have both person-related information and the description of the video frames. In retrieval phase, we will employ word embedding technique to find similar words to those appearing in the given query text which would help to retrieve the most relevant videos w.r.t given query. This will help to reduce the semantic gap and desired videos are expected to be retrieved. We considered 20 key individuals in our study, There are three key components in our study i.e Face recognition, caption generation, and query similarity measure. To recognize persons face appearing in a video we use FaccNet model, and to generate a description of the scene in the video frames we employ an image captioning model. The output of these two models along with the frame and video information is saved in the database. In the retrieval phase, a natural language query is provided to the system, here we usc the concept of word embedding model to find the top five similar words against the provided query. Videos against the matching words and queried individuals are then returned by the system. We conducted the experiments on a collection of 100 videos and promising results are reported. en_US
dc.language.iso en en_US
dc.publisher Bahria University Islamabad Campus en_US
dc.relation.ispartofseries MS (DS);T-8850
dc.subject Computer Science en_US
dc.title Semantic video retrieval using natural language queries en_US
dc.type MS Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account