Multilingual artificial text extraction and script identification from video images (T-0691) (MFN 4014)

Welcome to DSpace BU Repository

Welcome to the Bahria University DSpace digital repository. DSpace is a digital service that collects, preserves, and distributes digital material. Repositories are important tools for preserving an organization's legacy; they facilitate digital preservation and scholarly communication.

Show simple item record

dc.contributor.author Azra Batool, 01-244112-003
dc.date.accessioned 2017-07-27T06:22:25Z
dc.date.available 2017-07-27T06:22:25Z
dc.date.issued 2014
dc.identifier.uri http://hdl.handle.net/123456789/3088
dc.description Supervised by Dr. Imran Ahmed Siddiqui en_US
dc.description.abstract With the tremendous growth in the amount of multimedia data, especially videos, has increased the need for efficient indexing and retrieval techniques. In addition to the audio-visual content itself, a power tool that be employed for indexing of videos is the caption text appearing in them. An important component of textual content based video indexing and retrieval systems is the detection and extraction of text from video frames. Most of the existing text extraction system target textual occurrences in a particular script or language. We have proposed a generic multilingual text extraction system that relies on a combination of unsupervised and supervised techniques. The unsupervised approach is based on application of image analysis techniques which exploit the contrast, alignment and geometrical properties of text and identify candidate text regions in an image. Potential text regions are then validated by an Artificial Neural Network (ANN) using a set of features computed from Gray Level Co-occurrence Matrices (GLCM). Detected text regions are then binarized to segment text from the background. The script of the extracted text is finally identified using texture based features based on Local Binary Patterns (LBP). The proposed system was evaluated on video images containing textual occurrences in five different languages including English, Urdu, Hindi, Chinese and Arabic. The promising results of the experimental evaluations validate the effectiveness of the proposed system for text extraction and script identification. en_US
dc.language.iso en en_US
dc.publisher Software Engineering, Bahria University Engineering School Islamabad en_US
dc.relation.ispartofseries MS SE;T-0691
dc.subject Software Engineering en_US
dc.title Multilingual artificial text extraction and script identification from video images (T-0691) (MFN 4014) en_US
dc.type MS Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account