Welcome to the Bahria University DSpace digital repository. DSpace is a digital service that collects, preserves, and distributes digital material. Repositories are important tools for preserving an organization's legacy; they facilitate digital preservation and scholarly communication.
dc.contributor.author | Israr Uddin | |
dc.contributor.author | Imran Siddiqi | |
dc.contributor.author | Shehzad Khalid | |
dc.date.accessioned | 2018-09-24T10:36:36Z | |
dc.date.available | 2018-09-24T10:36:36Z | |
dc.date.issued | 2017 | |
dc.identifier.uri | http://hdl.handle.net/123456789/7471 | |
dc.description.abstract | Optical Character Recognition (OCR) is one of the continuously explored problems. Presently, commercial character recognizers are available reporting near to 100% recognition rates on text in a number of scripts. Despite these advancements, OCR systems however, have yet to mature for cursive scripts like Urdu. This study presents a holistic technique for recognition of Urdu text in Nastaliq font using ‘complete’ ligatures as recognition units. The term ‘complete’ refers to a partial word including its main body and secondary components (dots and diacritic marks). Discrete Wavelet Transform (DWT) is employed as feature extractor while a separate Hidden Markov Model (HMM) is trained for each ligature considered in our study. More than 2000 frequently used unique Urdu ligatures from the standard CLE (Center of Language Engineering) dataset are considered in our evaluations. The system reads a promising accuracy of 88.87% on more than 10,000 partial words. | en_US |
dc.language.iso | en | en_US |
dc.publisher | Bahria University Islamabad Campus | en_US |
dc.subject | Department of Computer Science CS | en_US |
dc.title | A Holistic Approach for Recognition of Complete Urdu Ligatures using Hidden Markov Models | en_US |
dc.type | Article | en_US |