Feature Extraction for Cursive Language Document Images

Welcome to DSpace BU Repository

Welcome to the Bahria University DSpace digital repository. DSpace is a digital service that collects, preserves, and distributes digital material. Repositories are important tools for preserving an organization's legacy; they facilitate digital preservation and scholarly communication.

Show simple item record

dc.contributor.author Maria Siddiqui
dc.contributor.author Imran Siddiqi
dc.contributor.author Khurram Khurshid
dc.date.accessioned 2018-11-29T12:10:13Z
dc.date.available 2018-11-29T12:10:13Z
dc.date.issued 2018
dc.identifier.uri http://hdl.handle.net/123456789/7762
dc.description.abstract The efficiency of any machine learning and computer vision system depends largely on the robustness of feature extraction and selection process. In word spotting applications, many appropriate features have been proposed over the years in literature. Most of these features are extracted for Latin text but are used with Oriental script as well. Extracting features that are more specific to Oriental text is also being investigated and a lot of research is being focused on this aspect lately as well. Deep Learning has also been employed for this purpose. In this paper, we have tried investigate the performance of shape based features for Urdu script. Urdu and Arabic belong to the same family of script and both share similar set of alphabet. This means that features investigated on Urdu will give similar performance for Arabic as well as other Oriental scripts. For this paper, we have compiled results on approximately 21000 ligatures belonging to 200 unique classes taken from scanned pages of the popular Urdu series ‘Zaawiyya’. This is Higher Education Commission granted project, due to this data set is provided by them. Proposed system gives encouraging results with precision of 88.5% and recall rate of 90.8%. en_US
dc.language.iso en en_US
dc.publisher Bahria University Islamabad Campus en_US
dc.relation.ispartofseries ;doi.org/10.1145/3177148.3180099
dc.subject Department of Computer Science CS en_US
dc.title Feature Extraction for Cursive Language Document Images en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account