DSpace Repository

Mining Emerging News from Text Data (T-0694) (MFN 5093)

Show simple item record

dc.contributor.author Sabah Anwar Dar, 01-244132-014
dc.date.accessioned 2017-07-04T06:24:23Z
dc.date.available 2017-07-04T06:24:23Z
dc.date.issued 2015
dc.identifier.uri http://hdl.handle.net/123456789/2069
dc.description Supervised by Dr. Muhammad Muzammal en_US
dc.description.abstract We live in the information age. There is so much information emerging over the internet that it is next to impossible to be able to go through all of it. This work is focused on extracting “interesting” information from the web. As a first step, we assume that newspapers report the most interesting information and thus propose a framework that is able to extract interesting information from the internet using the news feed from news websites. We collect RSS feed from a set of user-specified sources and thus obtain the title of the news from the RSS feed. Next, we remove the insignificant words from the news title and a tokenization procedure transforms the keywords into tokens. These tokens are combined to form sets of items. An itemset mining algorithm is implemented to extract most interesting patterns and a de-tokenization procedure is used to extract the most interesting news. en_US
dc.language.iso en en_US
dc.publisher Software Engineering, Bahria University Engineering School Islamabad en_US
dc.relation.ispartofseries MS SE;T-0694
dc.subject Software Engineering en_US
dc.title Mining Emerging News from Text Data (T-0694) (MFN 5093) en_US
dc.type MS Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account