Welcome to the Bahria University DSpace digital repository. DSpace is a digital service that collects, preserves, and distributes digital material. Repositories are important tools for preserving an organization's legacy; they facilitate digital preservation and scholarly communication.
dc.contributor.author | Usman Umar, 01-235161-034 | |
dc.contributor.author | Ashhad Taneem, 01-235161-003 | |
dc.date.accessioned | 2021-01-15T07:04:38Z | |
dc.date.available | 2021-01-15T07:04:38Z | |
dc.date.issued | 2020 | |
dc.identifier.uri | http://hdl.handle.net/123456789/10784 | |
dc.description | Supervised by Ms. Momina Mostesum | en_US |
dc.description.abstract | In past few years social media networking has been expanded a lot and because of this it enabled millions of users of all ages to develop and support their personal and professional relations. Although, a common flaw in these digital social websites is that it is easy to provide fake information about your name, age and gender in order to hide one’s identity. It allows the number of criminals such as pedophiles, online Threats, Stalking, Cyberbullying, Buying Illegal things, posting videos of criminal Activities. A vast number of fake profiles on social media makes it almost impossible to make manual analyses. In order to solve the above described problem, we have come up with three different techniques that are tokenization, feature extraction and classification. First when a user will write the comment or a text in the text box the system will process the text i.e. cleaning, tokenizing and removing the punctuations after the cleaning process the text will under go through a text feature extraction technique using TF-IDF algorithm through which the system will extract the features from the given text. After the feature extraction the system will then classify that if the provided text or a comment is written by a female, male or a brand on the bases of trained data. Our experimental results show that the system is generating satisfying results when can be further improved by either adding more data, treat missing and outlier values, feature engineering or algorithm tuning by applying these techniques we can improve the accuracy of the system in coming future. i | en_US |
dc.language.iso | en | en_US |
dc.publisher | Computer Sciences BUIC | en_US |
dc.relation.ispartofseries | BS (IT);MFN-P 9044 | |
dc.subject | Social Media Content | en_US |
dc.title | User demographic prediction from social media content | en_US |
dc.type | Project Reports | en_US |