Classification of Functional and Non-functional Software Requirements from User App Reviews

Aleena Zainab, 01-243231-001

dc.contributor.author	Aleena Zainab, 01-243231-001
dc.date.accessioned	2026-03-04T03:29:14Z
dc.date.available	2026-03-04T03:29:14Z
dc.date.issued	2025
dc.identifier.uri	http://hdl.handle.net/123456789/20823
dc.description	Supervised by Dr. Arif Ur Rahman	en_US
dc.description.abstract	This research addresses the distinction between Functional Software Requirements (FRs) and Non-Functional Software Requirements (NFRs). FRs describe what a software system should do, such as user authentication and administrative tasks, while NFRs describe how the system should perform, covering aspects such as performance, reliability, and security. Classifying these requirements correctly is crucial for developing high-quality software that meets user needs and industry standards. As technology changes, the need for automated classification of user feedback grows, especially for distinguishing clearly stated user requirements from unclear ones. User feedback is difficult to interpret, which can lead to misclassification. To improve the accuracy of requirement extraction, the study applies text preprocessing and NLP techniques: text cleaning, lowercasing, tokenization, stopword removal, lemmatization, duplicate-word removal, handling of missing values, and label encoding. A new method is proposed to automatically extract and classify FRs and NFRs from user app reviews, combining data from Software Requirements Specification (SRS) documents and user feedback. The study uses the PURE dataset (FR NFR dataset), which contains 6117 requirements, of which 3964 are functional and 2153 are non-functional. Among the machine learning algorithms applied to this dataset, the Support Vector Machine (SVM) achieved an accuracy of 86.08%, Stochastic Gradient Descent (SGD) attained 84.92%, and Random Forest (RF) outperformed both with an accuracy of 91%.
The study also employs the deep learning model BERT (Bidirectional Encoder Representations from Transformers), which achieved a precision of 89.5%, an accuracy of 89.4%, a recall of 89.4%, and an F1-score of 89.4% in identifying software requirements. The results are intended to give developers insight into user preferences and to support informed decisions about software improvements.	en_US
dc.language.iso	en	en_US
dc.publisher	Computer Sciences	en_US
dc.relation.ispartofseries	MS(CS);T-3209
dc.subject	Classification of Functional	en_US
dc.subject	Non-functional Software Requirements	en_US
dc.subject	User App Reviews	en_US
dc.title	Classification of Functional and Non-functional Software Requirements from User App Reviews	en_US
dc.type	MS Thesis	en_US
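
The preprocessing steps named in the abstract can be illustrated with a minimal Python sketch. It is not the thesis implementation: the function names `preprocess` and `encode_labels`, the tiny stopword list, and the sample reviews are all hypothetical, and lemmatization is left out because it would need an external library such as NLTK or spaCy.

```python
import re

# Tiny illustrative stopword list (an assumption; real pipelines use a fuller list).
STOPWORDS = {"the", "a", "an", "is", "it", "to", "and", "of", "on", "this"}


def preprocess(text):
    """Clean, lowercase, tokenize, remove stopwords, and remove duplicate words."""
    if text is None:  # handle missing values by returning no tokens
        return []
    text = text.lower()                      # lowercasing
    text = re.sub(r"[^a-z\s]", " ", text)    # text cleaning: keep letters only
    tokens = text.split()                    # whitespace tokenization
    seen = set()
    result = []
    for token in tokens:
        # Skip stopwords and words already kept (duplicate-word removal).
        if token in STOPWORDS or token in seen:
            continue
        seen.add(token)
        result.append(token)
    return result


def encode_labels(labels):
    """Map class names to integers in sorted order, e.g. FR -> 0, NFR -> 1."""
    mapping = {name: index for index, name in enumerate(sorted(set(labels)))}
    return [mapping[label] for label in labels], mapping


# Hypothetical sample data, not taken from the thesis dataset.
reviews = ["The app crashes, crashes and crashes on login!", "Please add a dark mode to the app"]
labels = ["NFR", "FR"]

tokens = [preprocess(review) for review in reviews]
encoded, mapping = encode_labels(labels)
print(tokens)
print(encoded, mapping)
```

The resulting token lists would then be vectorized and passed to a classifier such as SVM, SGD, Random Forest, or BERT; that stage is omitted here because the record does not describe how the features were built.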