Abstract:
The process of categorizing text data can be quite tedious. With the growth of text data on
the internet, computer scientists have come up with algorithms related to machine learning
that help in text categorization. In this project I have automated the process of document
classification and indexing by the use of machine learning algorithms. The main aim of the
project is to help users, who have large amounts of text data, in classification and indexing of
documents.