TextClassification: Documentation

What is it?

TextClassification is a library, that makes text classification easily available. It relies on three components which depend on each other and work together:

  • Preprocessor: Reads in data and stores feature vectors and predictions

  • Featurizer: Extracts features out of text data

  • Classifier: Uses extracted features to train a classification model and do inference on unseen instances

Installation

To make use of the TextClassification library, clone the GitHub repository and install from source:

git clone https://github.com/bogdankostic/TextClassification.git
cd TextClassificattion
pip install -r requirements.txt
pip install --editable .

Indices and tables