Transformer-Based Text Classifier

Introduction

This project implements a text classifier built on an encoder-only Transformer model. It is trained on the AG_NEWS dataset, a collection of news articles labeled with one of four categories:

World
Sports
Business
Science/Technology

Given the text of a news article, the model predicts which of these four categories it belongs to.
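
As a small illustration of the four-way label space, the sketch below maps a predicted class index to its category name. It assumes zero-based indices in the order listed above; the original AG_NEWS CSV files number the classes 1 to 4, so the offset used in this project may differ. `CATEGORIES` and `label_to_name` are hypothetical names, not taken from the repository.

```python
# Hypothetical helper: the names and the index order are assumptions
# made for illustration, not code from this repository.
CATEGORIES = ["World", "Sports", "Business", "Science/Technology"]


def label_to_name(index: int, one_based: bool = False) -> str:
    """Return the category name for a predicted class index."""
    if one_based:
        index -= 1  # shift 1..4 labels down to 0..3
    if not 0 <= index < len(CATEGORIES):
        raise ValueError(f"class index out of range: {index}")
    return CATEGORIES[index]


print(label_to_name(1))                  # Sports
print(label_to_name(4, one_based=True))  # Science/Technology
```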

Installation

Follow these steps to set up the project:

Clone the repository:

```
git clone https://github.com/Red-RobinHood/Text-Classifier.git
cd Text-Classifier
```

Install the required dependencies:
```
pip install -r requirements.txt
```
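Ensure that you are using Python 3.8 or higher. Note that the repository's file listing shows the dependency file as requirments.txt; if the command above cannot find the file, use that spelling instead.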

Usage

1. Using the Pretrained Model

To use the pretrained model for text classification, follow these steps:

Change the custominput flag in the model.py file (on line 368):
- Set it to True to give input via the command line interface (CLI).
- Set it to False to use the validation data from the val.csv file.

Once the flag is set, run the script:

```
python model.py
```

This will use the pretrained model to classify news articles.
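
The two input modes described above could be wired up roughly as follows. This is only a sketch of the idea, not the code in model.py: the `read_texts` function, its parameters, and the assumption that the article text sits in the last CSV column are all hypothetical.

```python
import csv
from typing import Iterable, Iterator


def read_texts(custom_input: bool,
               val_path: str = "val.csv",
               cli_lines: Iterable[str] = ()) -> Iterator[str]:
    """Yield texts to classify, either from CLI lines or from a CSV file."""
    if custom_input:
        # CLI mode: one article per non-empty input line.
        for line in cli_lines:
            line = line.strip()
            if line:
                yield line
    else:
        # Validation mode: read rows from the CSV file.
        with open(val_path, newline="", encoding="utf-8") as f:
            for row in csv.reader(f):
                if row:
                    yield row[-1]  # assumption: text is the last column
```

With `custom_input=True`, passing `sys.stdin` as `cli_lines` would read articles typed at the terminal; with `custom_input=False`, the texts come from the validation file.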

2. Training on a Custom Dataset

To train the model on your own dataset:

Delete the existing weights file from the weights folder or change the model_name parameter.
Add your custom dataset to the appropriate subfolder within the Dataset folder.
After this, run the training script to start training on your dataset.
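
The first step suggests that the script reuses saved weights whenever it finds them, which is why removing the file or picking a new model_name leads to fresh training. A minimal sketch of that kind of check, assuming one weights file per model name; the `weights_path` and `should_train` helpers, the `.pt` extension, and the directory layout are hypothetical and may not match the repository:

```python
from pathlib import Path


def weights_path(weights_dir: str, model_name: str) -> Path:
    """Build the (assumed) location of the weights file for a model name."""
    return Path(weights_dir) / f"{model_name}.pt"


def should_train(weights_dir: str, model_name: str) -> bool:
    """Train from scratch only when no saved weights exist for this name."""
    return not weights_path(weights_dir, model_name).exists()
```

Under this assumption, `should_train` returns True both after the old weights file is deleted and when model_name is changed to a name with no saved file.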

Acknowledgments

• This project uses the AG_NEWS dataset for training and validation.

• The approach is inspired by research on Transformer architectures, particularly for text classification tasks.

Feel free to contribute by opening issues or submitting pull requests!
