This project implements a BERT-based model for text classification using TensorFlow and HuggingFace Transformers. BERT (Bidirectional Encoder Representations from Transformers) is a pre-trained transformer model developed by Google that improves natural language understanding by capturing context from both directions (left-to-right and right-to-left) in text.
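For reference, the snippet below is a minimal sketch of loading a pre-trained BERT backbone with HuggingFace Transformers and TensorFlow. The `bert-base-uncased` checkpoint is an assumption; the project may pin a different one.

```python
# Minimal sketch: load a pre-trained BERT checkpoint (assumed: bert-base-uncased).
from transformers import BertTokenizer, TFBertModel

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
bert = TFBertModel.from_pretrained("bert-base-uncased")

# BERT encodes each token with context from both directions at once.
inputs = tokenizer("BERT reads text bidirectionally.", return_tensors="tf")
outputs = bert(inputs)
print(outputs.last_hidden_state.shape)  # (1, sequence_length, 768)
```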
The following confusion matrix shows the performance of the BERT model in classifying text into the different categories:

[Figure: confusion matrix of predicted vs. true labels]
Preprocessing Steps (sketched in code after this list):
- Tokenization: Splits text into individual words or subwords.
- Lowercasing: Converts all text to lowercase.
- Removing Punctuation: Strips unnecessary punctuation marks.
- Stopword Removal: Removes common but unimportant words (e.g., "the", "is").
- Stemming/Lemmatization: Reduces words to their root or base form.
- Padding: Ensures uniform sequence length.
- Truncation: Shortens text sequences exceeding the maximum length.
- Encoding: Converts text into numerical format (e.g., word indices or embeddings).
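The sketch below walks through these steps, assuming NLTK for stopword removal and lemmatization and the HuggingFace BERT tokenizer for padding, truncation, and encoding; the library choices and `max_length=128` are illustrative assumptions, not taken from the project.

```python
# Hedged sketch of the preprocessing pipeline (assumed libraries: NLTK, transformers).
import re

import nltk
from nltk.corpus import stopwords
from nltk.stem import WordNetLemmatizer
from transformers import BertTokenizer

nltk.download("stopwords")
nltk.download("wordnet")

stop_words = set(stopwords.words("english"))
lemmatizer = WordNetLemmatizer()
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

def clean_text(text: str) -> str:
    text = text.lower()                                  # lowercasing
    text = re.sub(r"[^\w\s]", "", text)                  # remove punctuation
    words = text.split()                                 # simple word tokenization
    words = [w for w in words if w not in stop_words]    # stopword removal
    words = [lemmatizer.lemmatize(w) for w in words]     # lemmatization
    return " ".join(words)

# Padding, truncation, and numeric encoding via the BERT tokenizer:
encoded = tokenizer(
    [clean_text(t) for t in ["This movie is great!", "The plot was dull."]],
    padding="max_length",  # pad every sequence to the same length
    truncation=True,       # shorten sequences exceeding max_length
    max_length=128,        # assumed maximum sequence length
    return_tensors="tf",
)
print(encoded["input_ids"].shape)  # (2, 128)
```

One design note: BERT's WordPiece tokenizer is trained on raw text, so many BERT pipelines skip stopword removal and lemmatization; they are included here because the project lists them.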
Model Architecture (sketched in code after this list):
- BERT Layer: A pre-trained BERT transformer for feature extraction.
- Dense Layer: Classifies the extracted features into three categories (positive, negative, neutral).
- Dropout Layer: Helps prevent overfitting by randomly zeroing a fraction of the extracted features during training.
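Here is a minimal Keras sketch of this stack, assuming the BERT layer is frozen (pure feature extraction, consistent with the small trainable-parameter count below) and that dropout sits between BERT's pooled output and the dense head; the dropout rate and sequence length are illustrative.

```python
# Hedged sketch of the model: frozen BERT encoder + dropout + Dense(3) softmax head.
import tensorflow as tf
from transformers import TFBertModel

MAX_LEN = 128  # assumed maximum sequence length

bert = TFBertModel.from_pretrained("bert-base-uncased")
bert.trainable = False  # feature extraction only; only the head is trained

input_ids = tf.keras.Input(shape=(MAX_LEN,), dtype=tf.int32, name="input_ids")
attention_mask = tf.keras.Input(shape=(MAX_LEN,), dtype=tf.int32, name="attention_mask")

features = bert(input_ids, attention_mask=attention_mask).pooler_output  # (batch, 768)
x = tf.keras.layers.Dropout(0.2)(features)                    # assumed dropout rate
outputs = tf.keras.layers.Dense(3, activation="softmax")(x)   # positive/negative/neutral

model = tf.keras.Model(inputs=[input_ids, attention_mask], outputs=outputs)
model.summary()
```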
Total trainable parameters: 2,307 (9.01 KB). This count corresponds exactly to a single Dense layer mapping BERT's 768-dimensional pooled output to 3 classes (768 × 3 weights + 3 biases = 2,307), which implies the pre-trained BERT layer itself is frozen and contributes no trainable parameters.
Project Milestones:
- Implement preprocessing steps like tokenization, lowercasing, stopword removal, and padding.
- Deliverable: A cleaned and tokenized dataset ready for training.
- Design the BERT-based model for text classification.
- Deliverable: A complete model architecture with pre-trained BERT and a classification head.
- Train the BERT model using the processed dataset and evaluate performance.
- Deliverable: A trained model with evaluation metrics like accuracy, precision, recall, and F1 score.
- Evaluate the model using a classification report and confusion matrix (see the training and evaluation sketch after this list).
- Deliverable: Performance metrics to assess the effectiveness of the trained model.
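Continuing from the model sketch above, the following is a rough sketch of the training and evaluation milestones. The dataset variables (`train_encodings`, `train_labels`, `test_encodings`, `test_labels`), the label ordering, and all hyperparameters are placeholders, not values from the project.

```python
# Hedged sketch of training and evaluation; assumes `model` from the architecture
# sketch above and pre-tokenized data in placeholder variables.
import numpy as np
import tensorflow as tf
from sklearn.metrics import classification_report, confusion_matrix

model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=2e-5),  # assumed learning rate
    loss="sparse_categorical_crossentropy",                  # integer class labels
    metrics=["accuracy"],
)

model.fit(
    {"input_ids": train_encodings["input_ids"],
     "attention_mask": train_encodings["attention_mask"]},
    train_labels,
    validation_split=0.1,  # assumed split
    epochs=3,              # assumed epoch count
    batch_size=32,         # assumed batch size
)

# Classification report and confusion matrix on held-out data:
pred_probs = model.predict(
    {"input_ids": test_encodings["input_ids"],
     "attention_mask": test_encodings["attention_mask"]}
)
pred_labels = np.argmax(pred_probs, axis=1)
# Label order (negative=0, neutral=1, positive=2) is an assumption.
print(classification_report(test_labels, pred_labels,
                            target_names=["negative", "neutral", "positive"]))
print(confusion_matrix(test_labels, pred_labels))
```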
The BERT-based classifier assigns text to the three predefined categories, but at 51% accuracy with roughly balanced precision and recall across categories, it is best treated as a modest baseline rather than a finished model. Future improvements could include unfreezing and fine-tuning the BERT layers on the task data, training on more examples, or experimenting with other transformer-based architectures.