Skip to content

leovidith/DisasterDetection-NLP

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 

Repository files navigation

Disaster Classification - NLP

This project demonstrates how to classify text data into "disaster" or "not disaster" categories using a Naive Bayes classifier with TF-IDF vectorization. The goal is to predict whether a given sentence is related to a disaster or not based on historical data.

Overview

This project applies natural language processing (NLP) techniques to classify text into two categories: disaster and non-disaster. By training a machine learning model with historical data, the model can predict whether new input sentences relate to disasters or not.

Features

  • Model: Naive Bayes classifier
  • Vectorization: TF-IDF for feature extraction
  • Task: Binary classification of disaster-related sentences
  • Framework: scikit-learn, pandas, numpy

Results

The following visualizations provide insights into the dataset:

Agile Sprints

Sprint 1: Data Preprocessing

  • Task: Clean and preprocess the text data, including tokenization, removing stopwords, and vectorization using TF-IDF.
  • Deliverable: Preprocessed dataset ready for model training.

Sprint 2: Model Training

  • Task: Train a Naive Bayes classifier using the preprocessed data.
  • Deliverable: A trained model capable of classifying sentences as "Disaster" or "Not Disaster."

Sprint 3: Model Evaluation

  • Task: Evaluate the model using accuracy, precision, recall, and F1 score.
  • Deliverable: Performance metrics that indicate the effectiveness of the model.

Sprint 4: Optimization and Testing

  • Task: Fine-tune the model, optimize hyperparameters, and test it on new data.
  • Deliverable: An optimized model ready for deployment.

Conclusion

This project successfully applies NLP techniques to classify text into disaster-related categories. The model performs well with balanced precision and recall, achieving an accuracy of 81.10%. Future improvements could include exploring other algorithms and enhancing the feature engineering process.

About

No description or website provided.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published