This project applies natural language processing (NLP) techniques to classify text as "disaster" or "not disaster" using a Naive Bayes classifier with TF-IDF vectorization. Trained on historical labeled data, the model predicts whether a new input sentence refers to a disaster.
- Model: Naive Bayes classifier
- Vectorization: TF-IDF for feature extraction
- Task: Binary classification of disaster-related sentences
- Libraries: scikit-learn, pandas, numpy
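Under these assumptions, the core setup can be sketched as a scikit-learn pipeline. The toy sentences and labels below are illustrative placeholders, not the project's actual dataset:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Tiny illustrative corpus; the real project trains on historical data.
texts = [
    "Forest fire near the town",
    "Earthquake shakes the city, buildings collapsed",
    "I love this sunny weather",
    "Just watched a great movie tonight",
]
labels = [1, 1, 0, 0]  # 1 = disaster, 0 = not disaster

# TF-IDF turns each sentence into a weighted term-frequency vector;
# Multinomial Naive Bayes is a standard fit for such count-like features.
model = make_pipeline(TfidfVectorizer(), MultinomialNB())
model.fit(texts, labels)

print(model.predict(["Massive earthquake reported downtown"])[0])
```

Chaining the vectorizer and classifier in one pipeline keeps the vocabulary learned during `fit` consistent at prediction time.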
The project is organized into the following phases:
- Task: Clean and preprocess the text data, including tokenization, removing stopwords, and vectorization using TF-IDF.
- Deliverable: Preprocessed dataset ready for model training.
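One way to realize this step with scikit-learn, where `TfidfVectorizer` handles tokenization, lowercasing, stopword removal, and TF-IDF weighting in a single object (the parameter choices and sample sentences here are illustrative assumptions, not the project's exact settings):

```python
from sklearn.feature_extraction.text import TfidfVectorizer

texts = [
    "Flooding has closed the main highway",
    "The flooding is getting worse downtown",
    "Nothing beats a quiet evening at home",
]

# Tokenize, lowercase, drop English stopwords, and apply TF-IDF weighting.
vectorizer = TfidfVectorizer(stop_words="english", lowercase=True)
X = vectorizer.fit_transform(texts)

print(X.shape)                         # (n_documents, vocabulary_size)
print(sorted(vectorizer.vocabulary_))  # stopwords like "the" are gone
```

The result is a sparse document-term matrix ready to feed into the classifier.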
- Task: Train a Naive Bayes classifier using the preprocessed data.
- Deliverable: A trained model capable of classifying sentences as "Disaster" or "Not Disaster."
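A minimal training sketch under those assumptions; the split ratio and toy corpus are placeholders, and the vectorizer is fit on the training split only so no test-set information leaks into the vocabulary:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import train_test_split
from sklearn.naive_bayes import MultinomialNB

texts = ["Wildfire spreading fast", "Hurricane warning issued",
         "Storm damaged several homes", "Flood waters rising quickly",
         "Enjoying coffee with friends", "New album drops on Friday",
         "Great game last night", "Trying a new pasta recipe"]
labels = [1, 1, 1, 1, 0, 0, 0, 0]  # 1 = disaster, 0 = not disaster

X_train, X_test, y_train, y_test = train_test_split(
    texts, labels, test_size=0.25, random_state=42, stratify=labels)

# Fit the vectorizer on training text only, then train the classifier.
vectorizer = TfidfVectorizer()
clf = MultinomialNB()
clf.fit(vectorizer.fit_transform(X_train), y_train)

# Transform (not fit) the held-out text with the same vocabulary.
preds = clf.predict(vectorizer.transform(X_test))
print(preds)
```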
- Task: Evaluate the model using accuracy, precision, recall, and F1 score.
- Deliverable: Performance metrics that indicate the effectiveness of the model.
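These four metrics are all available in `sklearn.metrics`. The label vectors below are hypothetical, purely to show the calls and what each score measures:

```python
from sklearn.metrics import (accuracy_score, f1_score,
                             precision_score, recall_score)

# Hypothetical ground truth and model predictions (1 = disaster).
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 0, 1, 1, 0]

print(accuracy_score(y_true, y_pred))   # fraction of correct predictions
print(precision_score(y_true, y_pred))  # of predicted disasters, how many were real
print(recall_score(y_true, y_pred))     # of real disasters, how many were caught
print(f1_score(y_true, y_pred))         # harmonic mean of precision and recall
```

With 3 true positives, 1 false positive, and 1 false negative, all four scores come out to 0.75 here.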
- Task: Fine-tune the model, optimize hyperparameters, and test it on new data.
- Deliverable: An optimized model ready for deployment.
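Hyperparameter tuning can be sketched with `GridSearchCV` over a combined pipeline. The grid below (n-gram range and the Naive Bayes smoothing parameter `alpha`) and the tiny corpus are illustrative assumptions, not the project's actual search space:

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import GridSearchCV
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import Pipeline

texts = ["Wildfire spreading fast", "Hurricane warning issued",
         "Storm damaged several homes", "Flood waters rising quickly",
         "Enjoying coffee with friends", "New album drops on Friday",
         "Great game last night", "Trying a new pasta recipe"]
labels = [1, 1, 1, 1, 0, 0, 0, 0]

pipe = Pipeline([("tfidf", TfidfVectorizer()), ("nb", MultinomialNB())])
param_grid = {
    "tfidf__ngram_range": [(1, 1), (1, 2)],  # unigrams vs. uni+bigrams
    "nb__alpha": [0.1, 0.5, 1.0],            # Laplace smoothing strength
}

# Cross-validated search over all grid combinations, scored by F1.
search = GridSearchCV(pipe, param_grid, cv=2, scoring="f1")
search.fit(texts, labels)
print(search.best_params_)
```

`search.best_estimator_` is then the refit, optimized model that could be saved for deployment.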
This project successfully applies NLP techniques to classify sentences as disaster-related or not. The model achieves an accuracy of 81.10% with balanced precision and recall. Future improvements could include exploring other algorithms and enhancing the feature engineering process.