🐦 Tweet Sentiment Analyzer

A Streamlit web application to predict the sentiment of tweets. This project uses a Logistic Regression model with TF-IDF features to classify tweets into Positive, Neutral, or Negative categories. It also displays the probabilities for each class in an interactive bar chart.
Data link : https://www.kaggle.com/datasets/jp797498e/twitter-entity-sentiment-analysis

🔹 Features

Preprocess tweets: remove stopwords, URLs, mentions, hashtags, and punctuation
Lemmatization for better text normalization
TF-IDF vectorizer for feature extraction
Logistic Regression model trained on Twitter dataset
Color-coded sentiment output in Streamlit
Easy-to-use web interface

📂 Repository Contents

File	Description
`app.py`	Streamlit web application
`logistic_model.pkl`	Trained Logistic Regression model
`tfidf_vectorizer.pkl`	Pickled TF-IDF vectorizer
`requirement.txt`	Python dependencies
`twitter_training.csv`	Training dataset
`sentiment_analysis.ipynb`	Notebook with data preprocessing and model training
`README.md`	Project description and instructions

💻 Installation

Clone the repository

git clone https://github.com/MohamedAli1937/Sentiment-Analysis-Web-App.git

⚙️ Install dependencies

pip install -r requirements.txt

🎮 Run the Streamlit app

streamlit run app.py

🤔 Prediction Function

def predict_sentiment(text):
    clean_text = clean_tweet_stopword_lemmatize(text)  # your cleaning + lemmatization function
    vectorized = vectorizer.transform([clean_text])
    return lr_model.predict(vectorized)[0]

🧠 How It Works

Preprocessing:

Lowercasing, removing URLs, mentions, hashtags, punctuation
Stopwords removal and lemmatization

Feature Extraction:

TF-IDF converts text to numerical vectors

Model:

Logistic Regression predicts sentiment class

Output:

Sentiment class (Positive/Neutral/Negative)

🚀 Future Improvements

Better Models

Use DistilBERT or RoBERTa for more accurate predictions
Deep learning models capture context better than Logistic Regression

Emotion Detection

Expand beyond Positive/Neutral/Negative
Detect specific emotions: Happy, Sad, Angry, Fear, Surprise, etc.
Use libraries like NRCLex or train multi-class classifiers

Data Enhancements

Add more neutral tweets to improve model balance
Include tweets in multiple languages

UI/UX Improvements

Show word clouds for positive/negative words
Display historical sentiment trends from multiple tweets
Add interactive charts for probabilities

Deployment

Deploy online via Streamlit Cloud, Heroku, or AWS
Make a public demo for users to try

⚠️ Known Limitations

The current Logistic Regression model sometimes misclassifies neutral tweets as positive or negative.
This happens because the training dataset has fewer neutral examples, making the model biased toward positive/negative sentiment.
Probabilities for neutral predictions may be less reliable compared to positive or negative.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🐦 Tweet Sentiment Analyzer

🔹 Features

📂 Repository Contents

💻 Installation

⚙️ Install dependencies

🎮 Run the Streamlit app

🤔 Prediction Function

🧠 How It Works

🚀 Future Improvements

⚠️ Known Limitations

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
README.md		README.md
app.py		app.py
logistic_model.pkl		logistic_model.pkl
requirements.txt		requirements.txt
sentiment_analysis.ipynb		sentiment_analysis.ipynb
tfidf_vectorizer.pkl		tfidf_vectorizer.pkl
twitter_training.csv		twitter_training.csv

Folders and files

Latest commit

History

Repository files navigation

🐦 Tweet Sentiment Analyzer

🔹 Features

📂 Repository Contents

💻 Installation

⚙️ Install dependencies

🎮 Run the Streamlit app

🤔 Prediction Function

🧠 How It Works

🚀 Future Improvements

⚠️ Known Limitations

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages