Pneumonia Detection and Predictions

This project is an end-to-end deep learning solution for detecting pneumonia from chest X-ray images. It features a custom 10-layer Convolutional Neural Network (CNN) and an interactive web application built with Flask. The application uses Explainable AI (XAI) techniques, specifically Grad-CAM, to generate heatmaps that visually highlight areas of interest in the X-ray, providing transparency for the model's predictions.

🚀 Key Features

Interactive Web Application: A user-friendly interface built with Flask to upload images and receive instant predictions.

Explainable AI (XAI): Generates Grad-CAM heatmaps and bounding boxes on predicted pneumonia cases to explain the model's reasoning.

Custom CNN Architecture: A 10-layer CNN designed for image classification, compatible with CAM visualization techniques.

Reproducible ML Pipeline: Includes scripts for data preprocessing, model training, and evaluation.

Balanced Dataset: The preprocessing script balances the dataset to prevent model bias towards a specific class.

🛠️ Setup and Installation

Follow these steps to set up the project environment.

Clone the repository and change into the project directory: git clone <repository-url>, then cd <repository-directory>

Create a raw_data folder and place the raw data from Kaggle's RSNA dataset in it.

Create and activate a virtual environment (recommended): python -m venv venv (or create an equivalent conda environment)

On Windows

venv\Scripts\activate

On macOS/Linux

source venv/bin/activate

Install the dependencies: pip install -r requirements.txt

⚙️ Usage

Data Preprocessing

Before training, you must process the raw DICOM images into PNGs and split them into train/validation/test sets. Place your raw data (stage_2_train_labels.csv and stage_2_train_images directory) into a raw_data folder as specified in config.py.

python data_preprocessing.py

This will create a data directory with the processed images.
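The conversion step can be pictured roughly as follows. This is a minimal sketch, not the code in data_preprocessing.py: the function names, the min-max scaling to 8-bit, and the INTER_AREA interpolation are all assumptions, and it relies only on pydicom and OpenCV from the technology stack.

```python
import numpy as np


def to_uint8(pixels: np.ndarray) -> np.ndarray:
    """Min-max scale a pixel array to the 0-255 range (assumed normalization)."""
    pixels = pixels.astype(np.float32)
    lo, hi = float(pixels.min()), float(pixels.max())
    if hi == lo:  # constant image: avoid division by zero
        return np.zeros(pixels.shape, dtype=np.uint8)
    return ((pixels - lo) / (hi - lo) * 255.0).round().astype(np.uint8)


def dicom_to_png(dicom_path: str, png_path: str, size: int = 128) -> None:
    """Read one DICOM file, resize it to size x size, and write it as a PNG."""
    # Imported lazily so to_uint8 can be used without these packages installed.
    import cv2
    import pydicom

    ds = pydicom.dcmread(dicom_path)
    image = to_uint8(ds.pixel_array)
    image = cv2.resize(image, (size, size), interpolation=cv2.INTER_AREA)
    cv2.imwrite(png_path, image)
```

Calling dicom_to_png once per file under stage_2_train_images would produce the PNGs; where each one is written depends on the split it is assigned to.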

Model Training

To train the model, run the training script. The best model weights will be saved in the saved_models directory.

python train.py

Model Evaluation

To evaluate the trained model on the test set and see a sample CAM visualization, run the evaluation script.

python evaluate.py
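For intuition about the CAM visualization, here is a small NumPy sketch of a classic class activation map. It is an illustration under assumptions, not the repository's Grad-CAM code: for a network that ends in global average pooling followed by a single Dense layer, Grad-CAM and classic CAM agree up to a scale factor, so the simpler form is shown. The function names and the 0.5 threshold are hypothetical.

```python
import numpy as np


def class_activation_map(feature_maps, dense_weights, class_index):
    """Weight the last conv feature maps by the Dense weights of one class.

    feature_maps: (H, W, K) activations of the last convolutional layer.
    dense_weights: (K, num_classes) kernel of the final Dense layer.
    """
    cam = feature_maps @ dense_weights[:, class_index]  # shape (H, W)
    cam = np.maximum(cam, 0.0)  # keep only positive evidence for the class
    peak = cam.max()
    return cam / peak if peak > 0 else cam


def bounding_box(cam, threshold=0.5):
    """Return (row_min, col_min, row_max, col_max) of cells >= threshold, or None."""
    rows, cols = np.where(cam >= threshold)
    if rows.size == 0:
        return None
    return int(rows.min()), int(cols.min()), int(rows.max()), int(cols.max())
```

The map has the spatial resolution of the last convolutional layer, so it would need to be upscaled to the input size before being overlaid on the X-ray or used to draw a box.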

Running the Web Application

To start the interactive web application, run the Flask app.

python app.py

Navigate to http://127.0.0.1:5000 in your web browser to upload an image and see the prediction.
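As a rough picture of the upload flow, the sketch below shows a minimal Flask endpoint that accepts an image and returns a JSON result. It is not the contents of app.py: the /predict route, the "file" form field, and the predict_image stub (which returns a fixed placeholder instead of running the model) are all hypothetical.

```python
from flask import Flask, jsonify, request

app = Flask(__name__)


def predict_image(image_bytes: bytes) -> dict:
    """Stand-in for model inference; returns a fixed placeholder result."""
    return {"label": "placeholder", "confidence": None}


@app.route("/predict", methods=["POST"])
def predict():
    # Reject requests that carry no uploaded file under the "file" field.
    uploaded = request.files.get("file")
    if uploaded is None or uploaded.filename == "":
        return jsonify(error="no file uploaded"), 400
    return jsonify(predict_image(uploaded.read()))
```

Calling app.run() on such an app serves it on 127.0.0.1:5000 by default, which matches the address given above.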

🧠 Technical Details

Data Preparation: The pipeline reads DICOM images, filters for "Pneumonia" and "Healthy" cases, and balances the classes to ensure an equal number of samples. Images are resized to 128x128 pixels. The data is split into 70% for training, 20% for validation, and 10% for testing.
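The balancing and 70/20/10 split can be sketched with the standard library alone. This assumes balancing is done by undersampling the larger class and that the split is applied per class; the source does not say which method the preprocessing script uses, and the function name is hypothetical.

```python
import random


def balance_and_split(samples, seed=42, fractions=(0.7, 0.2, 0.1)):
    """Undersample to equal class counts, then split into train/val/test.

    samples: list of (image_id, label) pairs.
    """
    rng = random.Random(seed)
    by_label = {}
    for image_id, label in samples:
        by_label.setdefault(label, []).append((image_id, label))

    # Balance: keep as many samples per class as the smallest class has.
    n_per_class = min(len(items) for items in by_label.values())
    n_train = round(n_per_class * fractions[0])
    n_val = round(n_per_class * fractions[1])

    splits = {"train": [], "val": [], "test": []}
    for items in by_label.values():
        kept = rng.sample(items, n_per_class)  # random subset, random order
        splits["train"] += kept[:n_train]
        splits["val"] += kept[n_train:n_train + n_val]
        splits["test"] += kept[n_train + n_val:]  # remainder goes to test
    return splits
```

Splitting each class separately keeps all three subsets balanced, not just the dataset as a whole.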

Model Architecture: The project uses a custom 10-layer CNN. It consists of four convolutional blocks with 3x3 filters and MaxPooling2D layers. A GlobalAveragePooling2D layer is used before the final Dense layer, which makes the model compatible with CAM-based visualizations.
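One way to arrive at ten layers from this description is four Conv2D layers, four MaxPooling2D layers, one GlobalAveragePooling2D layer, and one Dense layer. The Keras sketch below follows that reading. The filter counts, "same" padding, ReLU activations, single-channel input, and two-class softmax output are assumptions, not values taken from the repository.

```python
from tensorflow import keras
from tensorflow.keras import layers


def build_cnn(input_shape=(128, 128, 1), num_classes=2):
    """Four Conv2D + MaxPooling2D blocks, then GAP and a softmax Dense layer."""
    model = keras.Sequential(name="pneumonia_cnn_sketch")
    model.add(keras.Input(shape=input_shape))
    for filters in (32, 64, 128, 256):  # filter counts are assumed
        model.add(layers.Conv2D(filters, (3, 3), padding="same", activation="relu"))
        model.add(layers.MaxPooling2D((2, 2)))
    # GAP reduces each feature map to one value, so every Dense weight maps
    # to exactly one feature map; this is what CAM-style heatmaps rely on.
    model.add(layers.GlobalAveragePooling2D())
    model.add(layers.Dense(num_classes, activation="softmax"))
    return model
```

With 128x128 inputs and four 2x2 pooling steps, the last feature maps in this sketch are 8x8, which sets the native resolution of any CAM heatmap.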

Training: The model is trained for up to 20 epochs using the Adam optimizer with a learning rate of 0.0001 and categorical_crossentropy as the loss function. Two callbacks are used: EarlyStopping stops training early to limit overfitting, and ModelCheckpoint saves the best model.
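The training configuration described here might look like the following sketch. The optimizer, learning rate, loss, and callback classes come from the description above. The monitored metric, the patience value, the accuracy metric, and the checkpoint file name are assumptions, as is the helper name compile_for_training.

```python
from tensorflow import keras


def compile_for_training(model, checkpoint_path="saved_models/best_model.keras"):
    """Compile the model and return the callbacks to pass to model.fit."""
    model.compile(
        optimizer=keras.optimizers.Adam(learning_rate=1e-4),
        loss="categorical_crossentropy",  # expects one-hot encoded labels
        metrics=["accuracy"],
    )
    return [
        # Stop when validation loss stops improving (patience is assumed).
        keras.callbacks.EarlyStopping(
            monitor="val_loss", patience=5, restore_best_weights=True
        ),
        # Keep only the checkpoint with the best validation loss.
        keras.callbacks.ModelCheckpoint(
            checkpoint_path, monitor="val_loss", save_best_only=True
        ),
    ]


# Usage (train_data and val_data are placeholders for the prepared datasets):
# callbacks = compile_for_training(model)
# model.fit(train_data, validation_data=val_data, epochs=20, callbacks=callbacks)
```

Because EarlyStopping can end the run before epoch 20, the epoch count acts as an upper bound, which matches the "up to 20 epochs" wording.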

💻 Technology Stack

Backend: Flask

Deep Learning: TensorFlow (Keras)

Data Manipulation: Pandas, NumPy

Image Processing: OpenCV, Pydicom

Machine Learning: Scikit-learn

Amfer7/Pnuemonia_Localization
