Student Depression Classifier

Project Overview

Aim: Develop and tune machine learning models that predict the likelihood of depression in students. The project tunes hyperparameters and compares models using F1-scores and cross-validation to identify the most robust predictive model.
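As a reminder of what the F1-score measures, here is a minimal Python sketch that computes it from true and predicted labels. It is an illustration only, not the project's code: the function name `f1_score`, the `positive` parameter, and the assumption that the depressed class is encoded as 1 are all hypothetical.

```python
def f1_score(y_true, y_pred, positive=1):
    """Harmonic mean of precision and recall for the positive class."""
    pairs = list(zip(y_true, y_pred))
    # Count true positives, false positives, and false negatives.
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        # No correct positive predictions: precision or recall is zero.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy labels, assumed for illustration (1 = depressed, 0 = not depressed).
example_true = [1, 1, 1, 0, 0, 0]
example_pred = [1, 1, 0, 1, 0, 0]
example_f1 = f1_score(example_true, example_pred)
```

Unlike plain accuracy, F1 ignores true negatives, so it stays informative when the two classes are imbalanced.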

Data Source

Source: Student Depression Dataset (Kaggle)

Dataset Statistics:

Observations: 27,899
Total Features: 18

Methodology & Key Steps

The analysis followed these steps:

Exploratory Data Analysis (EDA): Analyzed distributions and class balances.
Data Cleaning: Identified and handled missing data; treated outliers.
Feature Engineering: Performed feature encoding and selection based on correlation matrices.
Normalization: Applied feature normalization to ensure model stability.
Model Tuning: Conducted hyperparameter tuning and Cross-Validation.
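The cross-validation step above can be sketched as follows. This is a minimal Python illustration of k-fold splitting under assumed settings, not the project's implementation: the function name `k_fold_indices`, the choice of `k=5`, and the fixed shuffle seed are all hypothetical, since the README does not state the number of folds used.

```python
import random


def k_fold_indices(n_samples, k=5, seed=0):
    """Yield (train_indices, test_indices) pairs for k-fold cross-validation."""
    indices = list(range(n_samples))
    # Shuffle once with a fixed seed so the folds are reproducible.
    random.Random(seed).shuffle(indices)
    # Deal the shuffled indices into k folds of near-equal size.
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test_idx = folds[i]
        train_idx = [j for m, fold in enumerate(folds) if m != i for j in fold]
        yield train_idx, test_idx


# Example: 10 samples split into 5 folds of 2 test samples each.
splits = list(k_fold_indices(10, k=5))
```

Each sample appears in exactly one test fold, so averaging a metric over the folds uses every observation for evaluation once. Any normalization statistics would typically be computed on the training indices of each fold only, to avoid leaking information from the test fold.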

Classifiers Built & Results

Several models were trained to benchmark performance.

Model	Training Accuracy	Test Accuracy
Logistic Regression	86.4%	86.3%
Random Forest	84.7%	86.3%
Ensemble Model (RF + LogReg)	--	86.0%
Support Vector Machine (SVM)	86.6%	73.5%
Neural Network	Did not converge	--
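To make the table easier to read for signs of overfitting, the gap between training and test accuracy can be computed per model. The Python sketch below uses only the figures reported in the table; the names `results` and `accuracy_gaps` are hypothetical, and models without both scores (the ensemble and the neural network) are left out.

```python
# (training accuracy %, test accuracy %) as reported in the results table.
results = {
    "Logistic Regression": (86.4, 86.3),
    "Random Forest": (84.7, 86.3),
    "Support Vector Machine (SVM)": (86.6, 73.5),
}


def accuracy_gaps(scores):
    """Return training minus test accuracy, in percentage points, per model."""
    return {
        model: round(train - test, 1)
        for model, (train, test) in scores.items()
    }


gaps = accuracy_gaps(results)
# The model with the largest positive gap is the likeliest to be overfitting.
largest_gap_model = max(gaps, key=gaps.get)
```

A large positive gap means the model fits the training data much better than unseen data; a gap near zero or below suggests the model generalizes about as well as it fits.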

Conclusion & Insights

Best Performer: The Logistic Regression model gave the most consistent accuracy (86.4% training, 86.3% test), matching the Random Forest's test accuracy of 86.3%. This suggests the underlying decision boundary is largely linear.
Random Forest: Accuracy peaked at 200 trees, indicating that adding complexity beyond this point yielded diminishing returns.
Overfitting: The SVM model showed signs of overfitting, with a high training accuracy (86.6%) but a much lower test accuracy (73.5%).
Neural Network: The model did not converge within the available computation time, suggesting that simpler models (like Logistic Regression) are more efficient for this dataset.

Project by Aditya Daware
