Skip to content
View saitejabandaru-in's full-sized avatar
πŸ’«
πŸ’«

Sponsors

@kratos0718

Block or report saitejabandaru-in

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
saitejabandaru-in/README.md

πŸ“Œ I work at the intersection of statistical theory, interpretable machine learning, and real-world clinical data.


πŸ’¬ A Short Conversation

Focus: Interpretable ML Β· Nonparametric Statistics Β· Clinical & Scientific AI


🧠 Research & Engineering Philosophy

β€œModels should not only predict well β€” they should explain well.”

I approach modeling through three principles:

  1. Statistical validity before scale
  2. Interpretability before optimization
  3. Domain meaning before deployment

My research interests include:

  • interpretable and explainable machine learning (post-hoc & intrinsic)
  • permutation-based, resampling, and nonparametric inference
  • dimensionality reduction with geometric and statistical intuition
  • robustness, stability, and noise-aware modeling
  • translating statistical theory into clinically actionable insights

πŸ› οΈ Core Stack

Used primarily for statistical modeling, interpretability research, and reproducible scientific workflows.


πŸš€ Featured Open Source Projects

An Model Context Protocol (MCP) server for searching, analyzing, and retrieving academic papers.

  • Purpose: Integrates arXiv and Semantic Scholar directly into AI coding assistants (like Claude Code/Desktop).
  • Features: Page-level text extraction from PDFs using PyMuPDF (fitz), citation graph traversal, and advanced search filters.
  • Tech: FastMCP, Python, HTTPX, PyMuPDF.

Nonparametric Combination (NPC) and bootstrap-based risk stratification model.

  • Purpose: Reproducible statistical analysis framework for our peer-reviewed research in Necrotizing Fasciitis.
  • Tech: Python, NumPy, Pandas, Scipy.

End-to-end clinical NLP platform for medical entity extraction, clinical sentiment analysis, topic modeling, and automated ICD coding.

  • Purpose: Privacy-preserving processing and deep learning pipelines for unstructured health records.
  • Tech: Python, PyTorch, Transformers, FastAPI.

🌐 Ecosystem Contributions

I actively participate in bug triaging, Q&A, and technical discussion forums across key scientific Python and developer libraries:

  • scikit-learn: Contributing technical solutions for community questions on discussions (e.g. customized Gower's distance implementations, tree routing diagnostics).
  • FastAPI / Next.js: Supporting developers in resolving configuration and execution issues in production settings.

πŸ“„ Research Paper

Permutation-Based Analysis of Clinical Variables in Necrotizing Fasciitis Using NPC and Bootstrap
Mathematics, MDPI (2025)

This work introduces a permutation-based, nonparametric framework for analyzing clinical variables in necrotizing fasciitis. By combining Nonparametric Combination (NPC) methodology with bootstrap techniques, the study enables robust inference under small-sample and distribution-free conditions, with an emphasis on interpretability and clinical relevance.

The study demonstrates how permutation-based inference can outperform classical parametric approaches in rare-disease clinical settings.

πŸ”— https://www.mdpi.com/2227-7390/13/17/2869


πŸ” Current Directions

  • permutation-based inference for small-sample biomedical studies
  • interpretability under distribution shift
  • robustness diagnostics for clinical ML models
  • statistical foundations of explainable AI

πŸ”— Research & Professional Profiles

Β  Β  Β  Β  Β 


πŸš€ What You’ll Find Here

  • πŸ“˜ math and statistics-first explanations of ML & AI
  • πŸ§ͺ reproducible experiments with robust inference
  • πŸ“Š real-world clinical and analytical datasets
  • 🧠 research-oriented notebooks focused on why, not just how

🀝 Let’s Connect

⭐ Thoughtful questions and rigorous discussions are always welcome.

Pinned Loading

  1. nf-risk-stratification nf-risk-stratification Public

    β€œNPC-based risk stratification model for necrotizing fasciitis using bootstrap and permutation methods.”

    Python 10

  2. excel-automation-toolkit excel-automation-toolkit Public

    Excel automation framework integrating VBA macros with Python (Pandas) pipelines for data preprocessing, reporting, and interactive business intelligence dashboards.

    Python 11

  3. big-data-clustering-analytics big-data-clustering-analytics Public

    Scalable clustering framework for big data using KMeans++, DBSCAN, BIRCH, OPTICS and DENCLUE, applied to NYC Taxi mobility analytics and credit card fraud detection.

    Python 11 1

  4. numerical-methods-ml numerical-methods-ml Public

    Numerical methods for machine learning using PCA, LDA, NMF and K-Means on the Iris dataset, implemented in MATLAB with visual analytics.

    MATLAB 10