Skip to content

Conversation

@mohsinposts
Copy link

Who I am:
Mohsin Shah (NetID: mohsins2)

Contribution type:
New example / use case of PyHealth.

High-level description:
This PR adds an example script examples/eicu_llm_synthetic_mortality.py which demonstrates how to use PyHealth with small tabular EHR-style data and LLM-generated synthetic cohorts from my CS598 project.
The example uses 10 hand-crafted ICU features and compares three training regimes for ICU mortality prediction:

Real train → Real test

GPT baseline synthetic train → Real test

GPT privacy-aware synthetic train → Real test

How to run / what files to look at

Example script: examples/eicu_llm_synthetic_mortality.py

The script directly loads the CSVs from my project repo via raw GitHub URLs:
https://github.com/mohsinposts/CS598-DLH-LLM-eICU
(real_icu_10feat.csv, synthetic_baseline_10feat_clean.csv, synthetic_privacy_10feat_clean.csv)

From the PyHealth repo root, run:

python examples/eicu_llm_synthetic_mortality.py

This prints ROC-AUC, PR-AUC, accuracy, F1, and loss.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant