An on-device, agentic workflow: clinic note + imaging order + payer policy → submission-ready PA draft + criteria checklist + exact evidence tracing.
- ✅ A demo / hackathon prototype focused on documentation assembly, not clinical decision-making.
- ✅ Works with synthetic notes (no PHI).
- ✅ Produces a "packet bundle" folder per run:
  `packet.json`, `checklist.json`, `provenance.json`, `packet.md`, `highlights.html`
- ❌ Not a medical device.
- ❌ Not a payer portal integration.
- ❌ Not autonomous diagnosis/treatment.
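A minimal sketch of how a run's bundle files might be consumed downstream. The field names (`decision`, `criteria`, `met`) are illustrative assumptions, not the project's actual schema; a real run writes its bundle under `runs/<case_id>/`.

```python
import json
import tempfile
from pathlib import Path

# Toy bundle with hypothetical field names (real schema may differ).
packet = {"case_id": "case_01", "decision": "meets_criteria"}
checklist = {"criteria": [{"id": "conservative_care", "met": True}]}

# Write a throwaway bundle, then read it back the way a consumer would.
bundle = Path(tempfile.mkdtemp()) / "case_01"
bundle.mkdir(parents=True)
(bundle / "packet.json").write_text(json.dumps(packet))
(bundle / "checklist.json").write_text(json.dumps(checklist))

loaded = json.loads((bundle / "packet.json").read_text())
print(loaded["decision"])  # meets_criteria
```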
Requires Task (`go install github.com/go-task/task/v3/cmd/task@latest`).

```bash
task deps           # Create venv + install project
task model          # Download MedGemma GGUF (~2.5GB)
MODE=llm task run   # Run with MedGemma on case_01
MODE=llm task eval  # Evaluate all 10 cases
```

Fail-fast: if you run `MODE=llm task run` without the model file, it errors immediately with `Missing model file. Run: task model`.
Open the outputs:
- `runs/case_01/highlights.html` — evidence spans highlighted in the clinical note
- `runs/case_01/packet.md` — human-readable PA packet draft
```bash
task run   # Uses regex/keyword extraction
task eval  # Evaluate baseline on all cases
```

Or invoke the CLI directly:

```bash
python -m pa_trace run --case cases/case_01.json --out runs/case_01 --mode llm
python -m pa_trace eval --cases cases --gold cases/gold_labels.json --out runs/eval --mode llm
```

Preferred: use `task model` (idempotent; downloads only if missing).
Manual: download the GGUF from Hugging Face:

```bash
huggingface-cli download google/medgemma-4b-it-gguf \
  google_medgemma-4b-it-Q4_K_M.gguf --local-dir models/
```

- GPU (recommended): ~6GB VRAM with CUDA-enabled `llama-cpp-python`
- CPU fallback: works, but slow (~2–3 min per case vs. ~10s on GPU)
- First run: model load takes ~10–20s; subsequent inferences are faster
The default `pip install llama-cpp-python` builds CPU-only. For CUDA:

```bash
CMAKE_ARGS="-DGGML_CUDA=on" pip install llama-cpp-python --force-reinstall --no-cache-dir
```

See the llama-cpp-python docs for other backends (Metal, ROCm, Vulkan).
On the 10-case synthetic eval set:
| Metric | Expected |
|---|---|
| `symptoms_duration_weeks` | ~0.90 |
| `conservative_care_weeks` | ~0.90 |
| `red_flags_present` | ~0.90 |
| `decision_accuracy` | ~0.80–0.90 |
| `provenance_valid_rate` | 1.00 |
| `abstention_precision` | 1.00 |
Note: Decision accuracy may vary slightly run-to-run due to stochastic LLM inference. Provenance validity should remain 1.0 — all evidence spans are validated as substrings of the source text.
For demo purposes we ship a paraphrased policy snippet in policies/policy_demo_spine_mri.json.
For a real submission, replace it with a public payer guideline excerpt you can cite, chunked into JSON.
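For illustration, a chunked policy file could be generated like this. The field names (`policy_id`, `chunks`, `chunk_id`) are assumptions for the sketch, not the actual schema of `policies/policy_demo_spine_mri.json`:

```python
import json

# Illustrative structure for a chunked payer-policy file; the real
# schema used by the project may differ.
policy = {
    "policy_id": "demo_spine_mri",
    "source": "Paraphrased demo policy (replace with a citable public guideline)",
    "chunks": [
        {
            "chunk_id": "c1",
            "text": "MRI of the lumbar spine is indicated after at least "
                    "6 weeks of conservative care without improvement.",
        },
        {
            "chunk_id": "c2",
            "text": "Red flags (e.g., suspected cauda equina syndrome) "
                    "warrant imaging without a conservative-care period.",
        },
    ],
}

# Serialize the way the demo file is stored: one JSON document,
# one chunk per criterion, so each citation can point at a chunk_id.
serialized = json.dumps(policy, indent=2)
```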
- Synthetic data only: All cases use fabricated clinical notes with no PHI.
- No clinical recommendations: A refusal guardrail blocks any attempt to use the model for diagnosis or treatment decisions.
- Provenance validation: All evidence quotes are verified as exact substrings of the source text, preventing hallucinated citations.
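The provenance check described above can be as simple as an exact-substring test. A minimal sketch (function name and return shape are illustrative, not the project's actual API):

```python
def validate_quote(source_text: str, quote: str) -> dict:
    """Verify an evidence quote is an exact substring of the source note.

    Returns the character span if found; otherwise flags the quote as
    invalid (a potential hallucinated citation).
    """
    start = source_text.find(quote)
    if start == -1:
        return {"valid": False, "start": None, "end": None}
    return {"valid": True, "start": start, "end": start + len(quote)}

note = "Patient reports 8 weeks of low back pain despite physical therapy."
print(validate_quote(note, "8 weeks of low back pain"))  # valid, with span
print(validate_quote(note, "10 weeks of pain"))          # invalid: not a substring
```

Recording the `(start, end)` span, rather than just a boolean, is what makes the `highlights.html` style of exact evidence tracing possible.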
MIT
