MedMini

Runs within < 3GB RAM | 0 vRAM

Inference Time < 3 sec

A Lightweight architecture for an Answering System on medical data based on LLMs, designed to run on edge devices.

Entirely on-device processing

Model Size on Disk: 500 + 250 MB

Installation Instructions

Quickstart - Docker Memory Heavy

Download the docker-run.sh file from the repository
sudo chmod +x docker-run.sh
sudo ./docker-run.sh

Uninstalling
- sudo docker image rm medmini

Raw Install Best Performance

git clone https://github.com/sarthakchittawar/Medmini.git
sudo chmod +x install.sh ; ./install.sh
sudo chmod +x run.sh ; ./run.sh
Need to have a ubuntu>=22.04 or debian>=12 based distro

Future work

Improve the RAG algorithm without compromising on efficiency

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
App		App
build_utils		build_utils
mashqa_data		mashqa_data
media		media
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
backend.py		backend.py
dbGen.py		dbGen.py
docker-run.sh		docker-run.sh
install.sh		install.sh
medmini.pdf		medmini.pdf
medmini.py		medmini.py
run.sh		run.sh
todo		todo

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MedMini

Runs within < 3GB RAM | 0 vRAM

Inference Time < 3 sec

A Lightweight architecture for an Answering System on medical data based on LLMs, designed to run on edge devices.

Entirely on-device processing

Model Size on Disk: 500 + 250 MB

Installation Instructions

Quickstart - Docker Memory Heavy

Raw Install Best Performance

Future work

About

Releases

Packages

Contributors 3

Languages

sarthakchittawar/Medmini

Folders and files

Latest commit

History

Repository files navigation

MedMini

Runs within < 3GB RAM | 0 vRAM

Inference Time < 3 sec

A Lightweight architecture for an Answering System on medical data based on LLMs, designed to run on edge devices.

Entirely on-device processing

Model Size on Disk: 500 + 250 MB

Installation Instructions

Quickstart - Docker Memory Heavy

Raw Install Best Performance

Future work

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages