Finetune LLMs on your laptop’s GPU—no code, no PhD, no hassle.
- GPU-Powered Finetuning: Optimized for NVIDIA GPUs (even 4GB VRAM).
- One-Click Workflow: Upload data → Pick task → Train → Test.
- Hardware-Aware: Auto-detects your GPU/CPU and recommends models.
- React UI: No CLI or notebooks—just a friendly interface.
- Text-Generation: Generates answers in the form of text based on prior and fine-tuned knowledge. Ideal for use cases like customer support chatbots, story generators, social media script writers, code generators, and general-purpose chatbots.
- Summarization: Generates summaries for long articles and texts. Ideal for use cases like news article summarization, law document summarization, and medical article summarization.
- Extractive Question Answering: Finds the answer relevant to a query within a given context. Best for use cases like Retrieval Augmented Generation (RAG) and enterprise document search (for example, searching for information in internal documentation).
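To make the three task types concrete, here is a sketch of what a training record for each might look like, assuming the simple `{"input": ..., "output": ...}` JSON schema used in the example datasets later in this README (the exact schema ModelForge expects per task may differ):

```python
import json

# Hypothetical training records for each supported task, assuming a
# simple {"input": ..., "output": ...} schema for every task type.
records = {
    "text-generation": {
        "input": "Write a short poem about autumn.",
        "output": "Leaves drift down in amber light...",
    },
    "summarization": {
        "input": "A long news article goes here...",
        "output": "A one-sentence summary.",
    },
    "extractive-qa": {
        # For extractive QA, the input bundles the question with its context;
        # the output is a span copied verbatim from that context.
        "input": "Question: Who wrote the memo? Context: The memo was written by Dana.",
        "output": "Dana",
    },
}

for task, record in records.items():
    print(task, "->", json.dumps(record))
```

Note that the extractive-QA answer is a literal substring of its context, which is what distinguishes it from free-form text generation.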
- Python 3.8+: Ensure you have Python installed.
- NVIDIA GPU: Recommended VRAM >= 6GB.
- CUDA: Ensure CUDA is installed and configured for your GPU.
- Docker Desktop: Install Docker Desktop for your OS.
- NVIDIA Container Toolkit: Follow NVIDIA's installation instructions to install the NVIDIA Container Toolkit.
- HuggingFace Account: Create an account on Hugging Face and generate a fine-grained access token.
- Clone the Repository:
  git clone https://github.com/RETR0-OS/ModelForge.git
  cd ModelForge
- Set the HuggingFace API key in your environment variables:
  Linux:
  export HUGGINGFACE_TOKEN=your_huggingface_token
  Windows PowerShell:
  $env:HUGGINGFACE_TOKEN="your_huggingface_token"
  Windows CMD:
  set HUGGINGFACE_TOKEN=your_huggingface_token
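To confirm the token is actually visible to child processes before starting the containers, a quick standard-library check is:

```python
import os

# Read the HuggingFace token from the environment; None means the
# variable was not exported in the current shell session.
token = os.environ.get("HUGGINGFACE_TOKEN")

if token is None:
    print("HUGGINGFACE_TOKEN is not set -- export it before starting the containers.")
else:
    # Avoid printing the secret itself; show only a masked prefix.
    print(f"Token found (starts with {token[:4]}...).")
```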
- Build and Start the Docker Images:
docker-compose up --build
NOTE: This may take a while, especially the first time you run it. The images are quite large.
- Done! Navigate to http://localhost:3000 in your browser and get started!
- Start the Docker Containers:
docker-compose up
- Navigate to the UI:
Open your browser and go to http://localhost:3000.
To stop the application and free up resources, open a new terminal and run:
docker-compose down
{"input": "Enter a really long article here...", "output": "Short summary."}
{"input": "Enter the poem topic here...", "output": "Roses are red..."}
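A small validation pass over a dataset file can catch malformed records before a training run wastes GPU time. This sketch assumes one JSON object per line (JSONL) with `input`/`output` string fields; adjust it if ModelForge expects a single JSON array instead:

```python
import json
from pathlib import Path

def validate_dataset(path):
    """Check that every non-empty line is a JSON object with non-empty
    'input' and 'output' string fields; return the record count."""
    count = 0
    for lineno, line in enumerate(Path(path).read_text().splitlines(), 1):
        if not line.strip():
            continue  # skip blank lines
        record = json.loads(line)
        for key in ("input", "output"):
            value = record.get(key)
            if not isinstance(value, str) or not value.strip():
                raise ValueError(f"line {lineno}: missing or empty {key!r}")
        count += 1
    return count

# Example: write the two records shown above and validate them.
sample = Path("dataset.jsonl")
sample.write_text(
    '{"input": "Enter a really long article here...", "output": "Short summary."}\n'
    '{"input": "Enter the poem topic here...", "output": "Roses are red..."}\n'
)
print(validate_dataset(sample))  # prints 2
```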
- transformers + peft (LoRA finetuning)
- bitsandbytes (4-bit quantization)
- React (UI)
- FastAPI (Backend)
- Docker (Containerization)
- NVIDIA Container Toolkit (GPU support)
- NGINX (Reverse proxy)
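For reference, exposing the GPU to a container through the NVIDIA Container Toolkit is done in the compose file with a device reservation. The fragment below is illustrative only — the service and image names are hypothetical, not ModelForge's actual ones:

```yaml
services:
  backend:                        # illustrative service name
    image: modelforge-backend     # hypothetical image tag
    environment:
      - HUGGINGFACE_TOKEN=${HUGGINGFACE_TOKEN}
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: 1
              capabilities: [gpu]
```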