📄 PDF RAG Chatbot

An AI-powered document assistant that lets you upload any PDF and have a natural conversation with its contents — built with LangChain, LLaMA 3.3, and Streamlit.

🔗 Live Demo

What It Does

Most chatbots only know what they were trained on. This one reads your documents in real time.

Upload a PDF — a research paper, contract, textbook, report — and ask questions about it. The app retrieves the most relevant sections and feeds them to the LLM as context, so answers are grounded in your document rather than general knowledge.

How It Works

User uploads PDF
      ↓
PDF is split into overlapping text chunks (chunk_size=1000, overlap=200)
      ↓
Chunks are embedded using sentence-transformers/all-MiniLM-L6-v2
      ↓
Embeddings stored in ChromaDB (in-memory vector store)
      ↓
User sends a message → top-3 relevant chunks retrieved via similarity search
      ↓
Chunks injected into prompt → LLaMA 3.3 70B generates a grounded response

Tech Stack

Layer	Tool
LLM	LLaMA 3.3 70B via Groq API
Orchestration	LangChain
Embeddings	HuggingFace `all-MiniLM-L6-v2`
Vector Store	ChromaDB
PDF Parsing	PyPDFLoader
UI	Streamlit

Features

Upload any PDF and query it in natural language
Retrieval-Augmented Generation (RAG) pipeline — answers grounded in document content
Persistent chat history within session
Switchable AI personas (Helpful Assistant, Engineering Tutor, Financial Advisor, Exam Coach, Creative Writing)
Adjustable temperature for response creativity
Clean one-click chat reset

Running Locally

git clone https://github.com/Wemelo1/llm-assistant.git
cd llm-assistant
pip install -r requirements.txt
streamlit run PDF_APP.py

You'll need a free Groq API key — enter it in the sidebar when the app loads.

Why I Built This

Standard LLM chatbots hallucinate when asked about specific documents. I wanted to understand how RAG solves this by anchoring model responses to retrieved source content. This project taught me the full pipeline: chunking strategy, embedding tradeoffs, vector similarity search, and prompt construction with injected context.

What I'd Add Next

Support for multiple PDFs simultaneously
Source citation — show which chunk each answer came from
Persistent vector store (currently resets on session end)
Swap ChromaDB for FAISS for faster local retrieval

Built by Pr0_M1se — LLM Engineer

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
PDF_APP.py		PDF_APP.py
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📄 PDF RAG Chatbot

What It Does

How It Works

Tech Stack

Features

Running Locally

Why I Built This

What I'd Add Next

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

📄 PDF RAG Chatbot

What It Does

How It Works

Tech Stack

Features

Running Locally

Why I Built This

What I'd Add Next

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages