Spring AI RAG Tutorial with Ollama and PGVector

This project demonstrates the implementation of Retrieval Augmented Generation (RAG) using Spring AI, Ollama, and PGVector Database. The application serves as a personal assistant that can answer questions about Spring Boot by referencing the Spring Boot Reference Documentation PDF.

Features

  • Uses Spring AI for RAG implementation
  • Integrates with Ollama for LLM capabilities
  • Stores and retrieves vector embeddings using PGVector
  • Automatically processes and ingests Spring Boot documentation
  • Provides REST API for question-answering

Architecture

RAG Architecture (diagram)

Document Ingestion Pipeline (diagram)

Prerequisites

  • Java 21
  • Docker and Docker Compose
  • Ollama installed locally
  • Maven

Setup Instructions

  1. Install Ollama

  2. Pull the Mistral Model

    ollama pull mistral

    Note: If you skip this step, the application will automatically pull the model when it first starts, which might take a few minutes.

  3. Start PGVector Database

    docker-compose up -d

    This will start a PostgreSQL database with the PGVector extension on port 5432 (a compose.yml sketch follows this list).

  4. Build the Application

    ./mvnw clean install
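
A minimal compose.yml consistent with the credentials listed under Technical Details might look like the following sketch (the pgvector/pgvector image tag is an assumption; the repository's own compose.yml is authoritative):

    services:
      pgvector:
        image: 'pgvector/pgvector:pg16'
        environment:
          - 'POSTGRES_DB=vectordb'
          - 'POSTGRES_USER=testuser'
          - 'POSTGRES_PASSWORD=testpwd'
        ports:
          - '5432:5432'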

Running the Application

  1. Start the Spring Boot Application

    ./mvnw spring-boot:run
  2. The application will automatically:

    • Initialize the vector store schema
    • Load and process the Spring Boot reference PDF
    • Start the REST API server

Usage

Send questions about Spring Boot to the API endpoint:

    curl -X POST http://localhost:8080/api/chat \
         -H "Content-Type: text/plain" \
         -d "What is Spring Boot?"

Technical Details

  • Vector Database: PGVector (PostgreSQL with vector extension)

    • Database: vectordb
    • Username: testuser
    • Password: testpwd
    • Port: 5432
  • LLM Configuration:

    • Model: Mistral
    • Base URL: http://localhost:11434
    • Initialization timeout: 5 minutes
    • Auto-pulls model if not available locally
    • Pull strategy: when_missing
  • Document Processing:

    • Uses Apache Tika for PDF reading
    • Implements text splitting for optimal chunk size
    • Automatically ingests documentation on startup
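
These settings correspond to Spring AI configuration properties. A sketch of the relevant application.properties entries based on the values above (exact property keys can vary between Spring AI milestone releases):

    spring.datasource.url=jdbc:postgresql://localhost:5432/vectordb
    spring.datasource.username=testuser
    spring.datasource.password=testpwd

    spring.ai.ollama.base-url=http://localhost:11434
    spring.ai.ollama.chat.options.model=mistral
    spring.ai.ollama.init.pull-model-strategy=when_missing
    spring.ai.ollama.init.timeout=5m

    spring.ai.vectorstore.pgvector.initialize-schema=true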

Project Structure

  • ChatController: Handles REST API requests
  • DocumentIngestionService: Processes and stores documentation
  • application.properties: Contains configuration for Ollama and PGVector
  • compose.yml: Docker composition for PGVector database
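
As a rough illustration, DocumentIngestionService can be implemented as a startup runner built on Spring AI's TikaDocumentReader and TokenTextSplitter. The PDF location and class details below are assumptions, not the repository's exact code:

    import org.springframework.ai.document.Document;
    import org.springframework.ai.reader.tika.TikaDocumentReader;
    import org.springframework.ai.transformer.splitter.TokenTextSplitter;
    import org.springframework.ai.vectorstore.VectorStore;
    import org.springframework.beans.factory.annotation.Value;
    import org.springframework.boot.CommandLineRunner;
    import org.springframework.core.io.Resource;
    import org.springframework.stereotype.Service;

    import java.util.List;

    @Service
    public class DocumentIngestionService implements CommandLineRunner {

        private final VectorStore vectorStore;

        // Illustrative classpath location for the Spring Boot reference PDF
        @Value("classpath:spring-boot-reference.pdf")
        private Resource pdf;

        public DocumentIngestionService(VectorStore vectorStore) {
            this.vectorStore = vectorStore;
        }

        @Override
        public void run(String... args) {
            // Read the PDF with Apache Tika, split it into token-sized
            // chunks, then embed and store the chunks in PGVector
            List<Document> documents = new TikaDocumentReader(pdf).get();
            List<Document> chunks = new TokenTextSplitter().apply(documents);
            vectorStore.add(chunks);
        }
    }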

Troubleshooting

  1. Ensure Ollama is running and accessible at http://localhost:11434
  2. Verify that the PostgreSQL container is running: docker ps
  3. Check application logs for any initialization errors
  4. Ensure the Mistral model is properly pulled in Ollama
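
For checks 1 and 4, Ollama's HTTP API and CLI make quick verification possible:

    # Ollama should respond with the list of installed models
    curl http://localhost:11434/api/tags

    # mistral should appear in the output
    ollama list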

Dependencies

  • Spring Boot 3.4.3
  • Spring AI (version 1.0.0-M6)
  • PGVector
  • Apache Tika
  • Spring Boot Docker Compose Support
