This project is a modular backend framework designed to handle intelligent content extraction and question answering using Large Language Models (LLMs) and a two-step MCP (Model Context Protocol) tool-based pipeline.
This project enables users to ask questions on any topic. Instead of relying solely on an LLM’s static knowledge, the system dynamically scrapes relevant web content, processes it using specialized tools (MCP), and passes the results back to the LLM for a more accurate and grounded response.
- Tool-driven LLM architecture using OpenAI or Groq LLMs
- Web scraping through a two-step process (via Firecrawl API or equivalent)
- Modular tool system for easy extensibility
- SSE-based communication between FastAPI backend and MCP service
- Dockerized setup with Docker Compose for seamless development
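The modular tool system can be pictured as a small registry that maps tool names to handlers. The decorator and the `web_search` handler below are a hypothetical sketch of that idea, not the project's actual API:

```python
# Minimal sketch of a decorator-based tool registry (hypothetical API,
# illustrating the "modular tool system" idea, not the project's code).
from typing import Callable, Dict

TOOL_REGISTRY: Dict[str, Callable[..., str]] = {}

def tool(name: str):
    """Register a function as a callable tool under the given name."""
    def decorator(fn: Callable[..., str]) -> Callable[..., str]:
        TOOL_REGISTRY[name] = fn
        return fn
    return decorator

@tool("web_search")
def web_search(query: str) -> str:
    # A real implementation would call Firecrawl / Serper here.
    return f"results for: {query}"

# The LLM layer can then dispatch by tool name:
result = TOOL_REGISTRY["web_search"]("quantum computing")
```

Registering a new tool is then just another decorated function, which is what makes the system easy to extend.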
- User Query: A user sends a question to the FastAPI backend.
- LLM Decision: The LLM evaluates the query and determines if a tool is needed.
- MCP Tool Execution:
- If needed, the tool is invoked via an SSE (Server-Sent Events) channel from the MCP server.
- The tool processes the request (e.g., crawling the web or extracting in-depth internal links).
- LLM Response:
- The tool’s output is returned to the LLM.
- The LLM integrates this result into the final response and returns it to the user.
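The four steps above can be sketched as a single dispatch loop. `llm_decide`, `call_mcp_tool`, and `llm_respond` are illustrative stand-ins, not the project's actual function names:

```python
# Sketch of the request flow described above; all three helpers are
# illustrative stand-ins for the real LLM and MCP calls.
from typing import Optional

def llm_decide(query: str) -> dict:
    # A real LLM would decide; here we fake it: use the tool when the
    # query asks about "recent" information.
    return {"needs_tool": "recent" in query.lower(),
            "tool": "web_search", "args": {"q": query}}

def call_mcp_tool(name: str, args: dict) -> str:
    # Real code would invoke the MCP server over SSE.
    return f"[{name}] scraped content for {args['q']}"

def llm_respond(query: str, context: Optional[str]) -> str:
    grounding = context or "static model knowledge"
    return f"Answer to '{query}' based on: {grounding}"

def answer(query: str) -> str:
    decision = llm_decide(query)                 # steps 1-2: LLM evaluates the query
    if decision["needs_tool"]:
        tool_output = call_mcp_tool(decision["tool"], decision["args"])  # step 3
        return llm_respond(query, context=tool_output)                   # step 4
    return llm_respond(query, context=None)
```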
noble_backend/
│
├── routes/
│ ├── doitr/
│ │ ├── client.py
│ │ └── config.py
│ ├── MCP/
│ │ ├── crawler.py
│ │ ├── crawler2.py
│ │ └── main.py # FastMCP server
│ ├── utils/
│ │ ├── logger.py
│ │ └── OpenAI.py
│ ├── client.py # SSE client for FastAPI to MCP connection
│ └── web_search.py # Tool handler
│
├── main.py # FastAPI server entry
├── Dockerfile
├── docker-compose.yml
├── pyproject.toml
├── .env
└── README.md

Ensure you have the following installed:
- Docker
- Docker Compose
- A `.env` file with required secrets (OpenAI/Groq keys, Firecrawl key, etc.)
OPENAI_API_KEY=your_openai_key
FIRECRAWL_API_KEY=your_firecrawl_key
SERPER_API_KEY=your_serper_api_key

To start the backend and MCP servers, run:
docker-compose up --build

Alternatively, to run the services without Docker, first navigate to routes/MCP and start the MCP server:

python main.py

Then, from the noble_backend root, start the FastAPI server:

uvicorn main:app --reload
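Before starting either server, it can help to verify that the secrets from the `.env` file are actually set. This check is a suggestion, not part of the project (variable names taken from the example above):

```python
# Quick sanity check that the required secrets are present before startup.
# The variable names match the .env example above.
import os

REQUIRED = ["OPENAI_API_KEY", "FIRECRAWL_API_KEY", "SERPER_API_KEY"]

def missing_env(env=os.environ) -> list:
    """Return the names of required variables that are unset or empty."""
    return [name for name in REQUIRED if not env.get(name)]
```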
- ✅ Start the FastMCP service on port 3001
- ✅ Start the FastAPI backend service on port 8000
- FastAPI Backend: http://localhost:8000
- MCP Server (SSE Endpoint): http://localhost:3001/sse
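The SSE endpoint speaks the standard Server-Sent Events wire format: `event:` and `data:` lines, with a blank line terminating each event. A generic parser sketch for that framing (not the project's actual client code):

```python
# Minimal parser for the SSE wire format (event/data lines separated by a
# blank line). A generic sketch, not the project's actual client code.
def parse_sse(raw: str):
    """Yield (event, data) tuples from a raw SSE stream."""
    event, data_lines = "message", []
    for line in raw.splitlines():
        if line.startswith("event:"):
            event = line[len("event:"):].strip()
        elif line.startswith("data:"):
            data_lines.append(line[len("data:"):].strip())
        elif line == "" and data_lines:
            # Blank line ends the event; "message" is the SSE default type.
            yield event, "\n".join(data_lines)
            event, data_lines = "message", []
```

In practice the FastAPI side would read such a stream incrementally from the `/sse` endpoint rather than from a complete string.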
- 🌀 FastAPI uses a lifespan hook to initialize the `MCPClient`.
- 🔗 `MCPClient` connects to the MCP server at startup and fetches available tools.
- 🧩 Tools are modular and easy to register or extend.
- 🔄 Communication between FastAPI and MCP server happens via SSE (Server-Sent Events) for real-time processing.
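The startup wiring can be sketched with `contextlib.asynccontextmanager`, which is the same pattern behind FastAPI's lifespan hook. `MCPClient` here is a stub standing in for the real SSE client:

```python
# Sketch of the lifespan pattern; MCPClient is a stub, not the real client.
import asyncio
from contextlib import asynccontextmanager

class MCPClient:
    """Stub standing in for the real SSE-based MCP client."""
    def __init__(self, url: str):
        self.url, self.tools, self.connected = url, [], False

    async def connect(self):
        # Real code would open the SSE channel and list tools from the server.
        self.connected, self.tools = True, ["web_search"]

    async def close(self):
        self.connected = False

@asynccontextmanager
async def lifespan(app):
    # FastAPI runs the code before `yield` at startup, after it at shutdown.
    client = MCPClient("http://localhost:3001/sse")
    await client.connect()
    app["mcp"] = client   # real FastAPI code would use app.state.mcp
    yield
    await client.close()

async def demo():
    app = {}
    async with lifespan(app):
        return app["mcp"].tools
```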
Question: "Explain Quantum Computing in simple terms with examples from recent articles"
- 🧠 LLM decides a web search tool is required.
- 🕸️ `crawler.py` performs top-level scraping of relevant links and summaries.
- 🔍 `crawler2.py` fetches deeper, in-depth data from the internal links.
- 📤 The results are fed back to the LLM.
- 🧾 LLM constructs a well-grounded, real-time response using fresh data.
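The two-stage crawl can be sketched as follows; `fetch_page` is an illustrative stand-in for the Firecrawl-backed fetching in `crawler.py` and `crawler2.py`:

```python
# Illustrative sketch of the two-step crawl: a top-level pass collects links
# and summaries, then a second pass follows those links for in-depth content.
# fetch_page is a stand-in for the real Firecrawl-backed code.
def fetch_page(url: str) -> dict:
    # Pretend result; real code would call the Firecrawl API.
    return {"summary": f"summary of {url}", "links": [f"{url}/detail"]}

def top_level_crawl(urls):          # role of crawler.py
    pages = [fetch_page(u) for u in urls]
    summaries = [p["summary"] for p in pages]
    internal_links = [link for p in pages for link in p["links"]]
    return summaries, internal_links

def deep_crawl(links):              # role of crawler2.py
    return [fetch_page(link)["summary"] for link in links]

summaries, links = top_level_crawl(["https://example.com/quantum"])
context = summaries + deep_crawl(links)   # combined context fed back to the LLM
```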
| Tech | Usage |
|---|---|
| FastAPI | Main backend server |
| Python Asyncio | Concurrency for SSE + tool execution |
| Docker | Containerization for consistent deployment |
| SSE | Real-time FastAPI ↔ MCP server communication |
| LLMs | OpenAI / Groq via langchain_groq |
| Firecrawl | Web crawling API for scraping and retrieval |
