Retrieval observability where humans and AI debug together.
SourceMapR is a retrieval observability tool. Trace every LLM answer back to exact document evidence in two lines of code. AI agents evaluate via MCP while humans review in the dashboard.
| Problem | SourceMapR Solution |
|---|---|
| "Which chunks did the retriever return?" | See every retrieved chunk with similarity scores |
| "What prompt was sent to the LLM?" | Full prompt/response capture with token counts |
| "Why did the model hallucinate?" | Click any chunk to view it in the original PDF |
| "Is my chunking strategy working?" | Compare experiments side by side |
| "How do I evaluate retrieval at scale?" | AI agents run LLM-as-judge via MCP |
| "How do humans and AI collaborate?" | Shared workspace with evaluations UI |
Add retrieval observability in two lines of code. Let AI agents help you evaluate.
| Format | Status | Notes |
|---|---|---|
| PDF | ✅ Supported | Full support with chunk highlighting and source viewing |
| HTML | 🧪 Experimental | Basic rendering, chunk highlighting may not work |
| Other formats | 🧪 Experimental | Under development |
Current Focus: SourceMapR is optimized for PDF documents. Support for HTML and other file types is experimental and under active development.
```bash
pip install sourcemapr
sourcemapr server
```

```python
from sourcemapr import init_tracing, stop_tracing

init_tracing(endpoint="http://localhost:5000")
# Your existing LlamaIndex code, unchanged
from llama_index.core import SimpleDirectoryReader, VectorStoreIndex
documents = SimpleDirectoryReader("./papers").load_data()
index = VectorStoreIndex.from_documents(documents)
response = index.as_query_engine().query("What is attention?")
print(response)
stop_tracing()
```

```python
from sourcemapr import init_tracing, stop_tracing

init_tracing(endpoint="http://localhost:5000")
# Your existing LangChain code, unchanged
from langchain_community.document_loaders import PyPDFLoader
from langchain_text_splitters import RecursiveCharacterTextSplitter
from langchain_community.vectorstores import FAISS
from langchain_openai import OpenAIEmbeddings  # added so `embeddings` below is defined
loader = PyPDFLoader("./papers/attention.pdf")
documents = loader.load()
splitter = RecursiveCharacterTextSplitter(chunk_size=512)
chunks = splitter.split_documents(documents)
embeddings = OpenAIEmbeddings()  # example embedding model (assumed); swap in your own
vectorstore = FAISS.from_documents(chunks, embeddings)
results = vectorstore.similarity_search("What is attention?")
stop_tracing()
```

Open http://localhost:5000 to see the full evidence lineage.
| Framework | Documents | Chunks | Retrieval | LLM Calls |
|---|---|---|---|---|
| LlamaIndex | ✅ | ✅ | ✅ | ✅ |
| LangChain | ✅ | ✅ | ✅ | ✅ |
| OpenAI | ❌ | ❌ | ❌ | ✅ |
⚠️ Experimental: Pipeline tracing (e.g., `langchain_pipeline_demo.py`) does not yet have stable support. Basic functionality works, but expect limitations.
See Supported Features for details.
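The support table lists OpenAI alongside LlamaIndex and LangChain. Below is a minimal sketch of tracing a direct OpenAI chat call, assuming the client is instrumented automatically once `init_tracing` is active, as in the framework quickstarts above; the model name and prompt are illustrative.

```python
from openai import OpenAI
from sourcemapr import init_tracing, stop_tracing

# Start tracing before any OpenAI calls are made (assumed to be sufficient,
# mirroring the LlamaIndex/LangChain quickstarts).
init_tracing(endpoint="http://localhost:5000")

client = OpenAI()  # reads OPENAI_API_KEY from the environment
response = client.chat.completions.create(
    model="gpt-4o-mini",  # illustrative model name
    messages=[{"role": "user", "content": "What is attention?"}],
)
print(response.choices[0].message.content)

stop_tracing()
```

A plain client call like this produces only LLM-call traces; document, chunk, and retrieval lineage come from the framework integrations.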
- Trace LLM Answers to Sources: trace responses to the exact chunks with similarity scores and rankings
- PDF Chunk Viewer: click any chunk to see it highlighted in the original PDF
- Full LLM Tracing: prompts, responses, tokens, and latency for every query
- Experiment Tracking: organize runs and compare chunking strategies
- Evidence Lineage: complete trace from document load → parse → chunk → embed → retrieve → answer
- Debug RAG Hallucinations: verify grounding without guessing
- MCP Server: AI agents can read data and write evaluations via the Model Context Protocol
- Evaluations Tab: view LLM-as-judge scores, categorize queries, and track quality over time
SourceMapR enables a collaborative workflow between humans and AI agents:
```
┌─────────────────┐      ┌─────────────────┐      ┌─────────────────┐
│  RAG Pipeline   │─────▶│   SourceMapR    │◀─────│    AI Agent     │
│  (Your Code)    │      │   (Workspace)   │      │  (Claude, etc)  │
└─────────────────┘      └─────────────────┘      └─────────────────┘
         │                        │                        │
         │ Traces queries,        │ Stores everything      │ Reads queries,
         │ chunks, responses      │ in SQLite              │ writes evaluations
         │                        │                        │
         └────────────────────────┼────────────────────────┘
                                  │
                                  ▼
                      ┌─────────────────────┐
                      │    Human Reviews    │
                      │   in Dashboard UI   │
                      └─────────────────────┘
```
Add to your Claude Code config (`~/.claude.json`):

```json
{
  "mcpServers": {
    "sourcemapr": {
      "command": "python",
      "args": ["-c", "from sourcemapr.mcp_server import run; run()"]
    }
  }
}
```

Read Tools:
- `list_queries`: List all retrieval queries
- `get_query`: Get query details with retrieved chunks and LLM response
- `list_documents`: List indexed documents
- `list_experiments`: List experiments
Write Tools:
- `create_evaluation`: Store LLM-as-judge evaluation (relevance, faithfulness, etc.)
- `add_query_category`: Categorize queries (e.g., "financial", "technical")
- `list_evaluations`: View stored evaluations
Example agent prompt:
"Use the sourcemapr MCP tools to evaluate all queries. For each query, score relevance (0-1) and faithfulness (0-1). Add reasoning for each score."
```bash
# Server management
sourcemapr server # Start server (foreground)
sourcemapr server -b # Start server in background
sourcemapr server -p 8080 # Start on custom port
sourcemapr stop # Stop running server
sourcemapr restart # Restart server
sourcemapr status # Check if server is running
# Data management
sourcemapr clear # Clear all trace data (with confirmation)
sourcemapr clear -y # Clear without confirmation
sourcemapr init # Initialize database
sourcemapr init --reset # Delete and recreate database
# Info
sourcemapr version # Show version
```

```bash
# LlamaIndex with PDFs
python examples/llamaindex_pdf_demo.py
# LangChain with PDFs
python examples/langchain_pdf_demo.py
```

See Examples for more.
Install from PyPI:

```bash
pip install sourcemapr
```

Or, for a development install from source:

```bash
git clone https://github.com/kamathhrishi/sourcemapr.git
cd sourcemapr && pip install -e .
```

- Supported Features: Framework coverage
- Examples: Usage examples
- REST API: API endpoints
MIT
Retrieval observability where humans and AI debug together.