InteRank: Reasoning-Intensive Document Ranking with Small Language Models


InteRank is a novel approach for training compact language models (~3B parameters) to perform reasoning-intensive document ranking with performance comparable to models over 20x larger. Our methodology combines knowledge distillation from a large teacher model with reinforcement learning to create efficient yet powerful ranking models that can explain their decisions.

Overview

Key features:

  • Achieves state-of-the-art performance on the BRIGHT benchmark using only a 3B parameter model
  • Generates natural language explanations to justify ranking decisions
  • Uses a two-stage training approach combining knowledge distillation and reinforcement learning
  • Requires no human annotations for training
  • Supports both query generation and document relevance assessment (a usage sketch follows this list)
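
To make the relevance-assessment interface concrete, here is a minimal sketch of scoring a single (query, document) pair with a compact instruction-tuned model. The checkpoint name, prompt wording, and 0-2 label scale are illustrative assumptions, not the exact prompts or labels used by InteRank.

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumption: the checkpoint name is illustrative; any compact
# instruction-tuned model with a chat template works the same way here.
model_name = "meta-llama/Llama-3.2-3B-Instruct"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, device_map="auto")

def assess_relevance(query: str, document: str) -> str:
    """Ask the model for a step-by-step explanation followed by a relevance label."""
    messages = [{
        "role": "user",
        "content": (
            f"Query:\n{query}\n\nDocument:\n{document}\n\n"
            "Explain step by step whether the document helps answer the query, "
            "then give a final relevance label from 0 (irrelevant) to 2 (highly relevant)."
        ),
    }]
    inputs = tokenizer.apply_chat_template(
        messages, add_generation_prompt=True, return_tensors="pt"
    ).to(model.device)
    output = model.generate(inputs, max_new_tokens=512)
    return tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True)
```

In a re-ranking setting, the final label parsed from each response would be used to sort the candidate documents for a query.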

Training Process

  1. Synthetic Data Generation: Automatically generates training data from StackExchange question-answer pairs using a large teacher model (Llama 3.3 70B); a labeling sketch follows this list

  2. Knowledge Distillation: Transfers initial reasoning capabilities from the teacher to a compact student model (Llama 3.2 3B)

  3. Reinforcement Learning: Refines reasoning capabilities by rewarding high-quality explanations and accurate relevance predictions; a reward sketch also follows this list
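
The following sketch illustrates step 1, using the teacher to produce an explanation and a relevance label for one StackExchange question-answer pair. The prompt wording, the 0-2 label scale, the last-line parsing, and the use of an OpenAI-compatible endpoint (e.g., a local vLLM server) are assumptions for illustration.

```python
import json
from openai import OpenAI

# Assumption: the teacher is served behind an OpenAI-compatible endpoint;
# the URL and API key here are placeholders.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

def label_pair(question: str, answer: str) -> dict:
    """Have the teacher explain and label one StackExchange question-answer pair."""
    prompt = (
        f"Question:\n{question}\n\nCandidate document:\n{answer}\n\n"
        "Reason step by step about whether the document answers the question, "
        "then give a relevance label from 0 to 2 on the final line."
    )
    response = client.chat.completions.create(
        model="meta-llama/Llama-3.3-70B-Instruct",
        messages=[{"role": "user", "content": prompt}],
    )
    text = response.choices[0].message.content
    explanation, _, label = text.rpartition("\n")  # assumption: label on last line
    return {"question": question, "document": answer,
            "explanation": explanation.strip(), "label": label.strip()}

# Each labeled pair can be appended to a JSONL file for the distillation stage:
# with open("distillation_data.jsonl", "a") as f:
#     f.write(json.dumps(label_pair(q, a)) + "\n")
```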
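
For step 3, the reward signal couples prediction accuracy with explanation quality. The sketch below is a simplified stand-in: the exact reward terms and weights in InteRank may differ, and the explanation score is assumed to come from an external judge such as the teacher model.

```python
def reward(predicted_label: int, gold_label: int, explanation_score: float) -> float:
    """Combine label correctness with a judged explanation score in [0, 1].

    The two terms and the 0.5 weight are assumptions; InteRank's actual
    reward shaping may differ.
    """
    correctness = 1.0 if predicted_label == gold_label else 0.0
    return correctness + 0.5 * explanation_score

# e.g. reward(2, 2, 0.8) -> 1.4; reward(0, 2, 0.9) -> 0.45
```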

Model Architecture

  • Base Model: Llama 3.2 3B
  • Training: QLoRA with 4-bit quantization and rank-64 adapters (see the configuration sketch after this list)
  • Context Length: 4K tokens
  • Hardware Requirements: Single A100 GPU
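
A minimal sketch of the setup above: loading the base model in 4-bit and attaching rank-64 LoRA adapters with Hugging Face transformers and peft. Only the 4-bit quantization and rank 64 come from the description above; the quant type, LoRA alpha, and target modules are assumptions.

```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

# 4-bit quantization of the base model (QLoRA). The NF4 quant type and
# bfloat16 compute dtype are common defaults, assumed here.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-3.2-3B-Instruct",
    quantization_config=bnb_config,
    device_map="auto",
)

# Rank-64 adapters per the setup above; alpha and target modules are assumptions.
lora_config = LoraConfig(
    r=64,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # sanity check: only adapter weights train
```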

Results

Our 3B parameter model achieves:

  • 27.4% average nDCG@10 across all domains on the BRIGHT benchmark
  • 3rd place on the BRIGHT leaderboard
  • Outperforms recent approaches such as Reason-to-Rank (nDCG@5: 26.2 vs. 19.6)
  • Performance comparable to ensemble models using 70B+ parameters
