CBDB LLM Evaluator

A tool for evaluating LLM responses against the China Biographical Database (CBDB).

Setup

Download the CBDB SQLite database from Hugging Face
Extract the downloaded .7z file
Rename the extracted database file to cbdb.db
Place cbdb.db in the root directory of this project

Files

cbdb_llm_eval.py - Main evaluation script
person_ids.txt - List of person IDs for evaluation. These IDs can be used with the narrative-cbdb project to retrieve narrative biographical texts for the listed persons for further RAG test.
cbdb_llm_eval.csv - CSV output of evaluation results
cbdb_llm_eval.xlsx - Excel format of evaluation results

Usage

When you run the program cbdb_llm_eval.py, it will randomly generate data for evaluation purposes.

Example Data

If you don't want to run the program yourself, you can access example data directly from the repository:

Example evaluation results (CSV)
Example evaluation results (Excel)
Example person IDs

Note

This project requires the CBDB SQLite database file to be named exactly as cbdb.db and placed in the project root directory. Without this database file, the evaluation scripts will not function properly.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

CBDB LLM Evaluator

Setup

Files

Usage

Example Data

Note

Files

README.md

Latest commit

History

README.md

File metadata and controls

CBDB LLM Evaluator

Setup

Files

Usage

Example Data

Note