A tool for evaluating LLM responses against the China Biographical Database (CBDB).
- Download the CBDB SQLite database from Hugging Face
- Extract the downloaded `.7z` file
- Rename the extracted database file to `cbdb.db`
- Place `cbdb.db` in the root directory of this project
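
As a quick sanity check after setup, the sketch below confirms that the database file is where it is expected and that SQLite can open it. It is not part of this project: the function name `check_cbdb` is hypothetical, and it only lists whatever tables the file contains rather than assuming any particular CBDB schema.

```python
import sqlite3
from pathlib import Path


def check_cbdb(db_path="cbdb.db"):
    """Return the table names found in the database file.

    Hypothetical helper (not part of this project). Raises
    FileNotFoundError if the file is missing and sqlite3.DatabaseError
    if the file is not a readable SQLite database.
    """
    path = Path(db_path)
    if not path.is_file():
        raise FileNotFoundError(
            f"{path} not found; place cbdb.db in the project root"
        )
    # Open read-only so the check can never modify the database.
    conn = sqlite3.connect(f"file:{path.as_posix()}?mode=ro", uri=True)
    try:
        rows = conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
    finally:
        conn.close()
    return [name for (name,) in rows]
```

Calling `check_cbdb()` from the project root should return a non-empty list of table names if the download, extraction, and rename steps went as intended.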
- `cbdb_llm_eval.py` - Main evaluation script
- `person_ids.txt` - List of person IDs for evaluation. These IDs can be used with the narrative-cbdb project to retrieve narrative biographical texts for the listed persons for further RAG testing.
- `cbdb_llm_eval.csv` - CSV output of evaluation results
- `cbdb_llm_eval.xlsx` - Excel format of evaluation results
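
To reuse the person IDs elsewhere (for example, in a RAG test), a small loader along these lines may be enough. This is a sketch under one assumption: that `person_ids.txt` holds one ID per line, which this README does not confirm. The function name `load_person_ids` is hypothetical, and IDs are kept as strings to avoid guessing their type.

```python
from pathlib import Path


def load_person_ids(path="person_ids.txt"):
    """Read person IDs from a text file.

    Assumes one ID per line (the file layout is an assumption, not
    confirmed by the project). Blank lines are skipped and surrounding
    whitespace is stripped; IDs are returned as strings.
    """
    text = Path(path).read_text(encoding="utf-8-sig")  # tolerate a BOM
    return [line.strip() for line in text.splitlines() if line.strip()]
```

If the file turns out to use a different layout (for example, comma-separated values), the line splitting would need to be adjusted accordingly.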
When you run `cbdb_llm_eval.py`, it randomly generates data for evaluation purposes.
If you don't want to run the program yourself, you can access example data directly from the repository:
This project requires the CBDB SQLite database file to be named exactly `cbdb.db` and placed in the project root directory. Without this database file, the evaluation scripts will not function properly.