RAG_experiments

Set up the DB

  1. Download the dataset: `wget https://huggingface.co/datasets/m-ric/huggingface_doc/resolve/main/huggingface_doc.csv`
  2. Build the db: `python build_vector_database.py`
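
Before building the db, it can be worth checking that the download completed and the CSV parses. The helper below is a minimal sketch, not part of this repo: `summarize_csv` is a hypothetical name, and it only assumes that the file has a header row. It makes no assumption about which columns `build_vector_database.py` expects.

```python
import csv


def summarize_csv(path):
    """Return (column_names, row_count) for a CSV file with a header row."""
    # Documentation pages can be long, so raise the csv module's
    # default per-field size limit before reading.
    csv.field_size_limit(2**31 - 1)
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        header = next(reader, [])  # empty list if the file is empty
        row_count = sum(1 for _ in reader)  # counts records, not physical lines
    return header, row_count
```

For example, `summarize_csv("huggingface_doc.csv")` returns the column names and the number of records, which should be non-zero if the download succeeded.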

To run a basic RAG test: `python rag.py`

Blog posts

RAG evaluation

  1. Generate the dataset: `python test_procedure_for_rag/generate_qa_pairs.py data/chroma_db_1000/ data/`
  2. Generate the answers using the RAG: `python test_procedure_for_rag/generate_answers.py data/chroma_db_1000/ data/qa_dataset_limit\=10.csv data/`
  3. Evaluate: `python test_procedure_for_rag/evaluate.py data/chroma_db_1000/ data/qa_dataset_limit\=10.csv data/qa_dataset_limit\=10_answers.csv`
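
The three evaluation steps share the same db directory and pass files from one step to the next, so they can be chained from a small driver. This is a sketch under stated assumptions, not a script shipped with the repo: `evaluation_commands` and `run_evaluation` are hypothetical names, and the file names are assumed to follow the `qa_dataset_limit=<N>.csv` / `qa_dataset_limit=<N>_answers.csv` pattern seen in the commands above.

```python
import subprocess
import sys
from pathlib import Path


def evaluation_commands(db_dir, data_dir, limit=10):
    """Build the three evaluation commands listed above, in order."""
    data = Path(data_dir)
    # Assumption: output names follow the pattern used in the README commands.
    qa_csv = data / f"qa_dataset_limit={limit}.csv"
    answers_csv = data / f"qa_dataset_limit={limit}_answers.csv"
    scripts = Path("test_procedure_for_rag")
    py = sys.executable  # run the scripts with the current interpreter
    return [
        [py, str(scripts / "generate_qa_pairs.py"), str(db_dir), str(data_dir)],
        [py, str(scripts / "generate_answers.py"), str(db_dir), str(qa_csv), str(data_dir)],
        [py, str(scripts / "evaluate.py"), str(db_dir), str(qa_csv), str(answers_csv)],
    ]


def run_evaluation(db_dir, data_dir, limit=10):
    """Run each step in sequence, stopping at the first failure."""
    for cmd in evaluation_commands(db_dir, data_dir, limit):
        subprocess.run(cmd, check=True)
```

Calling `run_evaluation("data/chroma_db_1000/", "data/")` from the repository root would run the same three commands as the list above. Passing each command as an argument list avoids shell quoting, so the `=` in the file names needs no escaping.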