From 1adc02d063cec8fad2df4bbd4aef79ff54b6d80d Mon Sep 17 00:00:00 2001 From: Yan Gao Date: Wed, 11 Sep 2024 15:55:23 +0100 Subject: [PATCH] fix(benchmarks) Update medical evaluation readme (#4171) --- benchmarks/flowertune-llm/evaluation/medical/README.md | 3 +++ 1 file changed, 3 insertions(+) diff --git a/benchmarks/flowertune-llm/evaluation/medical/README.md b/benchmarks/flowertune-llm/evaluation/medical/README.md index 78de069460d8..628489ce8de6 100644 --- a/benchmarks/flowertune-llm/evaluation/medical/README.md +++ b/benchmarks/flowertune-llm/evaluation/medical/README.md @@ -22,6 +22,9 @@ huggingface-cli login ## Generate model decision & calculate accuracy +> [!NOTE] +> Please ensure that you use `quantization=4` to run the evaluation if you wish to participate in the LLM Leaderboard. + ```bash python eval.py \ --peft-path=/path/to/fine-tuned-peft-model-dir/ \ # e.g., ./peft_1