Skip to content

Commit e82887e

Browse files
committed
Add script to run evaluation before and after SFT training
1 parent 64d6d9b commit e82887e

File tree

8 files changed

+968
-152
lines changed

8 files changed

+968
-152
lines changed

src/MaxText/configs/base.yml

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -488,6 +488,7 @@ tokenizer_path: "src/MaxText/assets/tokenizer.llama2"
488488
# hf pipeline only supports huggingface type, and will ignore tokenizer_type flag
489489
tokenizer_type: "sentencepiece" # Currently supporting: "tiktoken", "sentencepiece", "huggingface"
490490
use_chat_template: False
491+
chat_template_path: "" # path to chat template json file
491492
tokenize_train_data: True # False if the dataset is pre-tokenized
492493
tokenize_eval_data: True # False if the dataset is pre-tokenized
493494
add_bos: True
Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,5 @@
1+
{
2+
"PROMPT_TEMPLATE": "You are given a mathematical problem. You must solve the problem and provide your reasoning. Place your entire thought process and steps between <reasoning> and </reasoning>. After your reasoning, provide only the final numerical answer, extracted from your reasoning steps, between <answer> and </answer>. The user's problem is:\n{question}",
3+
"COMPLETION_TEMPLATE": "<reasoning>\n{reasoning}\n</reasoning>\n<answer>{answer}</answer>",
4+
"REASONING_ANSWER_SEPARATOR": "####"
5+
}

0 commit comments

Comments
 (0)