Commit ac1dc93

Merge pull request #2545 from AI-Hypercomputer:sft_eval
PiperOrigin-RevId: 826614476
2 parents 151fa9f + e82887e commit ac1dc93

8 files changed: +974 -152 lines changed


src/MaxText/configs/base.yml

Lines changed: 1 addition & 0 deletions
@@ -489,6 +489,7 @@ tokenizer_path: "src/MaxText/assets/tokenizer.llama2"
 # hf pipeline only supports huggingface type, and will ignore tokenizer_type flag
 tokenizer_type: "sentencepiece" # Currently supporting: "tiktoken", "sentencepiece", "huggingface"
 use_chat_template: False
+chat_template_path: "" # path to chat template json file
 tokenize_train_data: True # False if the dataset is pre-tokenized
 tokenize_eval_data: True # False if the dataset is pre-tokenized
 add_bos: True
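
The new `chat_template_path` key defaults to an empty string. The following is a minimal sketch of how a data pipeline might treat that value. It assumes an empty path means "no template file configured", which this hunk does not confirm. The helper name `load_chat_template` is hypothetical and is not taken from this commit.

```python
import json
from typing import Optional


def load_chat_template(chat_template_path: str) -> Optional[dict]:
    """Return the parsed template JSON, or None when no path is configured.

    Hypothetical helper for illustration; not part of this commit.
    """
    # Assumption: an empty string means "no template file configured".
    if not chat_template_path:
        return None
    # The config comment describes the target as a JSON file.
    with open(chat_template_path, encoding="utf-8") as f:
        return json.load(f)
```

Returning `None` rather than raising keeps the default config (`chat_template_path: ""`) usable without a template file; a stricter pipeline could raise an error when `use_chat_template` is true and no path is set.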
Lines changed: 5 additions & 0 deletions
@@ -0,0 +1,5 @@
+{
+"PROMPT_TEMPLATE": "You are given a mathematical problem. You must solve the problem and provide your reasoning. Place your entire thought process and steps between <reasoning> and </reasoning>. After your reasoning, provide only the final numerical answer, extracted from your reasoning steps, between <answer> and </answer>. The user's problem is:\n{question}",
+"COMPLETION_TEMPLATE": "<reasoning>\n{reasoning}\n</reasoning>\n<answer>{answer}</answer>",
+"REASONING_ANSWER_SEPARATOR": "####"
+}
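
The three keys suggest a dataset whose reference answers hold the reasoning, then the `####` separator, then the final answer. Here is a minimal sketch of applying the template under that assumption. The function name `format_example` is hypothetical, the prompt text is shortened for brevity, and the actual formatting code in this commit is not shown here.

```python
from typing import Tuple

# Same keys as the template file above; the prompt text is abbreviated here.
TEMPLATE = {
    "PROMPT_TEMPLATE": "The user's problem is:\n{question}",
    "COMPLETION_TEMPLATE": "<reasoning>\n{reasoning}\n</reasoning>\n<answer>{answer}</answer>",
    "REASONING_ANSWER_SEPARATOR": "####",
}


def format_example(question: str, raw_answer: str, template: dict) -> Tuple[str, str]:
    """Build (prompt, completion) strings from one question/answer pair.

    Hypothetical helper; assumes raw_answer is "<reasoning><separator><answer>".
    """
    separator = template["REASONING_ANSWER_SEPARATOR"]
    # Split on the first separator: the text before it is the reasoning,
    # the text after it is the final answer.
    reasoning, _, answer = raw_answer.partition(separator)
    prompt = template["PROMPT_TEMPLATE"].format(question=question)
    completion = template["COMPLETION_TEMPLATE"].format(
        reasoning=reasoning.strip(), answer=answer.strip()
    )
    return prompt, completion


prompt, completion = format_example("What is 2 + 3?", "2 + 3 = 5\n#### 5", TEMPLATE)
print(completion)
```

If the separator is missing, `str.partition` leaves `answer` empty, so a real pipeline would likely want to validate the input before formatting.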
