-
Notifications
You must be signed in to change notification settings - Fork 418
GRPO & SFT E2E tests #2497
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
GRPO & SFT E2E tests #2497
Conversation
ebbe014 to
480c7e4
Compare
8a115f5 to
c9f10d2
Compare
Signed-off-by: Vladimir Suvorov <[email protected]> Fix Signed-off-by: Vladimir Suvorov <[email protected]> Fix Signed-off-by: Vladimir Suvorov <[email protected]> Fix Signed-off-by: Vladimir Suvorov <[email protected]> Fix Signed-off-by: Vladimir Suvorov <[email protected]> Fix Signed-off-by: Vladimir Suvorov <[email protected]> Fix Signed-off-by: Vladimir Suvorov <[email protected]> Fix Signed-off-by: Vladimir Suvorov <[email protected]> Fix Signed-off-by: Vladimir Suvorov <[email protected]> Need scan layers Signed-off-by: Vladimir Suvorov <[email protected]> Fix Signed-off-by: Vladimir Suvorov <[email protected]> Fix Signed-off-by: Vladimir Suvorov <[email protected]> Add sft test as well Signed-off-by: Vladimir Suvorov <[email protected]> Add sft test as well Signed-off-by: Vladimir Suvorov <[email protected]> Add sft test as well Signed-off-by: Vladimir Suvorov <[email protected]> Fix vllm Signed-off-by: Vladimir Suvorov <[email protected]> fix Signed-off-by: Vladimir Suvorov <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Why do you need a separate file for E2E tests?
You can also run python3 -m MaxText.sft.sft_trainer MaxText/configs/sft.yml \ run_name=$RUN_NAME base_output_directory=$OUTPUT_PATH \ model_name=$MODEL_NAME load_parameters_path=$MODEL_CHECKPOINT_PATH \ hf_access_token=$HF_ACCESS_TOKEN tokenizer_path=$TOKENIZER_PATH"
| XLA_PYTHON_CLIENT_MEM_FRACTION: 0.75 | ||
| TF_FORCE_GPU_ALLOW_GROWTH: false | ||
| HF_TOKEN: ${{ secrets.HF_TOKEN }} | ||
| MAXTEXT_CHECKPOINT_PATH: gs://maxtext-model-checkpoints/llama3.1-8b/2025-01-23-19-04/scanned/0/items |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Use the instruct checkpoint: gs://maxtext-model-checkpoints/llama3.1_8b_instruct/2025-10-16/scanned/0/items
| XLA_PYTHON_CLIENT_MEM_FRACTION: 0.75 | ||
| TF_FORCE_GPU_ALLOW_GROWTH: false | ||
| HF_TOKEN: ${{ secrets.HF_TOKEN }} | ||
| MODEL_CHECKPOINT_PATH: gs://maxtext-model-checkpoints/llama3.1-8b/2025-01-23-19-04/scanned/0/items |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Instruct checkpoint: gs://maxtext-model-checkpoints/llama3.1_8b_instruct/2025-10-16/scanned/0/items
| python3 -m pip install -e . --no-dependencies | ||
| - name: Install Tunix vLLM Requirements | ||
| run: | | ||
| bash src/MaxText/examples/install_tunix_vllm_requirement.sh |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We don't need vllm or tpu-commons to run SFT.
Description
Add tests for e2e grpo & sft
Tests
Added a separate TPU test for grpo
Checklist
Before submitting this PR, please make sure (put X in square brackets):
gemini-reviewlabel.