GRPO & SFT E2E tests #2497
base: main
Changes from all commits:

```diff
@@ -34,6 +34,9 @@ concurrency:
   }}
   cancel-in-progress: true
 
+permissions:
+  contents: read
+
 jobs:
   prelim:
     runs-on: ["self-hosted"]
@@ -103,6 +106,53 @@ jobs:
       container_resource_option: "--privileged"
       is_scheduled_run: ${{ github.event_name == 'schedule' }}
 
+  tpu_e2e_grpo_test:
+    needs: tpu_image
+    runs-on: linux-x86-ct4p-240-4tpu
+    container:
+      image: gcr.io/tpu-prod-env-multipod/maxtext_${{ github.run_id }}:tpu
+      env:
+        XLA_PYTHON_CLIENT_MEM_FRACTION: 0.75
+        TF_FORCE_GPU_ALLOW_GROWTH: false
+        HF_TOKEN: ${{ secrets.HF_TOKEN }}
+        MAXTEXT_CHECKPOINT_PATH: gs://maxtext-model-checkpoints/llama3.1-8b/2025-01-23-19-04/scanned/0/items
+      options: "--privileged"
+    steps:
+      - uses: actions/checkout@v4
+      - name: Install Tunix vLLM Requirements
+        run: |
+          bash src/MaxText/examples/install_tunix_vllm_requirement.sh
+      - name: Run GRPO Llama3.1 8B Demo
+        run: |
+          python3 -m pip install -e . --no-dependencies &&
+          python3 src/MaxText/examples/grpo_llama3_1_8b_demo.py
+
+  tpu_e2e_sft_test:
+    needs: tpu_image
+    runs-on: linux-x86-ct4p-240-4tpu
+    container:
+      image: gcr.io/tpu-prod-env-multipod/maxtext_${{ github.run_id }}:tpu
+      env:
+        XLA_PYTHON_CLIENT_MEM_FRACTION: 0.75
+        TF_FORCE_GPU_ALLOW_GROWTH: false
+        HF_TOKEN: ${{ secrets.HF_TOKEN }}
+        MODEL_CHECKPOINT_PATH: gs://maxtext-model-checkpoints/llama3.1-8b/2025-01-23-19-04/scanned/0/items
+        STEPS: 10
+      options: "--privileged"
+    steps:
+      - uses: actions/checkout@v4
+      - name: Install Dependencies
+        run: |
+          python3 -m pip install -e . --no-dependencies
+      - name: Install Tunix vLLM Requirements
+        run: |
+          bash src/MaxText/examples/install_tunix_vllm_requirement.sh
+      - name: Run SFT Llama3.1 8B Demo
+        run: |
+          python3 src/MaxText/examples/sft_llama3_demo.py \
+            --skip_checkpoint_download \
+            --model_checkpoint_path=${MODEL_CHECKPOINT_PATH}
+
   gpu_unit_tests:
     needs: gpu_image
     uses: ./.github/workflows/run_tests_internal.yml
```
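For local debugging, the two new jobs boil down to the commands below. This is a minimal sketch, assuming a TPU host with the repo checked out, an exported `HF_TOKEN`, and read access to the GCS checkpoint bucket; in CI the same commands run inside the `maxtext_${{ github.run_id }}:tpu` container.

```bash
# Environment the workflow sets on the job containers (values copied from the diff).
export HF_TOKEN=<your-hugging-face-token>
export MAXTEXT_CHECKPOINT_PATH=gs://maxtext-model-checkpoints/llama3.1-8b/2025-01-23-19-04/scanned/0/items  # GRPO job
export MODEL_CHECKPOINT_PATH=gs://maxtext-model-checkpoints/llama3.1-8b/2025-01-23-19-04/scanned/0/items    # SFT job
export STEPS=10  # also set on the SFT job (presumably consumed by the demo script)

# Shared setup: editable install of MaxText without dependencies, then the Tunix/vLLM requirements.
python3 -m pip install -e . --no-dependencies
bash src/MaxText/examples/install_tunix_vllm_requirement.sh

# GRPO E2E demo (what tpu_e2e_grpo_test runs).
python3 src/MaxText/examples/grpo_llama3_1_8b_demo.py

# SFT E2E demo (what tpu_e2e_sft_test runs), skipping the HF download and using the GCS checkpoint.
python3 src/MaxText/examples/sft_llama3_demo.py \
  --skip_checkpoint_download \
  --model_checkpoint_path=${MODEL_CHECKPOINT_PATH}
```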
Review comments:
- On `MODEL_CHECKPOINT_PATH` in `tpu_e2e_sft_test`: "Use the instruct checkpoint: gs://maxtext-model-checkpoints/llama3.1_8b_instruct/2025-10-16/scanned/0/items"
- On the install steps of `tpu_e2e_sft_test`: "We don't need …" (the rest of this comment is cut off in the source).
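If the reviewer's suggestion is adopted, the SFT job's `env` block would presumably change only in the checkpoint path, roughly as sketched below; the variable name and the other values are taken from the diff, and the exact final form is up to the PR author.

```yaml
# Hypothetical env block for tpu_e2e_sft_test after switching to the instruct checkpoint.
env:
  XLA_PYTHON_CLIENT_MEM_FRACTION: 0.75
  TF_FORCE_GPU_ALLOW_GROWTH: false
  HF_TOKEN: ${{ secrets.HF_TOKEN }}
  # Instruct checkpoint suggested in review, replacing the base llama3.1-8b path:
  MODEL_CHECKPOINT_PATH: gs://maxtext-model-checkpoints/llama3.1_8b_instruct/2025-10-16/scanned/0/items
  STEPS: 10
```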