forked from NVIDIA/Megatron-LM
Pull requests: ROCm/Megatron-LM
Pull requests list
feat: add LoRA adapter layer and Mixtral LoRA training
#53 opened Jan 31, 2025 by mpashkovskii • Draft
Add FSDP arguments and example script to train model with FSDP-v2
#52 opened Jan 28, 2025 by ryang-amd
[Perf] Skip creating attention mask in llama dataloader
#40 opened Dec 13, 2024 by billishyahao