Pull requests: deepspeedai/DeepSpeed
Replace calls to python setup.py sdist with python -m build --sdist
#7069
opened Feb 21, 2025 by loadams
Avoid graph breaks in torch.compile caused by inner classes in the backward hooks
#7062
opened Feb 20, 2025 by deepcharm
Fix: pipeline model with MoE causes error when sending grad
#7055
opened Feb 19, 2025 by wukong1992
Enabled high-performance Automatic Tensor Parallelism (auto TP) for the Qwen2-MoE and DeepSeek-V2 models on multiple GPUs/HPUs
#6964
opened Jan 21, 2025 by gyou2021
Update sharded_moe.py to support top2 gate with Tutel
#6948
opened Jan 14, 2025 by xenshinu
Fix: forbid repeated deepspeed.initialize on training objects
#6874
opened Dec 16, 2024 by traincheck-team
Training ops kernels: Speeding up the Llama-based MoE architectures
#6734
opened Nov 8, 2024 by RezaYazdaniAminabadi
Draft
Support the parallel conversion from ZeRO checkpoints to FP32/FP16/BF16 param weight
#6655
opened Oct 23, 2024 by xylian86
5 tasks done