deepspeedai / DeepSpeed Public

Pull requests: deepspeedai/DeepSpeed

112 Open 3,203 Closed

Pull requests list

Improve Ulysses Plus Docs

#7335 opened Jun 4, 2025 by cynricfu

Update config_utils.py

#7333 opened Jun 3, 2025 by qgallouedec

fp16 optimizer timers fix - TypeError: 'NoneType' object is not callable

#7330 opened Jun 3, 2025 by rraminen

Unpin and fix issues with latest pytest 8.4.0

#7327 opened Jun 2, 2025 by loadams

Create COMMITTERS_RESPONSIBILITY.md

#7300 opened May 21, 2025 by PKUWZP

docs: fix Adam paper link and correct grammatical error in fused_adam.py

#7294 opened May 19, 2025 by ishanjmukherjee

Modal CI

#7289 opened May 16, 2025 by sfc-gh-truwase

Update to use torch2.7 for nv-torch-latest

#7273 opened May 8, 2025 by loadams

set device_id in torch's init_process_group

#7266 opened Apr 30, 2025 by stas00 • Draft

Fix issue with symint input

#7243 opened Apr 24, 2025 by tohtana

DeepNVMe update

#7215 opened Apr 12, 2025 by tjruwase

HF2UCP: Converting a pytorch_model.bin or .safetensors checkpoint to UCP

#7212 opened Apr 10, 2025 by Schwidola0607

gather output layout support for column parallel

#7181 opened Mar 28, 2025 by inkcherry

Fix pre-compile on cpu-only machines

#7168 opened Mar 22, 2025 by AlongWY

Add DataStates-LLM: Asynchronous Checkpointing Engine Support

#7166 opened Mar 21, 2025 by mauryaavinash95

fixed: Modified the topkgating function and modified the test_moe file for testing

#7163 opened Mar 21, 2025 by xiongjyu

[bugfix] update results of state_dict loading, embedding resizing to secondary partitions (hpz)

#7130 opened Mar 11, 2025 by cyr0930

[Draft] Add support for seq split in Domino

#7111 opened Mar 4, 2025 by duanhx1037 • Draft

Use transformers latest on v100 tests

#7088 opened Feb 27, 2025 by loadams

Update Domino for Llama3

#7084 opened Feb 26, 2025 by shenzheyu

Fix, pipeline model with moe cause error when send grad

#7055 opened Feb 19, 2025 by wukong1992

Add pyproject.toml with legacy build backend to keep most logic in setup.py

#7033 opened Feb 13, 2025 by loadams

4 of 5 tasks

Enable python 3.11 and 3.12 tests

#7007 opened Feb 6, 2025 by loadams

Enable torch.autocast with ZeRO

#6993 opened Feb 3, 2025 by tohtana

Improve overflow handling in ZeRO

#6976 opened Jan 28, 2025 by tjruwase

4 of 6 tasks
