Skip to content

Conversation

@LucasLLC
Copy link
Owner

@LucasLLC LucasLLC commented Dec 14, 2023

FSDP

~/torchsnapshot/benchmarks/fsdp (main)]$ torchrun --nproc-per-node=2 main.py
7.25GB Transformer model

TSS

2.2 seconds

torch.save

2.08 seconds

torch.DCP

14.59 seconds

DDP

~/torchsnapshot/benchmarks/dcp (main)]$ torchrun --nproc-per-node=2 main.py

Model size: 20.0 GB

TSS

3.52

torch.save

33.69

DCP

28.21

@LucasLLC LucasLLC self-assigned this Dec 14, 2023
@LucasLLC LucasLLC changed the title DDP + FSDP DDP + FSDP [WIP] Dec 14, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants