Popular repositories
- Megatron-LM (Public, forked from lhb8125/Megatron-LM): Ongoing research training transformer models at scale. Language: Python
- TransformerEngine (Public, forked from NVIDIA/TransformerEngine): A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper, Ada and Blackwell GPUs, to provide better performance with lower memory… Language: Python
- Megatron-MoE-ModelZoo (Public, forked from yanring/Megatron-MoE-ModelZoo): Best practices for training DeepSeek, Mixtral, Qwen and other MoE models using Megatron Core. Language: Python
- Megatron-Bridge (Public, forked from NVIDIA-NeMo/Megatron-Bridge): Training library for Megatron-based models. Language: Python

