Skip to content

DeepSeek V4 Roadmap #23602

@fzyzcjy

Description

@fzyzcjy

Information

Please temporarily refer to #23600 for details

Example performance

TODO

Roadmap

  • Finish W4A16 support on Hopper
  • Support non-standard extra chat template feature
  • Support pipeline parallel
  • Optimize MegaMoE (implemented, to be tested)
  • Further integrate and optimize various kernels
  • HiCache support
  • TODO

Metadata

Metadata

Assignees

No one assigned

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions