## Information Please temporarily refer to https://github.com/sgl-project/sglang/pull/23600 for details ## Example performance TODO ## Roadmap - [ ] Finish W4A16 support on Hopper - [ ] Support non-standard extra chat template feature - [ ] Support pipeline parallel - [ ] Optimize MegaMoE (implemented, to be tested) - [ ] Further integrate and optimize various kernels - [ ] HiCache support - [ ] TODO
Information
Please temporarily refer to #23600 for details
Example performance
TODO
Roadmap