Fixing vLLM: Incorrect Generation Results #66
Labels
bug
Something isn't working
enhancement
New feature or request
help wanted
Extra attention is needed
Description
vLLM accelerates generation by 5× on H800, but the output quality degrades significantly.
Observed Issues
Expected Behavior
Possible Causes
The issue is likely in the LM part, not the audio tokenizer or GAN.
Potential causes:
Steps to Reproduce
vllm
branch has been created. @hf-lin will adapt reproducible vLLM inference code based on Hugging Face.Additional Context
See system diagram.
The text was updated successfully, but these errors were encountered: