forked from vllm-project/vllm
Issues: ROCm/vllm
Issues list
[Bug]: Running Llama-2-70b inference on MI300x getting OOM
bug
Something isn't working
#397 opened Jan 31, 2025 by PurvangL
1 task done
[Bug]: Multi-GPU AMD Setup Hangs without NCCL_P2P_DISABLE=1 and --disable-custom-all-reduce
bug
Something isn't working
Under Investigation
#390 opened Jan 27, 2025 by taddeusb90
1 task done
[Bug]: Unable to load deepseek r1 on 8 x AMD MI300X AssertionError: FP8 weight padding is not supported in block quantization
bug
Something isn't working
#375 opened Jan 21, 2025 by samos123
1 task done