fix qwen2 vl failure in intel cpu and xpu #2709

sywangyi · 2024-10-31T05:33:30Z

Hi, I see your guys have added qwen2 vl into tgi, and I try the model in intel cpu, and find it does not work, failure like

data: {"error":"Request failed during generation: Server error: output with shape [1] doesn't match the broadcast shape [873]","error_type":"generation"}

I debug it and find following code is missing in modeling and cause prefill output shape is not as expected.

@drbh @Narsil please help review it

Signed-off-by: Wang, Yi A <[email protected]>

drbh · 2024-10-31T23:19:47Z

hi @sywangyi thank you for sharing this fix! I've included these changes in another PR that includes a couple other tweaks/fixes to qwen2-vl. #2708. Once that PR is merged I'll close this one. Thank you again for the fix! 🙏

sywangyi · 2024-11-01T03:32:19Z

2708 is merged. so close the pr

fix qwen2 failure in intel cpu

3836c3f

Signed-off-by: Wang, Yi A <[email protected]>

sywangyi changed the title ~~fix qwen2 failure in intel cpu~~ fix qwen2 vl failure in intel cpu Oct 31, 2024

sywangyi changed the title ~~fix qwen2 vl failure in intel cpu~~ fix qwen2 vl failure in intel cpu and xpu Oct 31, 2024

sywangyi closed this Nov 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix qwen2 vl failure in intel cpu and xpu #2709

fix qwen2 vl failure in intel cpu and xpu #2709

sywangyi commented Oct 31, 2024 •

edited

Loading

drbh commented Oct 31, 2024 •

edited

Loading

sywangyi commented Nov 1, 2024

fix qwen2 vl failure in intel cpu and xpu #2709

fix qwen2 vl failure in intel cpu and xpu #2709

Conversation

sywangyi commented Oct 31, 2024 • edited Loading

drbh commented Oct 31, 2024 • edited Loading

sywangyi commented Nov 1, 2024

sywangyi commented Oct 31, 2024 •

edited

Loading

drbh commented Oct 31, 2024 •

edited

Loading