[2/N][Refactor][Quantization] clean quantization patch #2785
Conversation
Code Review
This pull request provides a good cleanup by removing the unused quantization patching mechanism. The refactoring simplifies the codebase by deleting the func_wrapper.py file, its corresponding tests, and the complex monkey-patching logic in utils.py. Moving the necessary functionality from the patch directly into the AscendVocabParallelEmbedding class is a solid improvement for maintainability. I have one suggestion to complete the cleanup.
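For context on why this kind of refactor helps, here is a minimal sketch of the before/after pattern the review describes. It is not the actual vllm-ascend code: the class bodies and method signatures are simplified stand-ins, and only the names VocabParallelEmbedding, AscendVocabParallelEmbedding, and func_wrapper.py come from this PR.

```python
# Minimal sketch of the refactor pattern (hypothetical, simplified code;
# the real vLLM / vllm-ascend classes and signatures differ).

class VocabParallelEmbedding:
    """Stand-in for vLLM's base embedding layer."""

    def forward(self, input_ids):
        return f"base lookup of {input_ids}"


# Before: a patch module rebinds the method at import time. The override
# is invisible at the call site and easy to leave stale as upstream changes.
def _patched_forward(self, input_ids):
    return f"patched lookup of {input_ids}"

# VocabParallelEmbedding.forward = _patched_forward  # the old monkey patch


# After: the behavior lives on an explicit subclass, so it is visible,
# independently testable, and scoped to the Ascend backend.
class AscendVocabParallelEmbedding(VocabParallelEmbedding):
    def forward(self, input_ids):
        # logic moved here from the deleted func_wrapper.py-style patch
        return f"ascend lookup of {input_ids}"


layer = AscendVocabParallelEmbedding()
print(layer.forward([1, 2, 3]))  # -> ascend lookup of [1, 2, 3]
```

The subclass version also lets the dead patch module and its tests be deleted outright, which is where the -241 lines in the coverage report below come from.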
👋 Hi! Thank you for contributing to the vLLM Ascend project. The following points will speed up your PR merge: if CI fails, you can run the linting and testing checks locally according to Contributing and Testing.
Codecov Report

✅ All modified and coverable lines are covered by tests.

```diff
@@            Coverage Diff             @@
##             main    #2785      +/-   ##
==========================================
+ Coverage   72.61%   72.82%   +0.20%
==========================================
  Files         154      152       -2
  Lines       21318    21077     -241
==========================================
- Hits        15481    15349     -132
+ Misses       5837     5728     -109
==========================================
```
Signed-off-by: 22dimensions <[email protected]>

Force-pushed from 397fcb5 to b052105.
### What this PR does / why we need it?

The quantization patch is unused code, so this PR removes it.

### Does this PR introduce _any_ user-facing change?

No.

### How was this patch tested?

Tested by CI.

- vLLM version: v0.10.1.1
- vLLM main: vllm-project/vllm@f4962a6