Skip to content

Conversation

@chichun-charlie-liu
Copy link
Collaborator

Description of the change

  1. config_quant_smooth_lays() always resets qcfg["qskip_lay_name"] and loses user-defined info,
  2. act_scales didn't check existing files
  3. get_mod_act_scales() and cache_block0() didn't check and cache and use position_embeddings
  4. act_scale is usually saved in higher prec but QLinear didn't preserve dtype after scaling
  5. try to enable device_map="auto" for easier large model inference

Related issue number

How to verify the PR

Was the PR tested

  • I have added >=1 unit test(s) for every new method I have added.
  • I have ensured all unit tests pass

1. config_quant_smooth_lays() always resets qcfg["qskip_lay_name"] and loses user-defined info,
2. act_scales didn't check existing files
3. get_mod_act_scales() and cache_block0() didn't check and cache and use position_embeddings
4. act_scale is usually saved in higher prec but QLinear didn't preserve dtype after scaling
5. try to enable device_map="auto" for easier large model inference

Signed-off-by: cliu-us <[email protected]>
@chichun-charlie-liu chichun-charlie-liu changed the title multiple bug fixes: fix: multiple bug fixes: Feb 24, 2025
@github-actions github-actions bot added the fix label Feb 24, 2025
Signed-off-by: cliu-us <[email protected]>
Copy link
Collaborator

@andrea-fasoli andrea-fasoli left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

looks good to me

@chichun-charlie-liu chichun-charlie-liu merged commit 136c12f into foundation-model-stack:main Feb 25, 2025
12 checks passed
@chichun-charlie-liu chichun-charlie-liu deleted the dq_fix branch February 25, 2025 03:59
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants