support deepspeed LinearLayer and LinearAllreduce #698

xin3he · 2025-07-31T03:44:43Z

support tensor-paralleled model from deepspeed

Signed-off-by: Xin He <[email protected]>

Copilot

Pull Request Overview

This PR adds support for DeepSpeed's tensor-parallel linear layers (LinearLayer and LinearAllreduce) to the auto_round quantization library. The changes enable quantization of models that use DeepSpeed's tensor-parallelized layers.

Key changes:

Added DeepSpeed dependency detection and conditional imports
Extended SUPPORTED_LAYER_TYPES to include DeepSpeed linear layers
Implemented specialized forward pass handling for LinearAllreduce layers with all-reduce communication

Reviewed Changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.

File	Description
auto_round/utils.py	Adds DeepSpeed detection, imports, and extends supported layer types
auto_round/wrapper.py	Implements DeepSpeed layer support with specialized forward methods and wrapper enhancements

auto_round/wrapper.py

auto_round/utils.py

Co-authored-by: Copilot <[email protected]>

for more information, see https://pre-commit.ci

Signed-off-by: Xin He <[email protected]>

for more information, see https://pre-commit.ci

support deepspeed LinearLayer and LinearAllreduce

d2a1fb4

Signed-off-by: Xin He <[email protected]>

xin3he requested review from wenhuach21, n1ck-guo and Copilot and removed request for wenhuach21 and n1ck-guo July 31, 2025 03:44

Copilot AI reviewed Jul 31, 2025

View reviewed changes

auto_round/wrapper.py Outdated Show resolved Hide resolved

auto_round/wrapper.py Outdated Show resolved Hide resolved

auto_round/utils.py Outdated Show resolved Hide resolved

xin3he and others added 6 commits July 31, 2025 11:46

Update auto_round/utils.py

5edf3d6

Co-authored-by: Copilot <[email protected]>

Update auto_round/wrapper.py

7fea2d9

Co-authored-by: Copilot <[email protected]>

Merge branch 'main' into xinhe/deepspeed

23bd8a9

[pre-commit.ci] auto fixes from pre-commit.com hooks

5703ff3

for more information, see https://pre-commit.ci

add import

b66f785

Signed-off-by: Xin He <[email protected]>

[pre-commit.ci] auto fixes from pre-commit.com hooks

7246dcd

for more information, see https://pre-commit.ci

wenhuach21 approved these changes Aug 1, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

support deepspeed LinearLayer and LinearAllreduce #698

support deepspeed LinearLayer and LinearAllreduce #698

Uh oh!

xin3he commented Jul 31, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

support deepspeed LinearLayer and LinearAllreduce #698

Are you sure you want to change the base?

support deepspeed LinearLayer and LinearAllreduce #698

Uh oh!

Conversation

xin3he commented Jul 31, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!