
Conversation

@andrea-fasoli
Collaborator

Contributes the first unit test for INT8, plus cleanup.
Quantization configuration tested: per-tensor weights, per-tensor activations, no SmoothQuant.

Related issue number

n/a

Was the PR tested

  • I have added >=1 unit test(s) for every new method I have added.
  • I have ensured all unit tests pass.

Signed-off-by: Andrea Fasoli <[email protected]>
@chichun-charlie-liu chichun-charlie-liu changed the title Unit test int8 test: Unit test int8 Feb 14, 2025
@github-actions github-actions bot added the test label Feb 14, 2025
  scale_x = 127 / a_cv
- x_int = torch.round(x / sq * scale_x).clamp(-127, 127)
+ x_int = torch.round(x / sq * scale_x).clamp(-127, 127).to(torch.int8)
  return x_int / scale_x * sq
Collaborator


Is this type cast really necessary? The next line divides by a float, which should automatically upcast the result again.

Collaborator Author


It is not needed. I added the cast during debugging, while chasing numerical discrepancies between this function's output and a reference output. I will remove it.
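As a quick sanity check of that point, here is a minimal PyTorch sketch (the tensor values are made up; only the dtypes matter) showing that true division of an int8 tensor by a Python float promotes the result back to float32, so the explicit cast is indeed undone on the next line:

```python
import torch

# Hypothetical int8 values standing in for the quantized activations.
x_int = torch.tensor([-5, 0, 7], dtype=torch.int8)
scale_x = 0.5  # a Python float scale

# True division of an integer tensor by a float scalar promotes the
# result to the default floating dtype (float32), discarding the
# effect of any earlier .to(torch.int8) cast.
y = x_int / scale_x
print(y.dtype)  # torch.float32
```

So the cast only matters if the integer representation itself needs to be materialized (e.g. handed to an integer kernel), not for the dequantize-on-the-next-line pattern here.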

@andrea-fasoli
Collaborator Author

The purpose of this unit test is to compare the output of the custom INT8 op for AIU against a reference, to ensure the operation remains correct if it is altered in the future.
The operation consists of:

  1. unpacking the qdata tensor containing all quantization metadata
  2. dequantizing the integer weights
  3. quantizing and dequantizing the input activations
  4. a matmul between the dequantized weights and the dequantized activations

However, I found the output of this operation to be very sensitive to the quantization process: even changing the order of divisions and multiplications (nominally equivalent, but different in practice due to the precision used) caused the test to fail. I am not sure yet how to set a meaningful threshold for passing this test.
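For reference, steps 2-4 could be sketched roughly as below. This is an assumption-laden sketch, not the actual op: the function and variable names are made up, the qdata unpacking of step 1 is elided, and per-tensor symmetric quantization is assumed throughout.

```python
import torch

def fake_quant_matmul_ref(x, w_int, w_scale, a_cv, sq):
    """Hypothetical reference for the fake-quantized matmul (names assumed).

    x      -- float activations, shape (batch, in_features)
    w_int  -- int8 weights, shape (out_features, in_features)
    w_scale -- per-tensor weight scale (float)
    a_cv   -- activation clip value (per-tensor)
    sq     -- smoothquant-style per-channel scaling (ones if disabled)
    """
    # Step 2: dequantize integer weights.
    w_dq = w_int.to(torch.float32) * w_scale

    # Step 3: quantize-dequantize activations, per tensor.
    scale_x = 127 / a_cv
    x_int = torch.round(x / sq * scale_x).clamp(-127, 127)
    x_dq = x_int / scale_x * sq

    # Step 4: matmul of dequantized activations and weights.
    return x_dq @ w_dq.t()
```

On the tolerance question: since reordering nominally equivalent float ops perturbs the result, one option is to compare against the custom op with `torch.testing.assert_close(out, ref, rtol=..., atol=...)` and pick tolerances empirically from the observed spread across reorderings, rather than requiring exact equality.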

Signed-off-by: chichun-charlie-liu <[email protected]>
@chichun-charlie-liu chichun-charlie-liu merged commit 02f5ff3 into foundation-model-stack:main Feb 19, 2025
11 checks passed
