[ONNX Importer] Incorrect return type for ops from "com.microsoft" domain (!torch.none) #949

pravg-amd · 2025-03-11T08:12:51Z

Some of the onnx zoo models have" QLinearAdd" op which results in return type !torch.none on importing the model.

  %88 = torch.operator "onnx.QLinearAdd"(%87, %43, %42, %46, %47, %48, %45, %44) : (!torch.vtensor<[1,4096],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[4096],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>) -> !torch.none

Steps to reproduce:

Build SHARK-TestSuite by following the steps below

https://github.com/nod-ai/SHARK-TestSuite/tree/main/alt_e2eshark

Run the following command

python run.py -va -t bvlcalexnet-12-int8

The model will be available test-run/bvlcalexnet-12-int8/

bvlcalexnet-12-int8
squeezenet1.0-12-int8
caffenet-12-int8
densenet-12-int8
vgg16-12-int8
mobilenetv2-12-int8

The text was updated successfully, but these errors were encountered:

pravg-amd · 2025-03-11T08:14:53Z

This op is not part of https://onnx.ai/onnx/operators/

doc: https://github.com/microsoft/onnxruntime/blob/main/docs/ContribOperators.md#com.microsoft.QLinearAdd

More information regarding this: onnx/onnx#5895

pravg-amd · 2025-03-13T14:30:15Z

ONNX Runtime ticket to track the support -> microsoft/onnxruntime#24028

pravg-amd · 2025-03-26T06:01:40Z

ORT changes are merged . Working on supporting the op using the ort shape inference

pravg-amd · 2025-04-01T05:57:52Z

QLinearLeakyRelu - Model : yolov3-12-int8

 %673 = torch.operator "onnx.QLinearLeakyRelu"(%671, %43, %42, %41, %40) {torch.onnx.alpha = 1.000000e-01 : f32} : (!torch.vtensor<[?,32,?,?],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>) -> !torch.none

pravg-amd · 2025-04-01T06:06:05Z

QLinearConcat - Model : version-RFB-320-int8

    %358 = torch.operator "onnx.QLinearConcat"(%158, %157, %356, %149, %148, %355, %143, %142, %357, %155, %154) {torch.onnx.axis = 1 : si64} : (!torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[1,16,30,40],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[1,16,30,40],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[1,16,30,40],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>) -> !torch.none

pravg-amd · 2025-04-01T07:47:12Z

onnx.QLinearGlobalAveragePool - Model: squeezenet1.0-12-int8

%238 = torch.operator "onnx.QLinearGlobalAveragePool"(%237, %173, %172, %175, %174) {torch.onnx.channels_last = 0 : si64} : (!torch.vtensor<[1,1000,13,13],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>) -
> !torch.none

pravg-amd · 2025-04-02T10:06:56Z

@vivekkhandelwal1 IR with ort shape inference for QLinearAdd

func.func @test_qlinearadd(%arg0: !torch.vtensor<[1,4096],ui8>, %arg1: !torch.vtensor<[],f32>, %arg2: !torch.vtensor<[],ui8>, %arg3: !torch.vtensor<[4096],ui8>, %arg4: !torch.vtensor<[],f32>, %arg5: !torch.vtensor<[],ui8>, %arg6: !torch.vtensor<[],f32>, %arg7: !torch.vtensor<[],ui8>) -> !torch.vtensor<[1,4096],ui8> attributes {torch.onnx_meta.ir_version = 5 : si64, torch.onnx_meta.opset_version = 10 : si64, torch.onnx_meta.producer_name = "backend-test", torch.onnx_meta.producer_version = ""} {
  %0 = torch.operator "onnx.QLinearAdd"(%arg0, %arg1, %arg2, %arg3, %arg4, %arg5, %arg6, %arg7) : (!torch.vtensor<[1,4096],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[4096],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>) -> !torch.vtensor<[1,4096],ui8>
  return %0 : !torch.vtensor<[1,4096],ui8>
}

vivekkhandelwal1 · 2025-04-07T07:34:52Z

@vivekkhandelwal1 IR with ort shape inference for QLinearAdd

func.func @test_qlinearadd(%arg0: !torch.vtensor<[1,4096],ui8>, %arg1: !torch.vtensor<[],f32>, %arg2: !torch.vtensor<[],ui8>, %arg3: !torch.vtensor<[4096],ui8>, %arg4: !torch.vtensor<[],f32>, %arg5: !torch.vtensor<[],ui8>, %arg6: !torch.vtensor<[],f32>, %arg7: !torch.vtensor<[],ui8>) -> !torch.vtensor<[1,4096],ui8> attributes {torch.onnx_meta.ir_version = 5 : si64, torch.onnx_meta.opset_version = 10 : si64, torch.onnx_meta.producer_name = "backend-test", torch.onnx_meta.producer_version = ""} {
  %0 = torch.operator "onnx.QLinearAdd"(%arg0, %arg1, %arg2, %arg3, %arg4, %arg5, %arg6, %arg7) : (!torch.vtensor<[1,4096],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[4096],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>) -> !torch.vtensor<[1,4096],ui8>
  return %0 : !torch.vtensor<[1,4096],ui8>
}

The lowering for the QLinearAdd op is added here: llvm/torch-mlir#4113

vivekkhandelwal1 · 2025-04-07T07:36:49Z

Hi @pravg-amd, can you please add the repro IRs for other ops?

pravg-amd · 2025-04-07T14:35:57Z

QLinearLeakyRelu

func.func @test_qlinearleakyrelu(%arg0: !torch.vtensor<[?,32,?,?],ui8>, %arg1: !torch.vtensor<[],f32>, %arg2: !torch.vtensor<[],ui8>, %arg3: !torch.vtensor<[],f32>, %arg4: !torch.vtensor<[],ui8>) -> !torch.vtensor<[?,32,?,?],ui8> attributes {torch.onnx_meta.ir_version = 5 : si64, torch.onnx_meta.opset_version = 10 : si64, torch.onnx_meta.producer_name = "backend-test", torch.onnx_meta.producer_version = ""} {
  %0 = torch.operator "onnx.QLinearLeakyRelu"(%arg0, %arg1, %arg2, %arg3, %arg4) {torch.onnx.alpha = 1.000000e-01 : f32} : (!torch.vtensor<[?,32,?,?],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>) -> !torch.vtensor<[?,32,?,?],ui8>
  return %0 : !torch.vtensor<[?,32,?,?],ui8>
}

pravg-amd · 2025-04-07T14:57:32Z

QLinearConcat

func.func @test_qlinearconcat(%arg0: !torch.vtensor<[],f32>, %arg1: !torch.vtensor<[],ui8>, %arg2: !torch.vtensor<[?,?,?,?],ui8>, %arg3: !torch.vtensor<[],f32>, %arg4: !torch.vtensor<[],ui8>, %arg5: !torch.vtensor<[?,?,?,?],ui8>, %arg6: !torch.vtensor<[],f32>, %arg7: !torch.vtensor<[],ui8>) -> !torch.vtensor<[?,?,?,?],ui8> attributes {torch.onnx_meta.ir_version = 5 : si64, torch.onnx_meta.opset_version = 10 : si64, torch.onnx_meta.producer_name = "backend-test", torch.onnx_meta.producer_version = ""} {
  %0 = torch.operator "onnx.QLinearConcat"(%arg0, %arg1, %arg2, %arg3, %arg4, %arg5, %arg6, %arg7) {torch.onnx.axis = 1 : si64} : (!torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[?,?,?,?],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[?,?,?,?],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>) -> !torch.vtensor<[?,?,?,?],ui8>
  return %0 : !torch.vtensor<[?,?,?,?],ui8>
}

pravg-amd · 2025-04-07T15:20:34Z

QLinearGlobalAveragePool

func.func @test_qlinearglobalavgpool(%arg0: !torch.vtensor<[1,1000,13,13],ui8>, %arg1: !torch.vtensor<[],f32>, %arg2: !torch.vtensor<[],ui8>, %arg3: !torch.vtensor<[],f32>, %arg4: !torch.vtensor<[],ui8>) -> !torch.vtensor<[1,1000,1,1],ui8> attributes {torch.onnx_meta.ir_version = 5 : si64, torch.onnx_meta.opset_version = 10 : si64, torch.onnx_meta.producer_name = "backend-test", torch.onnx_meta.producer_version = ""} {
    %0 = torch.operator "onnx.QLinearGlobalAveragePool"(%arg0, %arg1, %arg2, %arg3, %arg4) {torch.onnx.channels_last = 0 : si64} : (!torch.vtensor<[1,1000,13,13],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>) -> !torch.vtensor<[1,1000,1,1],ui8>
    return %0 : !torch.vtensor<[1,1000,1,1],ui8>
}

vivekkhandelwal1 · 2025-04-08T06:23:17Z

QLinearLeakyRelu

func.func @test_qlinearleakyrelu(%arg0: !torch.vtensor<[?,32,?,?],ui8>, %arg1: !torch.vtensor<[],f32>, %arg2: !torch.vtensor<[],ui8>, %arg3: !torch.vtensor<[],f32>, %arg4: !torch.vtensor<[],ui8>) -> !torch.vtensor<[?,32,?,?],ui8> attributes {torch.onnx_meta.ir_version = 5 : si64, torch.onnx_meta.opset_version = 10 : si64, torch.onnx_meta.producer_name = "backend-test", torch.onnx_meta.producer_version = ""} {
  %0 = torch.operator "onnx.QLinearLeakyRelu"(%arg0, %arg1, %arg2, %arg3, %arg4) {torch.onnx.alpha = 1.000000e-01 : f32} : (!torch.vtensor<[?,32,?,?],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>) -> !torch.vtensor<[?,32,?,?],ui8>
  return %0 : !torch.vtensor<[?,32,?,?],ui8>
}

The lowering for the QLinearLeakyRelu op is added here: llvm/torch-mlir#4115.

vivekkhandelwal1 · 2025-04-09T12:02:26Z

The lowering for remaining 2 ops are added here:
llvm/torch-mlir#4116
llvm/torch-mlir#4120

vivekkhandelwal1 · 2025-04-14T03:52:38Z

All the PRs, related to this issue are merged.

pravg-amd · 2025-04-15T05:57:20Z

QLinearSigmoid

func.func @test_qlinear_sigmoid(%arg0: !torch.vtensor<[?,?],ui8>, %arg1: !torch.vtensor<[],f32>, %arg2: !torch.vtensor<[],ui8>, %arg3: !torch.vtensor<[],f32>, %arg4: !torch.vtensor<[],ui8>) -> !torch.vtensor<[?,?],ui8> attributes {torch.onnx_meta.ir_version = 7 : si64, torch.onnx_meta.opset_version = 21 : si64, torch.onnx_meta.opset_versions = {com.microsoft = 1 : si64}, torch.onnx_meta.producer_name = "onnx.quantize", torch.onnx_meta.producer_version = "0.1.0"} {
  %0 = torch.operator "onnx.QLinearSigmoid"(%arg0, %arg1, %arg2, %arg3, %arg4) : (!torch.vtensor<[?,?],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>) -> !torch.vtensor<[?,?],ui8> 
  return %0 : !torch.vtensor<[?,?],ui8>
}

pravg-amd · 2025-04-15T07:51:08Z

FusedMatMul

func.func @test_fusedMatmul(%arg0: !torch.vtensor<[?,12,256,64],f32>, %arg1: !torch.vtensor<[?,12,256,64],f32>) -> !torch.vtensor<[?,12,256,256],f32> attributes {torch.onnx_meta.ir_version = 7 : si64, torch.onnx_meta.opset_version = 21 : si64, torch.onnx_meta.opset_versions = {com.microsoft = 1 : si64}, torch.onnx_meta.producer_name = "onnx.quantize", torch.onnx_meta.producer_version = "0.1.0"} {
    %0 = torch.operator "onnx.FusedMatMul"(%arg0, %arg1) {torch.onnx.alpha = 1.250000e-01 : f32, torch.onnx.transA = 0 : si64, torch.onnx.transB = 1 : si64} : (!torch.vtensor<[?,12,256,64],f32>, !torch.vtensor<[?,12,256,64],f32>) -> !torch.vtensor<[?,12,256,256],f32>
    return %0 : !torch.vtensor<[?,12,256,256],f32>
}

pravg-amd · 2025-04-15T18:06:59Z

QLinearAveragePool

func.func @test_qlinearAveragePool(%arg0: !torch.vtensor<[1,128,56,56],ui8>, %arg1: !torch.vtensor<[],f32>, %arg2: !torch.vtensor<[],ui8>, %arg3: !torch.vtensor<[],f32>, %arg4: !torch.vtensor<[],ui8>) -> !torch.vtensor<[1,128,28,28],ui8> attributes {torch.onnx_meta.ir_version = 5 : si64, torch.onnx_meta.opset_version = 10 : si64, torch.onnx_meta.producer_name = "backend-test", torch.onnx_meta.producer_version = ""} {
    %0 = torch.operator "onnx.QLinearAveragePool"(%arg0, %arg1, %arg2, %arg3, %arg4) {torch.onnx.auto_pad = "NOTSET", torch.onnx.ceil_mode = 0 : si64, torch.onnx.count_include_pad = 0 : si64, torch.onnx.kernel_shape = [2 : si64, 2 : si64], torch.onnx.pads = [0 : si64, 0 : si64, 0 : si64, 0 : si64], torch.onnx.strides = [2 : si64, 2 : si64]} : (!torch.vtensor<[1,128,56,56],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>, !torch.vtensor<[],f32>, !torch.vtensor<[],ui8>) -> !torch.vtensor<[1,128,28,28],ui8>
    return %0 : !torch.vtensor<[1,128,28,28],ui8>
}

pravg-amd self-assigned this Mar 24, 2025

pravg-amd mentioned this issue Apr 1, 2025

[Tracker] All the issue related with ONNX model zoo models #886

Open

pravg-amd changed the title ~~[ONNX Importer] QLinearAdd op issue with incorrect return type (!torch.none)~~ [ONNX Importer] Incorrect return type for ops from "com.microsoft" domain (!torch.none) Apr 1, 2025

pravg-amd assigned vivekkhandelwal1 Apr 2, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ONNX Importer] Incorrect return type for ops from "com.microsoft" domain (!torch.none) #949

[ONNX Importer] Incorrect return type for ops from "com.microsoft" domain (!torch.none) #949

pravg-amd commented Mar 11, 2025

pravg-amd commented Mar 11, 2025

pravg-amd commented Mar 13, 2025

pravg-amd commented Mar 26, 2025

pravg-amd commented Apr 1, 2025 •

edited

Loading

pravg-amd commented Apr 1, 2025

pravg-amd commented Apr 1, 2025

pravg-amd commented Apr 2, 2025

vivekkhandelwal1 commented Apr 7, 2025

vivekkhandelwal1 commented Apr 7, 2025

pravg-amd commented Apr 7, 2025

pravg-amd commented Apr 7, 2025

pravg-amd commented Apr 7, 2025

vivekkhandelwal1 commented Apr 8, 2025

vivekkhandelwal1 commented Apr 9, 2025

vivekkhandelwal1 commented Apr 14, 2025

pravg-amd commented Apr 15, 2025

pravg-amd commented Apr 15, 2025

pravg-amd commented Apr 15, 2025

[ONNX Importer] Incorrect return type for ops from "com.microsoft" domain (!torch.none) #949

[ONNX Importer] Incorrect return type for ops from "com.microsoft" domain (!torch.none) #949

Comments

pravg-amd commented Mar 11, 2025

pravg-amd commented Mar 11, 2025

pravg-amd commented Mar 13, 2025

pravg-amd commented Mar 26, 2025

pravg-amd commented Apr 1, 2025 • edited Loading

pravg-amd commented Apr 1, 2025

pravg-amd commented Apr 1, 2025

pravg-amd commented Apr 2, 2025

vivekkhandelwal1 commented Apr 7, 2025

vivekkhandelwal1 commented Apr 7, 2025

pravg-amd commented Apr 7, 2025

pravg-amd commented Apr 7, 2025

pravg-amd commented Apr 7, 2025

vivekkhandelwal1 commented Apr 8, 2025

vivekkhandelwal1 commented Apr 9, 2025

vivekkhandelwal1 commented Apr 14, 2025

pravg-amd commented Apr 15, 2025

pravg-amd commented Apr 15, 2025

pravg-amd commented Apr 15, 2025

pravg-amd commented Apr 1, 2025 •

edited

Loading