Merge with fixes of 087fea06 (19) #248

mgehre-amd · 2024-08-20T15:55:35Z

No description provided.

1. onnx.MatMulInteger now converts to aten.matmul instead of aten.mm 2. aten.matmul, for ranks >=2, now allows quantized inputs and will lower to linalg::quantized_matmul or linalg::quantized_batch_matmul. 3. added AtenMatmulOp to the FuseQuantizeOps rewrite patters QuantizeOperands, QuantizeTransposedOperands, and QuantizeAccumulator 4. added several tests, including some to test AtenMmOp with varying quantization signed-ness. 5. a quantized matmul mat-vec test is added to verify the failure to lower to linalg; cleaned of out-of-date code related to common torch-mlir lowering xfails. 6. in debugging a real model with quantized matmuls, I found a bug on the scalarize-shapes pass which resulted from the aten.full op folder returning an incompatible result type. This is fixed by the small change here to [lib/Dialect/Torch/IR/TorchOps.cpp](https://github.com/llvm/torch-mlir/compare/main...zjgarvey:torch-mlir:MatMulIntegerFix?expand=1#diff-dc8ed165c207918e606490eee3984b1ad51d7034e6aac36fc046bf47f6f03f4f).

@export

To pass test "MatmulStaticBroadcast_basic" in stablehlo: ```python class MatmulStaticBroadcast(torch.nn.Module): def __init__(self): super().__init__() @export @annotate_args([ None, ([4, 1, 6, 7], torch.float32, True), ([8, 1, 5, 7, 6], torch.float32, True), ]) def forward(self, lhs, rhs): return torch.matmul(lhs, rhs) @register_test_case(module_factory=lambda: MatmulStaticBroadcast()) def MatmulStaticBroadcast_basic(module, tu: TestUtils): module.forward(tu.rand(4, 1, 6, 7), tu.rand(8, 1, 5, 7, 6)) ```

Replace the torchdynamo e2e with the fx_importer e2e

…and schema type (llvm#3163)

Adds OnnxToTorch Lowering for the ReduceL1 op.

…vm#3167) The new cases added for quantized matmuls are: 1. vec-vec 2. vec-mat 3. mat-vec each of which are now lowered to expand(s), quantized_matmul, and collapse.

Remove the `kwarg_only` limitation, for example ``` torch.add(x, 3.0, alpha=2) ``` compiled to ``` %0 = torch.aten.add.Scalar %arg0, %float3.000000e00, %int1 ``` fix to ``` %0 = torch.aten.add.Scalar %arg0, %float3.000000e00, %int2 ```

By canonicalize Aten_CastLongOp into AtenToDtypeOp

…llvm#3171) Align corner modes which select what the corners mean. Either the center of the corner points or the edges of the edge points. --------- Co-authored-by: Rob Suderman <[email protected]>

) weights and biases and other model parameters appear as a separate data structure to the traced graph, but are needed when running the MLIR compiled code; this PR implements that extended functionality

@export

Decomposition RepeatInterleaveSelfInt with following ops: ```python def my_repeat_interleave(input, repeats, dim=None): if dim is None: # Flatten the input and then repeat return input.flatten().unsqueeze(-1).tile((1, repeats)).flatten() else: # Calculate the shape after repeat expanded_shape = list(input.shape) expanded_shape[dim] *= repeats # Repeat the tensor along the specified dimension repeat_shape = [1] * (input.dim() + 1) repeat_shape[dim + 1] = repeats input = input.unsqueeze(-1) # Tile and then reshape tiled = torch.tile(input, repeat_shape) # Rearrange and reshape repeated = tiled.reshape(*expanded_shape) return repeated ``` I passed the tests of stablehlo and linalg. When testing onnx, strange things happened. In torch-mlir's CI **torch_nightly** and my own environment(torch==2.4.0.dev20240318+cpu), it can **pass the pass**. In torch-mlir's CI **torch_stable**, it **failed**. The test case is `RepeatInterleaveSelfIntNoDimModule_basic`, the result shape should be [120]. ```python class RepeatInterleaveSelfIntNoDimModule(torch.nn.Module): def __init__(self): super().__init__() @export @annotate_args([ None, ([3, 4, 5], torch.float32, True), ]) def forward(self, x): return x.repeat_interleave(2) @register_test_case(module_factory=lambda: RepeatInterleaveSelfIntNoDimModule()) def RepeatInterleaveSelfIntNoDimModule_basic(module, tu: TestUtils): module.forward(tu.rand(3, 4, 5)) ``` The error log is as follows: ``` Unexpected outcome summary: (onnx) ****** Failed tests - 1 tests FAIL - "RepeatInterleaveSelfIntNoDimModule_basic" @ trace item #0 - call to "forward" @ output of call to "forward" ERROR: shape (torch.Size([6, 4, 5])) is not equal to golden shape (torch.Size([120])) ``` @rsuderman Would you please help me check what's wrong with my PR? Thanks a lot.

Set PyTorch and TorchVision version to nightly release 2024-04-16. Signed-Off By: Vivek Khandelwal <[email protected]>

Need to perform an expand in the case where the indices is rank-0.

We can map to `tensor.reshape` for handling multiple output dynamic shapes. Later we can perform a more complex analysis for indentifying expand/collapse cases from the tensor.reshape. Initially we planned to handle this identification at the `torch` level however it will be easier to handle once converted to core mlir-dialects.

Reclassifying what the source of failures are for various bugs so we can reprioritize what failures are common.

The FX importer will pass static shapes to the Torch dialect, so it needs to generate a StableHLO that satisfies shape inference.

See unit test below: ``` // CHECK-LABEL: func.func @torch.aten.tensor.float( // CHECK-NEXT: torch.vtensor.literal(dense<1.000000e+01> : tensor<f32>) : !torch.vtensor<[],f32> func.func @torch.aten.tensor.float() -> !torch.vtensor<[],f32> { %none = torch.constant.none %false = torch.constant.bool false %float1.000000e01 = torch.constant.float 1.000000e+01 %67 = torch.aten.tensor.float %float1.000000e01, %none, %none, %false : !torch.float, !torch.none, !torch.none, !torch.bool -> !torch.vtensor<[],f32> return %67 : !torch.vtensor<[],f32> } // CHECK-LABEL: func.func @torch.aten.tensor.int( // CHECK-NEXT: torch.vtensor.literal(dense<45> : tensor<si32>) : !torch.vtensor<[],si32> func.func @torch.aten.tensor.int() -> !torch.vtensor<[],si32> { %none = torch.constant.none %false = torch.constant.bool false %int45 = torch.constant.int 45 %67 = torch.aten.tensor.int %int45, %none, %none, %false : !torch.int, !torch.none, !torch.none, !torch.bool -> !torch.vtensor<[],si32> return %67 : !torch.vtensor<[],si32> } ```

Need to perform a bool cast to support `onnx.Not` on non-bool inputs.

Previous implementation erroneously mixed up num_outputs with slice_size. New version correctly computs the slice size and directly performs slicing rather than leveraging `aten.split.tensor`. This is due to `onnx` supporting a fixed number of splits making the size computation more easily computeable when lowering to `aten` rather than deferring to `aten.split.tensor`. --------- Co-authored-by: Robert Suderman <[email protected]>

Version number was set too high. Lowered to support more cases allows more tests to pass. Co-authored-by: Robert Suderman <[email protected]>

…ze op (llvm#2991) This commit also cleans up the OnnxToTorch lowering for the Squeeze and Unsqueeze op and adds the support for handling edge cases. Signed-Off By: Vivek Khandelwal <[email protected]>

This commit adds the OnnxToTorch lowering for Onnx's RandomNormal, RandomNormalLike, RandomUniform, and RandomUniformLike op.

Like llvm#3130, gradually replace the deprecated code https://github.com/llvm/mlir-www/blob/main/website/content/deprecation/_index.md#deprecated

This is part 1 of ~3, formatting all miscellaneous text files and CPP files matched by a first run of pre-commit. These tend to be low change-traffic and are likely not disruptive. Subsequent patches will format Python files and remaining CPP files.

) This is a large change because prior to this point, Python files in the project were not consistently formatted. This reformats them all with black defaults. Based on experience with prior projects, if you have a dev/long-term branch with Python patches, you can minimize merge conflicts prior to rebasing to include this commit by running `black` on your modified Python files, squashing, and then rebasing/merging.

The pre-commit hook will only run on changed files, whereas this runs on push and will check everything.

…3246)

CELU(x)=max(0,x)+min(0,α∗(exp(x/α)−1))

)

* Update black version to support 3.11/3.12 * Reformat code

…lvm#3221) Signed-Off By: Vivek Khandelwal <[email protected]>

…lvm#3230)

This scenario was uncovered in a downstream test that failed with a previous snapshot of torch-mlir. See https://github.com/cruise-automation/mlir-tcp/actions/runs/8605480116/job/23581829102?pr=65. ``` File "/home/runner/.cache/bazel/_bazel_runner/ce288f117ee4ca92dc028a6a28476a3d/sandbox/processwrapper-sandbox/2380/execroot/mlir-tcp/bazel-out/k8-opt-exec-2B5CBBC6/bin/test/AotCompile/broadcast_unit_dim_to_dynamic_with_unchanged_dim_dynamic_torch_exporter.runfiles/pip_deps_torch_mlir/site-packages/torch_mlir/extras/fx_importer.py", line 969, in value_info_to_type raise NotImplementedError( NotImplementedError: Could not deduce type from value info: tensor_meta=None, val=s1, sparsity=None ``` It seems to have resolved on current HEAD. Adding this test to ensure coverage in the future.

Set PyTorch and TorchVision version to nightly release 2024-04-28. Signed-Off By: Vivek Khandelwal <[email protected]>

zjgarvey and others added 30 commits April 15, 2024 16:06

[CI] Enable the tests for fx_importer in the CI (llvm#3168)

10b6062

Replace the torchdynamo e2e with the fx_importer e2e

[FxImporter] Type conversion to resolve the mismatch between Py type …

af5509c

…and schema type (llvm#3163)

[MLIR][TORCH] Add OnnxToTorch lowering for ReduceL1 Op (llvm#3146)

a0232e9

Adds OnnxToTorch Lowering for the ReduceL1 op.

[TorchToLinalg] Adds Support for Remaining Quantized Matmul Cases (ll…

7a1ad0d

…vm#3167) The new cases added for quantized matmuls are: 1. vec-vec 2. vec-mat 3. mat-vec each of which are now lowered to expand(s), quantized_matmul, and collapse.

[FxImporter] Fix fx importer test config and clean xfail set (llvm#3176)

e4b11a0

[Torch] Support Aten_CastLongOp. (llvm#3160)

d2ba956

By canonicalize Aten_CastLongOp into AtenToDtypeOp

[FxImporter] Replace local_scalar_dense in fx_importer (llvm#3180)

3aa81f7

[onnx][torch][linalg] Implementing align-corner modes for gridsampler (…

b66eabd

…llvm#3171) Align corner modes which select what the corners mean. Either the center of the corner points or the edges of the edge points. --------- Co-authored-by: Rob Suderman <[email protected]>

[torch-mlir][sparse] pre-pend named buffers to parameter list (llvm#3178

491f482

) weights and biases and other model parameters appear as a separate data structure to the traced graph, but are needed when running the MLIR compiled code; this PR implements that extended functionality

build: manually update PyTorch version (llvm#3170)

6e5630d

Set PyTorch and TorchVision version to nightly release 2024-04-16. Signed-Off By: Vivek Khandelwal <[email protected]>

[torch] Support rank-0 index for torch index select (llvm#3182)

4c21e20

Need to perform an expand in the case where the indices is rank-0.

[onnx] Update the failure triage for onnx (llvm#3186)

be742a9

Reclassifying what the source of failures are for various bugs so we can reprioritize what failures are common.

[stablehlo] add aten.clamp.Tensor op conversion support (llvm#3185)

6c4f7de

[FxImporter] Add fx importer to stablehlo e2e test config (llvm#3183)

0a60734

[StableHLO] Fix aten.clamp.Tensor in FxImporter2StableHLO (llvm#3190)

5a98c72

The FX importer will pass static shapes to the Torch dialect, so it needs to generate a StableHLO that satisfies shape inference.

[onnx] Fix onnx.Not for non-bool inputs (llvm#3187)

b01245c

Need to perform a bool cast to support `onnx.Not` on non-bool inputs.

[stablehlo] add aten.remainder.Tensor op conversion support (llvm#3197)

ea0ecb6

[stablehlo] add aten.fmod.Tensor op conversion support (llvm#3198)

b6b0160

[onnx] Extend op version number of onnx.ScatterElements (llvm#3195)

8222637

Version number was set too high. Lowered to support more cases allows more tests to pass. Co-authored-by: Robert Suderman <[email protected]>

[stablehlo] add aten.expm1 op conversion support (llvm#3199)

a60e84e

[Torch] Emit and decompose prims.iota op (llvm#3132)

e5bdd71

[MLIR][TORCH] Fix OnnxToLinalg lowering issue for Squeeze and Unsquee…

6abc737

…ze op (llvm#2991) This commit also cleans up the OnnxToTorch lowering for the Squeeze and Unsqueeze op and adds the support for handling edge cases. Signed-Off By: Vivek Khandelwal <[email protected]>

[onnx] Add onnx-to-torch lowering for random ops (llvm#3193)

3c252cd

This commit adds the OnnxToTorch lowering for Onnx's RandomNormal, RandomNormalLike, RandomUniform, and RandomUniformLike op.

penguin-wwy and others added 13 commits April 27, 2024 14:00

Fix deprecated uses of cast/dyn_cast/dyn_cast_or_null/isa (llvm#3243)

6679728

Like llvm#3130, gradually replace the deprecated code https://github.com/llvm/mlir-www/blob/main/website/content/deprecation/_index.md#deprecated

Enable post commit run of pre-commit hooks over all files. (llvm#3245)

a339d7b

The pre-commit hook will only run on changed files, whereas this runs on push and will check everything.

[Torch] emit aten.log_sigmoid and decompose it to log(sigmoid) (llvm#…

46c0f3c

…3246)

[Torch] emit aten.celu and decompose it (llvm#3247)

5684dc0

CELU(x)=max(0,x)+min(0,α∗(exp(x/α)−1))

[FxImporter] Synchronize the collection of symbolic torch ops (llvm#3236

9f64748

)

[Torch] emit aten.__contains__.str_list and add folder (llvm#3249)

aed2cf3

[NFC] Update black version (llvm#3256)

b218519

* Update black version to support 3.11/3.12 * Reformat code

[ONNX] Fix Onnx.Selu lowering and canonicalizer for IntImplicit op (l…

b1e2241

…lvm#3221) Signed-Off By: Vivek Khandelwal <[email protected]>

[stablehlo] Support PrimsCollapseOp and PrimsSplitDimOp in stablehlo (l…

0a5ff68

…lvm#3230)

build: manually update PyTorch version (llvm#3257)

087fea0

Set PyTorch and TorchVision version to nightly release 2024-04-28. Signed-Off By: Vivek Khandelwal <[email protected]>

mgehre-amd force-pushed the bump_to_087fea06 branch from 36973de to f3849d8 Compare August 20, 2024 15:58

mgehre-amd changed the title ~~Merge with fixes of 5708ee7e (18)~~ Merge with fixes of 087fea06 (19) Aug 20, 2024

mgehre-amd force-pushed the bump_to_087fea06 branch 2 times, most recently from 5bc4220 to 7e5a35d Compare August 20, 2024 16:04

mgehre-amd changed the base branch from bump_to_83cba8c6 to bump_to_5708ee7e August 20, 2024 16:05

Merge commit '087fea06' into bump_to_087fea06

e344453

mgehre-amd force-pushed the bump_to_087fea06 branch from 7e5a35d to e344453 Compare August 20, 2024 16:06

mgehre-amd added 3 commits August 21, 2024 15:09

Fixes

9c7e3b8

Fix xfail

aeaceb7

Update xfail

f3e53f2

mgehre-amd requested a review from cferry-AMD August 21, 2024 20:27

Update xfail

70e6f39

Base automatically changed from bump_to_5708ee7e to bump_to_197ef422 August 22, 2024 06:12

cferry-AMD approved these changes Aug 22, 2024

View reviewed changes

Base automatically changed from bump_to_197ef422 to feature/backport_ea1_ops August 22, 2024 07:17

mgehre-amd merged commit 6317302 into feature/backport_ea1_ops Aug 22, 2024
4 checks passed

mgehre-amd deleted the bump_to_087fea06 branch August 22, 2024 15:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Merge with fixes of 087fea06 (19) #248

Merge with fixes of 087fea06 (19) #248

mgehre-amd commented Aug 20, 2024

Merge with fixes of 087fea06 (19) #248

Merge with fixes of 087fea06 (19) #248

Conversation

mgehre-amd commented Aug 20, 2024