Feat (mx): unpadding during dequantization #1134
Conversation
```diff
@@ -28,6 +28,7 @@ def apply_input_view(self, x):
         return x.flatten(start_dim, start_dim + 1)

     def create_quant_tensor(self, qt_args: Tuple[Any]) -> GroupwiseFloatQuantTensor:
+        shape = self.tracked_parameter_list[0].shape
```
We don't support weight quant sharing for groupwise anyway, so this is safe, but it is ugly.
Please check the guards for the optional argument x. I think this can crash under certain circumstances. If there are preconditions that mean this can never occur, maybe add a comment about this.
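A minimal sketch of the guard being requested, assuming an optional input tensor alongside the tracked weight; the class and method names here are hypothetical stand-ins for illustration, not the actual Brevitas code:

```python
from typing import Optional

import torch


class GroupwiseProxySketch:
    """Hypothetical stand-in for the proxy class touched in this PR."""

    def __init__(self, weight: torch.Tensor):
        self.tracked_parameter_list = [weight]

    def resolve_shape(self, x: Optional[torch.Tensor] = None) -> torch.Size:
        # Guard the optional argument: fall back to the tracked weight's
        # shape when no input is provided, instead of crashing on x.shape.
        if x is not None:
            return x.shape
        # Precondition (per the review discussion): weight quant sharing is
        # not supported for groupwise, so tracked_parameter_list holds
        # exactly one entry and indexing [0] is safe.
        return self.tracked_parameter_list[0].shape
```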
Approved once the function signature of create_quant_tensor is updated.
Reason for this PR
Groupwise quantization requires padding when the number of input channels is not divisible by the group size. Padding works well until it doesn't, and there are important edge cases that were not covered by the previous implementation (e.g., weight-only quantization where padding was required: until now, we also had to force activation quantization, because otherwise we hit a shape mismatch).
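To make the divisibility problem concrete, here is a minimal sketch in plain PyTorch (not Brevitas code) of why padding is needed before grouping:

```python
import torch
import torch.nn.functional as F

group_size = 4
w = torch.randn(8, 10)  # (out_channels, in_channels); 10 % 4 != 0

# Pad the channel dimension up to the next multiple of the group size so
# the tensor can be reshaped into (out_channels, n_groups, group_size).
pad = (-w.shape[-1]) % group_size   # 2 extra channels
w_padded = F.pad(w, (0, pad))       # shape (8, 12)
groups = w_padded.view(8, -1, group_size)  # (8, 3, 4): now divisible
```

Those extra padded channels are exactly what leaks into downstream shapes if they are never removed again.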
Changes Made in this PR
With this implementation, we un-pad during dequantization, taking care of all the edge cases above; a minimal sketch of the idea follows.
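A hedged sketch of the un-padding step (names are illustrative, not the Brevitas API): slice the dequantized tensor back to the original channel count, so shapes match downstream even with weight-only quantization:

```python
import torch
import torch.nn.functional as F

def dequantize_unpadded(w_padded: torch.Tensor, orig_channels: int) -> torch.Tensor:
    w_deq = w_padded.float()            # stand-in for the real dequant step
    return w_deq[..., :orig_channels]   # drop the padded channels

w = torch.randn(8, 10)
pad = (-w.shape[-1]) % 4                # pad 10 channels up to 12
w_padded = F.pad(w, (0, pad))
assert dequantize_unpadded(w_padded, w.shape[-1]).shape == w.shape
```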
A few TODOs:
Testing Summary
Risk Highlight
Checklist
Pull request is created against the dev branch.