Treat negative zero as equivalent to positive zero in sm90_sparse_gemm_compressor.hpp #2110

tlrmchlsmth · 2025-02-13T19:21:45Z

This PR handles negative zero in sm90_sparse_gemm_compressor.hpp, treating it as equivalent to positive zero. This is easy to solve by the caller before calling the compressor however it involves an extra pass of reading and writing the uncompressed sparse matrix, and it would be nice to generally remove this as a potential footgun.

Signed-off-by: Tyler Michael Smith <[email protected]>

hwu36 · 2025-02-19T03:21:29Z

@sklevtsov-nvidia , could you please review it?

sklevtsov-nvidia

LGTM, thank you for the contribution!

include/cutlass/transform/kernel/sm90_sparse_gemm_compressor.hpp

sklevtsov-nvidia · 2025-02-20T01:39:03Z

include/cutlass/numeric_type_traits.h

+
+// Default case - no negative zero
+template <typename T>
+struct has_negative_zero : std::false_type {};


@hwu36 to decide if we want a new file for this trait or if it fits in numeric_types.h or elsewhere

Signed-off-by: Tyler Michael Smith <[email protected]>

alexsamardzic · 2025-02-20T21:22:59Z

include/cutlass/numeric_type_traits.h

+struct has_negative_zero : std::false_type {};
+
+// Float types that support negative zero
+// Note that this is false for float8_e4m3_t and float8_e5m2_t


Shouldn't it be true, otherwise how is 0x80 value interpreted for these data types? According to the paper it's -0.0, see table 1.

@alexsamardzic you are correct, we need to add this for all of fp8/fp6/fp4. @tlrmchlsmth feel free to do this if you can, otherwise we can fix this in #2122

Here is a patch to apply on top of this PR, with FP8 and MX data types added (please check that all of them are there, and make sense here), and also minor changes from my #2122 that may be worth adding. For test/unit/transform/device/sm90_sparse_gemm_compressor_legacy.hpp, I don't think it would work with sub-byte data types, but at the moment I don't have access to hardware to test - so I've put there simpler check for negative zero, but also a static assert that would fail if a sub-byte data type used. I hope we can put the patch into this PR, and I'm closing mine now.

patch.txt

ah, you're right - I actually got confused between float8_e4m3fnuz and float8_e4m3fn on this one. I'll apply the patch, thanks!

@alexsamardzic thanks for the patch. I've applied it and it looks good to me. PTAL!

Signed-off-by: Tyler Michael Smith <[email protected]>

Treat negative zero as zero in the sparse gemm compressor

833ab3c

Signed-off-by: Tyler Michael Smith <[email protected]>

sklevtsov-nvidia approved these changes Feb 20, 2025

View reviewed changes

format

e34d8ea

Signed-off-by: Tyler Michael Smith <[email protected]>

alexsamardzic mentioned this pull request Feb 20, 2025

Recognize -0.0 as zero in the sparse compressor #2122

Closed

alexsamardzic reviewed Feb 20, 2025

View reviewed changes

Apply patch

0bcd3a0

Signed-off-by: Tyler Michael Smith <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Treat negative zero as equivalent to positive zero in sm90_sparse_gemm_compressor.hpp #2110

Treat negative zero as equivalent to positive zero in sm90_sparse_gemm_compressor.hpp #2110

tlrmchlsmth commented Feb 13, 2025

hwu36 commented Feb 19, 2025

sklevtsov-nvidia left a comment

sklevtsov-nvidia Feb 20, 2025

alexsamardzic Feb 20, 2025

sklevtsov-nvidia Feb 20, 2025

alexsamardzic Feb 21, 2025

tlrmchlsmth Feb 22, 2025

tlrmchlsmth Feb 22, 2025

Treat negative zero as equivalent to positive zero in sm90_sparse_gemm_compressor.hpp #2110

Are you sure you want to change the base?

Treat negative zero as equivalent to positive zero in sm90_sparse_gemm_compressor.hpp #2110

Conversation

tlrmchlsmth commented Feb 13, 2025

hwu36 commented Feb 19, 2025

sklevtsov-nvidia left a comment

Choose a reason for hiding this comment

sklevtsov-nvidia Feb 20, 2025

Choose a reason for hiding this comment

alexsamardzic Feb 20, 2025

Choose a reason for hiding this comment

sklevtsov-nvidia Feb 20, 2025

Choose a reason for hiding this comment

alexsamardzic Feb 21, 2025

Choose a reason for hiding this comment

tlrmchlsmth Feb 22, 2025

Choose a reason for hiding this comment

tlrmchlsmth Feb 22, 2025

Choose a reason for hiding this comment