This repository has been archived by the owner on Aug 7, 2024. It is now read-only.

Add more compile compatibility for Float8Tensor ops #285

Closed
wants to merge 9 commits

Conversation

@ani300 (Contributor) commented Jun 14, 2024

No description provided.

@facebook-github-bot added the CLA Signed label (this label is managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed) Jun 14, 2024
@drisspg (Contributor) commented Jun 16, 2024

We have been pretty targeted with the ops we support for Float8Tensor, so I am curious if you have any concrete use cases for these ops. Could you also add some test cases? Otherwise, thanks for the contributions. I am surprised that the constructor isn't working correctly and would also love a test case there!

@drisspg self-requested a review Jun 16, 2024
@ani300 (Contributor, Author) commented Jun 18, 2024

Yes, most of these ops are due to using the Float8Tensor to handle an FP8 kv-cache. The example usage for all of these will be in https://github.com/pytorch-labs/fp468-llm today or tomorrow at the latest. I'll add the tests for both the ops and the constructor. The issue with the constructor was that it didn't matter what the original dtype was; it always returned fp32.
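For context, a minimal sketch of the intended conversion back to high precision, assuming a hypothetical from_float8 helper (the names here are illustrative, not the repo's exact API):

```python
import torch

def from_float8(data: torch.Tensor, scale: torch.Tensor,
                orig_dtype: torch.dtype) -> torch.Tensor:
    # Unscale in fp32 for accuracy, then cast back to the dtype the
    # tensor was constructed from. The reported bug is that this last
    # cast was missing, so the result was always fp32.
    return (data.to(torch.float32) / scale).to(orig_dtype)
```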

@ani300 (Contributor, Author) commented Jun 20, 2024

@drisspg As I'm writing the unit tests, I'm thinking about what a correct copy_ operation looks like. If we try to copy an FP32/FP16/BF16 tensor into an FP8 one, should we do some scaling if the Float8Tensor has it? And what does the opposite operation look like, say copying an FP8 tensor with scales into an FP32/FP16 one? Should we unscale through the FromFloat8Constructor?

@ani300 changed the title from "Add more compile compatibility for Float8Tensor ops and fix FromFloat8Construct to return original dtype" to "Add more compile compatibility for Float8Tensor ops" Jun 20, 2024
@drisspg (Contributor) commented Jun 20, 2024

@ani300 Great questions

For a copy from scaled_fp8 to hp_type, I think we should unscale and copy in.

For a copy from hp to an fp8 tensor, I think the semantics are a little hazier. Do you have a clear need for this operation? Otherwise I would potentially ban it for now. Some options:

  • Calculate new scales and copy in both the data and the new scales; probably my first choice
  • Create a temporary FP8 tensor using the scales that exist on the LHS, and then copy in the data
  • Just directly convert the existing tensor to a float8_e* type, and only copy in the data

I actually recently thought about a related problem when adding copy_ dispatch to NF4Tensor. That was to enable Subclass -> Subclass copy_. The most reasonable semantic I could come up with is to use the high-precision dtype as the intermediary for the conversion: pytorch/ao#45
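A rough sketch of that semantic; to_original_precision as the dequantize step and _data/_scale as the subclass's payload fields are assumed names, not necessarily the exact API:

```python
import torch

def subclass_to_subclass_copy_(dst, src):
    # Round-trip through the high-precision dtype so that dst keeps its
    # own scale/metadata instead of inheriting src's quantized payload.
    src_hp = src.to_original_precision()  # dequantize the source
    # Requantize with dst's scale (assumed convention: fp8_data = hp * scale).
    dst._data.copy_((src_hp * dst._scale).to(dst._data.dtype))
    return dst
```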

@vkuzo any strong thoughts on the semantics here?

@vkuzo (Contributor) commented Jun 20, 2024

If we try to copy an FP32/FP16/BF16 tensor into an FP8 one, should we do some scaling if the Float8Tensor has it? Or what does the opposite operation look like as well?

IMO:

  • If the user wants to use copy_ to copy bf16 to float8, the copy is done with a direct cast, without scaling, and the user gets back a torch.Tensor with dtype float8_... and not a Float8Tensor. If the user actually wants scaling, there should be some wrapper code which scales the data with whichever scaling strategy is relevant (per-tensor/row/group/block, dynamic/delayed/static, etc). I'm not a fan of defining copy_ with an assumed scaling strategy which returns a Float8Tensor, because of the ambiguity of the scaling details.
  • If the user wants to use copy_ to copy a Float8Tensor to a bf16 tensor, I think it's fine to unscale and copy, as there is no ambiguity. (Both directions are sketched below.)
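A hedged sketch of those two semantics; _data and _scale are assumed field names for the Float8Tensor payload:

```python
import torch

def copy_hp_to_fp8_(dst: torch.Tensor, src: torch.Tensor) -> torch.Tensor:
    # bf16 -> float8: direct cast, no scaling. dst stays a plain
    # torch.Tensor with a float8_* dtype; any scaling strategy belongs
    # in wrapper code, not in copy_ itself.
    return dst.copy_(src.to(dst.dtype))

def copy_fp8_to_hp_(dst: torch.Tensor, src) -> torch.Tensor:
    # Float8Tensor -> bf16: unambiguous, so unscale and copy
    # (assumed convention: fp8_data = hp * scale).
    return dst.copy_(src._data.to(dst.dtype) / src._scale)
```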

@ani300 (Contributor, Author) commented Jun 20, 2024

Thanks @drisspg and @vkuzo for your comments and opinions! I'll implement the FP8 -> BF16 copy (which is the one I'm using anyway), add the Float8Tensor -> Float8Tensor copy when everything is equal (scale, mm_config, etc.), and ban everything else. A sketch of that dispatch follows.
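A minimal sketch of that dispatch plan; the internals (_data, _scale, _mm_config) and the import path are assumptions, not the exact implementation:

```python
import torch
from float8_experimental.float8_tensor import Float8Tensor  # assumed path

def float8_copy_(dst, src):
    src_f8 = isinstance(src, Float8Tensor)
    dst_f8 = isinstance(dst, Float8Tensor)
    if src_f8 and not dst_f8:
        # FP8 -> FP32/FP16/BF16: unscale and copy into the plain tensor.
        return dst.copy_(src._data.to(dst.dtype) / src._scale)
    if src_f8 and dst_f8:
        # Only allowed when the two subclasses are interchangeable.
        assert torch.equal(dst._scale, src._scale)
        assert dst._mm_config == src._mm_config
        dst._data.copy_(src._data)
        return dst
    # hp -> Float8Tensor is banned per the discussion above.
    raise RuntimeError("copy_ from a plain tensor into a Float8Tensor is not supported")
```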

@ani300 (Contributor, Author) commented Jun 20, 2024

For the failing unit test, I'm waiting on this fix to land first, since PyTorch CI currently fails to run at all: pytorch/pytorch#128758

@vkuzo (Contributor) left a review comment:


awesome!

Review threads on float8_experimental/float8_ops.py (outdated, resolved)
@drisspg (Contributor) commented Jun 25, 2024

@ani300 The failing CI is because we are still using last night's nightly.

@facebook-github-bot (Contributor) commented:

@drisspg has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@facebook-github-bot (Contributor) commented:

@drisspg merged this pull request in b5a444a.
