
[WIP] Add cross_entropy to torchcompile_cat executor #1655

Draft
wants to merge 1 commit into base: main
Conversation

riccardofelluga
Collaborator

What does this PR do?

In an attempt to fix #1654 and partially address #1552, this PR adds the necessary ops to the torchcompile_cat list so that it can capture HF's CausalLMLoss.

Before merging, this PR needs testing with other models; I will post a message with the benchmark results.

@IvanYashchuk
Collaborator

This executor is meant for fusions with a cat operation. Cross entropy is usually quite far from RoPE (which uses cat). How does adding cross_entropy to torchcompile_cat help?

@riccardofelluga
Collaborator Author

@IvanYashchuk Adding cross entropy indeed does not help with RoPE, but it allows Thunder to use a very efficient fused cross-entropy Triton kernel whose performance currently cannot be matched by the other executors (not even APEX). From lines 211 and 212, it seems that while the set of ops was originally chosen for RoPE, the intent was that other ops could be added in the future, making this a fusion executor that comes to the rescue of nvFuser when it can improve performance. Do you think it would be better to create a separate executor entry for Inductor cross entropy only?
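As a minimal sketch of what the fusion buys here: eager PyTorch evaluates cross entropy as separate steps (log-softmax, then a gather/reduce for the NLL), each reading and writing the full logits tensor, while torch.compile/Inductor can emit a single fused Triton kernel for the whole loss on GPU. The shapes below are illustrative stand-ins for an HF causal-LM loss, not taken from the PR.

```python
import torch
import torch.nn.functional as F

# Illustrative shapes: logits over a vocabulary, integer targets.
logits = torch.randn(8, 32)           # (batch, vocab_size)
targets = torch.randint(0, 32, (8,))  # class index per sample

# The two eager-mode steps that a fused kernel combines into one pass:
log_probs = F.log_softmax(logits, dim=-1)   # max + exp + sum + log
manual_loss = F.nll_loss(log_probs, targets)  # gather target log-probs, mean

# Reference: the single cross_entropy call computing the same value.
reference = F.cross_entropy(logits, targets)
assert torch.allclose(manual_loss, reference)

# On CUDA, compiling the composite expression lets Inductor generate
# one fused Triton kernel instead of materializing log_probs:
compiled_loss = torch.compile(
    lambda x, t: F.cross_entropy(x, t)
)
```

The memory saving referenced in #1654 comes from not materializing the intermediate `(batch, vocab_size)` log-probability tensor, which dominates memory when the vocabulary is large.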

Development

Successfully merging this pull request may close these issues.

nvFuser using more memory than inductor for HF CausalLMLoss