We still don't use BFDOT on macOS CPU because the Apple compiler used for the PyTorch wheel is outdated #1444
Labels
Known Gaps
These are known Gaps/Issues/Bug items in torchchat
MPS/Metal
Issues related to Metal MPS/env set up
performance
triaged
This issue has been looked at by a team member, triaged, and prioritized into an appropriate module
🚀 The feature, motivation and pitch
See pytorch/pytorch#143913. This is blocking improved CPU performance for bfloat16 decoding on Mac; this issue tracks it on the torchchat side.
Alternatives
A robust setup for decoding using accelerators on Mac.
Additional context
No response
RFC (Optional)
No response