-
Notifications
You must be signed in to change notification settings - Fork 97
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
#16948: Add fixes to support unpadded inner dim for pf + mm and add s…
…upport for DRAM in1 inputs for ring matmul. (#17311) ### Ticket - #17060 - #16948 ### Problem description With the new validation being added to tensor_spec, the current implementation of ring matmul with unpadded shapes fails. Also, the matmul currently does not support DRAM interleaved in1 weights, which is required for matmuls with large weights that cannot fit in L1 (such as the LM head in Llama). ### What's changed - Internally round up the inner dim - Add support for DRAM_INTERLEAVED in1
- Loading branch information
Showing
7 changed files
with
243 additions
and
32 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.