This repository has been archived by the owner on Aug 7, 2024. It is now read-only.
Docs should say what's the smallest model users will see a benefit for #280
Labels
documentation
Improvements or additions to documentation
I was working on a minimal example to showcase the benefits of fp8 an H100 without forcing users to download a chunky model like here #279
I guess it's expected that fp8 will be slower for tiny models because of overhead in which case we should say in docs what's the minimal model size people should try
The text was updated successfully, but these errors were encountered: