Making sense of the new GGML quantization methods? #559
JacobGoldenArt asked this question in Q&A (unanswered)
Replies: 1 comment
-
I'm curious if/when we will get Metal support for these other types of GGML quants.
-
Hi, I've been using llama-cpp-python successfully on my M2 Mac. So far I've only been using Q4_0 versions of GGML models, following the Metal installation steps.
But as I see from TheBloke's model cards, there's a bunch of new quantization methods. I've read the descriptions of each method, but not being an ML dev, I'm not really sure of the benefits of each, or whether these new methods are compatible with llama-cpp-python (specifically on Mac with Metal). Any thoughts appreciated.