Do we need to build Core ML versions of quantized models? #1829

Answered by bobqianic
7k50 asked this question in Q&A
Core ML is used only for the encoder; decoding is handled by GGML. Core ML doesn't support these specialized quantization methods, so the Core ML encoder model stays unquantized. In theory, though, it's still feasible to run the decoder from a quantized model, because decoding goes through GGML rather than Core ML.

In short: unquantized Core ML model + quantized GGML model.
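
To make the pairing concrete, here is a rough Python sketch of how a quantized GGML file could map onto a single unquantized Core ML encoder. It assumes a naming convention where the encoder sits next to the GGML model as `<base>-encoder.mlmodelc` and a trailing `-qX_Y` quantization suffix is ignored when looking it up. That convention, the helper name `coreml_encoder_path`, and the example file names are assumptions for illustration, so check them against the whisper.cpp version you are building.

```python
import re


def coreml_encoder_path(ggml_model_path: str) -> str:
    """Hypothetical helper: guess the Core ML encoder path for a GGML model.

    Assumed convention (not confirmed in this thread): drop the file
    extension, drop a trailing "-qX_Y" quantization suffix if present,
    then append "-encoder.mlmodelc".
    """
    # Drop the file extension (e.g. ".bin"), if there is one.
    base, dot, _ext = ggml_model_path.rpartition(".")
    if not dot:
        base = ggml_model_path

    # Drop a quantization suffix such as "-q5_0" or "-q8_0".
    base = re.sub(r"-q\d_\w$", "", base)

    return base + "-encoder.mlmodelc"


# A quantized and an unquantized GGML model resolve to the same
# (unquantized) Core ML encoder under this assumed convention.
quantized = coreml_encoder_path("models/ggml-base.en-q5_0.bin")
unquantized = coreml_encoder_path("models/ggml-base.en.bin")
```

If that convention holds for your build, there is no separate Core ML model to generate per quantization level: the one unquantized encoder is reused and only the GGML file changes.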

Answer selected by ggerganov