Embeddings model: bge-base-en-v1.5 #3667
-
Hello, Thanks in advance! |
Beta Was this translation helpful? Give feedback.
Replies: 5 comments 9 replies
-
bge-base-en-v1.5 is not on the list of supported models. Curious, what about this model makes it better than some of the many other models that llama.cpp already supports? |
Beta Was this translation helpful? Give feedback.
-
Supporting embedding models is on the way, starting from BERT and its variants. And it will be accompanied by a RAG example directly in C/C++. |
Beta Was this translation helpful? Give feedback.
-
Estimating one week or so. |
Beta Was this translation helpful? Give feedback.
-
hey @francis2tm @zerodrift and other guys, if you are still interested, I've implemented BGE series model with ggml. Check this embeddings.cpp. |
Beta Was this translation helpful? Give feedback.
-
hey @francis2tm bge-large-zh 8bit quant model speed can reach 749.12 tok/s on Apple M1 Soc with 4-thread. |
Beta Was this translation helpful? Give feedback.
Estimating one week or so.