Cannot use /v1/embeddings api with BGE-M3 model #793

chillizex · 2024-05-16T08:01:02Z

fastchat version 0.2.36

First, start the services:

python3 -m fastchat.serve.controller --port 21002 --host 0.0.0.0
python3 -m fastchat.serve.model_worker --port 21003 --model-names bge-m3 --model-path /root/proj/models/bge-m3 --device cpu --controller-address http://0.0.0.0:21002 --host 0.0.0.0
python3 -m fastchat.serve.openai_api_server --controller-address http://0.0.0.0:21002 --port 8001 --host 0.0.0.0

Then, post requests to the 8001 port:

import requests

url = "http://10.99.106.214:8001/v1/embeddings"
headers = {
"Content-Type": "application/json"
}
data = {
"model": "bge-m3",
"input": "Hello world!"
}

response = requests.post(url, headers=headers, json=data)

查看响应内容

print(response.text)

and get the response:
{"object":"error","message":"Not Found","code":50001}

The text was updated successfully, but these errors were encountered:

staoxiao · 2024-05-16T10:32:15Z

@chillizex , this error is from fastchat, so you can open a issue in fastcaht repo. We're sorry that we cannot address this issue.

chillizex · 2024-05-22T03:14:23Z

@chillizex , this error is from fastchat, so you can open a issue in fastcaht repo. We're sorry that we cannot address this issue.

Thank you, btw, do you have some recommand deploy method that offers openai compatible api?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cannot use /v1/embeddings api with BGE-M3 model #793

Cannot use /v1/embeddings api with BGE-M3 model #793

chillizex commented May 16, 2024 •

edited

查看响应内容

staoxiao commented May 16, 2024

chillizex commented May 22, 2024

Cannot use /v1/embeddings api with BGE-M3 model #793

Cannot use /v1/embeddings api with BGE-M3 model #793

Comments

chillizex commented May 16, 2024 • edited

查看响应内容

staoxiao commented May 16, 2024

chillizex commented May 22, 2024

chillizex commented May 16, 2024 •

edited