vLLM: Add ipex-llm entrypoint #10738

xiangyuT · 2024-04-11T07:05:38Z

Description

Provides ipex_llm.serving.vllm.entrypoints for running vLLM. vLLM must be installed first.

Executable with python -m ipex_llm.serving.vllm.entrypoints.openai.api_server
Optimize with ipex-llm
Optimize with ipex
Examples/README
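
For illustration, a minimal sketch of how the entrypoint listed above might be launched from Python. The module path comes from the description; the `--model` and `--port` flags are an assumption based on upstream vLLM's OpenAI-compatible server and are not confirmed by this PR. `build_server_command` and the model path are hypothetical placeholders.

```python
import shlex
import sys

# Module path taken from the PR description.
ENTRYPOINT_MODULE = "ipex_llm.serving.vllm.entrypoints.openai.api_server"


def build_server_command(model, port=8000):
    """Build the argv list for launching the API server.

    Assumption: the entrypoint accepts `--model` and `--port` like upstream
    vLLM's OpenAI-compatible server; this PR does not confirm it.
    """
    return [sys.executable, "-m", ENTRYPOINT_MODULE,
            "--model", model, "--port", str(port)]


cmd = build_server_command("/path/to/model", port=8000)  # placeholder path
print(shlex.join(cmd))

# To actually start the server (requires vllm and ipex-llm to be installed):
#   import subprocess
#   subprocess.run(cmd, check=True)
```

Building the command as a list avoids shell quoting problems if the model path contains spaces.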

gc-fu · 2024-04-16T07:01:55Z

python/llm/src/ipex_llm/optimize.py

-    invalidInputError(model.device.type in ('cpu', 'meta'),
-                      "Expect model on device `cpu` or `meta`, "
-                      f"but got device type {model.device.type}")
+    # invalidInputError(model.device.type in ('cpu', 'meta'),


Consider changing it to this?

if hasattr(model, 'device'):
    invalidInputError(model.device.type in ('cpu', 'meta'),
                      "Expect model on device `cpu` or `meta`, "
                      f"but got device type {model.device.type}")
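
A self-contained sketch of how the guarded check suggested above would behave. It assumes the motivation is that some model objects (for example plain torch modules, as used by vLLM) do not expose a `device` attribute; the PR does not state this explicitly. `invalid_input_error` is a hypothetical stand-in for ipex-llm's `invalidInputError`, and the `SimpleNamespace` objects are stand-ins for real models.

```python
from types import SimpleNamespace


def invalid_input_error(condition, message):
    """Hypothetical stand-in for invalidInputError: raise if condition is false."""
    if not condition:
        raise RuntimeError(message)


def check_model_device(model):
    """Validate the device only when the model exposes a `device` attribute."""
    if hasattr(model, "device"):
        invalid_input_error(
            model.device.type in ("cpu", "meta"),
            "Expect model on device `cpu` or `meta`, "
            f"but got device type {model.device.type}",
        )


# Stand-in models for illustration only.
cpu_model = SimpleNamespace(device=SimpleNamespace(type="cpu"))
xpu_model = SimpleNamespace(device=SimpleNamespace(type="xpu"))
plain_module = SimpleNamespace()  # no `device` attribute at all

check_model_device(cpu_model)     # passes: device is allowed
check_model_device(plain_module)  # passes: check is skipped
```

With this guard the check is kept for models that report a device, instead of being commented out for everyone.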

xiangyuT added 18 commits April 11, 2024 14:34

init

3d86450

fix

d3f1de8

fix

a90862f

refine

8acf126

init add for ipex_llm opt

c0a4f1a

fix

cc35e75

fix

3d1c192

apply ipex llm patch

3dcc44b

fix

60f8112

fix

68b64c1

refine

b1f37e2

fix

6e55e4d

fix

a5d6f89

format

9c85058

add ipex convert

10b8d5b

format

33948f9

refine

13dd1ac

fix

6569fd4

xiangyuT changed the title ~~[WIP] vLLM: Add ipex-llm entrypoint~~ vLLM: Add ipex-llm entrypoint Apr 16, 2024

xiangyuT marked this pull request as ready for review April 16, 2024 01:11

xiangyuT requested a review from glorysdj April 16, 2024 01:11

gc-fu reviewed Apr 16, 2024
