Imitater

A unified language model server built upon vllm and infinity.

Usage

Install

pip install -e .

Launch Server

imitater -c config/example.yaml

Show configuration instruction.

Add an OpenAI model

- name: OpenAI model name
- token: OpenAI token

Add a chat model

- name: Display name
- path: Model name on hub or local model path
- device: Device IDs
- port: Port ID
- maxlen: Maximum model length (optional)
- agent_type: Agent type (optional) {react, aligned}
- template: Template jinja file (optional)
- gen_config: Generation config folder (optional)

Add an embedding model

- name: Display name
- path: Model name on hub or local model path
- device: Device IDs (does not support multi-gpus)
- port: Port ID
- batch_size: Batch size (optional)

Note

Chat template is required for the chat models.

Use export USE_MODELSCOPE_HUB=1 to download model from modelscope.

Test Server

python tests/test_openai.py -c config/example.yaml

Roadmap

Response choices.
Rerank model support.

Name		Name	Last commit message	Last commit date
Latest commit History 56 Commits
.github/workflows		.github/workflows
config		config
generation_config		generation_config
src/imitater		src/imitater
templates		templates
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py

License

the-seeds/imitater

Folders and files

Latest commit

History

Repository files navigation

Imitater

Usage

Install

Launch Server

Add an OpenAI model

Add a chat model

Add an embedding model

Test Server

Roadmap

About

Resources

License

Stars

Watchers

Forks

Languages