Issues: InternLM/lmdeploy
[Benchmark] benchmarks on different cuda architecture with mo...
#815
opened Dec 11, 2023 by
lvhan028
[Bug] change h_input_length_buf_ before synchronization
#1584
opened May 11, 2024 by
mengmeexix
2 tasks done
[Feature] Support for LLaVA-NeXT Qwen1.5-110, Qwen1.5-72B, LLaMA3-8B
awaiting response
#1583
opened May 11, 2024 by
Iven2132
[Feature] Is persistent batch supported for the decoder in enc-dec style models?
awaiting response
#1581
opened May 10, 2024 by
Oldpan
[Bug] Deploying internlm/internlm-xcomposer-vl-7b and internlm/internlm-xcomposer2-vl-7b with docker both fail with errors
#1577
opened May 10, 2024 by
ye7love7
2 tasks done
[Bug] Error deploying HuggingFace model llava-v1.6-mistral-7b with lmdeploy: Unrecognized model type llava_mistral
awaiting response
#1573
opened May 10, 2024 by
enogalaxies
1 of 2 tasks
[Docs] Can we have an up-to-date version of the persistent batching gif?
#1564
opened May 9, 2024 by
youkaichao
[Bug] Qwen1.5-32B errors out during awq quantization and convert
awaiting response
#1560
opened May 8, 2024 by
dingjingzhen
2 tasks
Why did the model not understand the previous conversations?
#1551
opened May 7, 2024 by
Iven2132
2 tasks done
[Bug] Setting logprobs = true and top_logprobs = 5 in the restful server returns only 4 top logprobs, which is unexpected.
#1548
opened May 6, 2024 by
zhulinJulia24
2 tasks
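The logprobs issue above can be illustrated with a sketch of the request that would trigger it, assuming lmdeploy's OpenAI-compatible `/v1/chat/completions` endpoint; the model name is a placeholder, and no HTTP call is made here, only the request body from the report:

```python
import json

# Hypothetical request body for an OpenAI-compatible chat completions
# endpoint, reproducing the settings from issue #1548:
# logprobs enabled with top_logprobs = 5 (the report says only 4 come back).
payload = {
    "model": "internlm2-chat-7b",  # placeholder model name, not from the report
    "messages": [{"role": "user", "content": "Hello"}],
    "logprobs": True,
    "top_logprobs": 5,
}

# Serialize to the JSON body a client would POST to the server.
body = json.dumps(payload)
print(body)
```

Per the report, the expectation is that each returned token carries 5 alternatives in its `top_logprobs` list, matching the requested value.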
[Bug] llama3 and Chinese-LLaMA-Alpaca-3 generate no results with either the pipeline approach or lmdeploy chat (llama2 works fine with both)
#1538
opened May 1, 2024 by
zhanghui-china
2 tasks