support mistral and llava_mistral in turbomind #1579
Conversation
Tested OK with pipeline.
Please resolve the conflicts.
What is the merge plan, since it conflicts with the other two VL PRs?
After the other PRs are merged.
Auto backend UT failed.
Lines 294 to 296 in 7758151
should add
In multi-turn conversations, the template differs a bit from llava's; I'm not sure whether it matters. lmdeploy: https://github.com/haotian-liu/LLaVA/blob/main/llava/conversation.py#L350-L359
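For reference, a minimal sketch of the Mistral instruct multi-turn template as commonly documented (each user message wrapped in `[INST] ... [/INST]`, each assistant reply closed with `</s>`). This is an illustrative helper, not lmdeploy's or turbomind's actual template code; the function name and turn representation are assumptions:

```python
def build_mistral_prompt(turns):
    """Build a Mistral-style instruct prompt.

    turns: list of (user, assistant) pairs; the last assistant entry may be
    None when the model is about to generate it.
    """
    prompt = "<s>"  # single BOS at the start of the whole conversation
    for user, assistant in turns:
        prompt += f"[INST] {user} [/INST]"
        if assistant is not None:
            # completed assistant turns are terminated with EOS
            prompt += f" {assistant}</s>"
    return prompt
```

Comparing the string this produces against what the llava conversation template emits is one way to check whether the difference noted above affects multi-turn results.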
I found that in interactive inference mode, the model generates " " at the end of its answer.
Thanks for your contribution; we appreciate it a lot. The following instructions will make your pull request healthier and more likely to receive feedback. If you do not understand some items, don't worry: just make the pull request and seek help from the maintainers.
Motivation
resolve #1573
Note that sliding-window attention is not supported yet.
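To make the limitation concrete: sliding-window attention restricts each token to attend only to the most recent `window_size` tokens instead of the full causal prefix. A plain-Python sketch of the banded causal mask involved (illustrative only, not turbomind kernel code):

```python
def sliding_window_mask(seq_len, window_size):
    """Banded causal attention mask.

    Entry [i][j] is 1 if query position i may attend to key position j,
    i.e. j is causal (j <= i) and within the window (i - window_size < j).
    """
    return [
        [1 if i - window_size < j <= i else 0 for j in range(seq_len)]
        for i in range(seq_len)
    ]
```

With a window of 2 and sequence length 4, position 3 attends only to positions 2 and 3; a fully causal mask would also include 0 and 1. Supporting this in turbomind would mean applying such a band restriction (and the matching KV-cache eviction) in the attention kernels.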
Tests