
[Feature] Support for LLaVA-NeXT Qwen1.5-110, Qwen1.5-72B, LLaMA3-8B #1583

Open
Iven2132 opened this issue May 11, 2024 · 13 comments

Comments

@Iven2132

Motivation

It outperforms existing open-source models such as InternVL-1.5.

https://llava-vl.github.io/blog/2024-05-10-llava-next-stronger-llms/#:~:text=Live%20Demo-,Benchmark%20Results,-Results%20with%20LMMs

Related resources

https://llava-vl.github.io/blog/2024-05-10-llava-next-stronger-llms/

Additional context

No response

@White-Friday

Are LLaVA-NeXT Qwen1.5-110, Qwen1.5-72B, and LLaMA3-8B supported now?

@lvhan028
Collaborator

Yes. They are supported.

@lvhan028
Collaborator

It probably has problems when the model is large.
#1563 explains the reason

@Iven2132
Author

> Yes. They are supported.

@lvhan028 So can I use LLaVA-NeXT Qwen1.5-72B and LLaMA3-8B?

@lvhan028
Collaborator

Sorry, my bad.
It probably needs to make some changes like PR #1579 does.
I didn't find the checkpoints. Could you share the huggingface repo_id?

@Iven2132
Author

> Sorry, my bad. It probably needs to make some changes like PR #1579 does. I didn't find the checkpoints. Could you share the huggingface repo_id?

lmms-lab/llama3-llava-next-8b: https://huggingface.co/lmms-lab/llama3-llava-next-8b

lmms-lab/llava-next-72b: https://huggingface.co/lmms-lab/llava-next-72b

lmms-lab/llava-next-110b: https://huggingface.co/lmms-lab/llava-next-110b

Please add a system prompt feature too.

@babla9

babla9 commented May 13, 2024

Do you know what minimum GPU memory requirements would be to serve these VL models? Thanks!

@Iven2132
Author

> Do you know what minimum GPU memory requirements would be to serve these VL models? Thanks!

A100s are best. If you want, you can run these on Modal; they are giving $30 in free credits.

@babla9

babla9 commented May 15, 2024

> A100s are best. If you want, you can run these on Modal; they are giving $30 in free credits.

Thanks. For LLaVA, do you know what the minimum configuration would be, e.g. 1 x A100 40 GB? Would V100s work (e.g. 2 x V100)? Is there any way I can serve quantized models on V100s via lmdeploy?

@Iven2132
Author

> Thanks. For LLaVA, do you know what the minimum configuration would be, e.g. 1 x A100 40 GB? Would V100s work (e.g. 2 x V100)? Is there any way I can serve quantized models on V100s via lmdeploy?

I'm not sure, but you can run the llava-next-8b model on one A100.
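As a rough back-of-the-envelope check (my own estimate, not from lmdeploy's docs): weight memory alone is parameter count times bytes per parameter, and the KV cache, activations, and the vision tower all come on top of that, so treat these numbers as lower bounds:

```python
# Back-of-the-envelope estimate of GPU memory needed just to hold model
# weights. KV cache, activations, and the vision tower are extra, so plan
# for roughly 20-30% headroom on top of these figures (assumption).
def weight_memory_gib(n_params_billion: float, bytes_per_param: float) -> float:
    """Weight memory in GiB for a model of the given size and precision."""
    return n_params_billion * 1e9 * bytes_per_param / 1024**3

# llama3-llava-next-8b in FP16 (2 bytes/param): ~14.9 GiB -> fits one A100 40 GB
print(round(weight_memory_gib(8, 2), 1))     # 14.9
# llava-next-72b in FP16: ~134.1 GiB -> needs several GPUs at FP16
print(round(weight_memory_gib(72, 2), 1))    # 134.1
# llava-next-72b with 4-bit weights (0.5 bytes/param): ~33.5 GiB
print(round(weight_memory_gib(72, 0.5), 1))  # 33.5
```

By this estimate the 8B weights would also fit in 2 x V100 32 GB, but whether lmdeploy actually supports running these particular VL models (or their quantized variants) on V100s is a separate question for the maintainers.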

@Iven2132
Author

@lvhan028 Will it support LLaVA-NeXT?

@lvhan028
Collaborator

Regarding llava-next, the models in https://huggingface.co/collections/liuhaotian/llava-16-65b9e40155f60fd046a5ccf2 are already supported, except for https://huggingface.co/liuhaotian/llava-v1.6-mistral-7b

We will support the following in June:

lmms-lab/llama3-llava-next-8b: https://huggingface.co/lmms-lab/llama3-llava-next-8b
lmms-lab/llava-next-72b: https://huggingface.co/lmms-lab/llava-next-72b
lmms-lab/llava-next-110b: https://huggingface.co/lmms-lab/llava-next-110b


This issue is marked as stale because it has been marked as invalid or awaiting response for 7 days without any further response. It will be closed in 5 days if the stale label is not removed or if there is no further response.

@github-actions github-actions bot added the Stale label May 24, 2024