Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Image transformation for InternVL-1.5 #166

Open
ruifengma opened this issue May 13, 2024 · 1 comment
Open

Image transformation for InternVL-1.5 #166

ruifengma opened this issue May 13, 2024 · 1 comment

Comments

@ruifengma
Copy link

I found that there is a image transformation step on load_image function from the example on huggingface repo (transformers based), but there is not any image processing on the gradio_web_server example (the performance is still fine). Then my question is that if there is a comparing research on performance difference with image processing and without image processing? Thanks in advance

@czczup
Copy link
Member

czczup commented May 16, 2024

Hello, in fact, in the gradio_web_server image transformation is also done, but the code is a little different. In gradio_web_server, the image transformation is conducted by CLIPImageProcessor, which is actually aligned.

See this line of code for more detail: https://github.com/OpenGVLab/InternVL/blob/main/internvl_chat/internvl/serve/model_worker.py#L80

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants