How to realize multi-image correlation in a VQA task? #200

fansticOne · 2024-01-31T07:48:28Z

In a VQA task, I want to input two images and ask a question about both of them. How can I do this?

LukeForeverYoung · 2024-02-02T17:27:22Z

You can pass a list of images and place the same number of "<|image|>" in your prompt.
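A minimal sketch of the prompt side of this suggestion, written in plain Python so it runs without the model installed. The helper name `build_multi_image_prompt` is hypothetical, and the `USER: ... ASSISTANT:` wrapper is an assumption borrowed from the prompt quoted later in this thread; only the "<|image|>" placeholder itself comes from the reply above.

```python
# Placeholder string taken from the reply above.
IMAGE_PLACEHOLDER = "<|image|>"


def build_multi_image_prompt(question: str, num_images: int) -> str:
    """Hypothetical helper: put one placeholder per image before the question."""
    if num_images < 1:
        raise ValueError("num_images must be at least 1")
    placeholders = IMAGE_PLACEHOLDER * num_images
    # Assumed conversation template, mirroring the prompt shown later in the thread.
    return f"USER: {placeholders}{question} ASSISTANT:"


# Two images, so two placeholders are placed in the prompt.
prompt = build_multi_image_prompt("Are the two dogs the same color?", 2)
print(prompt)
```

The list of images passed to the model would then need to have the same length as `num_images`.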

fansticOne · 2024-02-04T07:36:10Z

I passed a list of two images and modified the prompt accordingly. After preprocessing, image_tensor has a batch size of 2, while input_ids has a batch size of 1. When I run model.generate(), I do get a result, but it is wrong. Am I misunderstanding something?
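One way to rule out a simple mismatch before calling model.generate() is to compare the number of "<|image|>" placeholders in the prompt with the first dimension of the image tensor. The sketch below is only a sanity check under the assumption that each placeholder corresponds to one entry along that dimension, which this thread does not confirm; `check_placeholder_count` is a hypothetical helper, and the shape tuple is illustrative.

```python
IMAGE_PLACEHOLDER = "<|image|>"


def check_placeholder_count(prompt: str, image_shape: tuple) -> None:
    """Hypothetical check: one placeholder per image along dimension 0."""
    num_placeholders = prompt.count(IMAGE_PLACEHOLDER)
    num_images = image_shape[0]
    if num_placeholders != num_images:
        raise ValueError(
            f"prompt has {num_placeholders} placeholder(s) "
            f"but the image tensor holds {num_images} image(s)"
        )


# Illustrative shape: 2 images, 3 channels, 224 x 224 pixels (assumed values).
check_placeholder_count(
    "USER: <|image|><|image|>Are the two dogs the same color? ASSISTANT:",
    (2, 3, 224, 224),
)
```

With a real tensor, `tuple(image_tensor.shape)` could be passed as the second argument. Passing this check says nothing about answer quality; it only confirms that the prompt and the image list agree in count.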

LukeForeverYoung · 2024-02-07T07:19:31Z

> I passed a list of two images and modified the prompt accordingly. After preprocessing, image_tensor has a batch size of 2, while input_ids has a batch size of 1. When I run model.generate(), I do get a result, but it is wrong. Am I misunderstanding something?

Could you provide an example and the incorrect response generated by the owl? By the way, the owl has not been trained on SFT data that includes multiple images, so it is reasonable to expect it to fail in some cases.

fansticOne · 2024-02-08T08:35:40Z

Here are the two images I passed

The prompt is:
'USER: <|image|><|image|>{}\nAnswer the question using a single word or phrase. ASSISTANT:'.format('Does the dog in the first picture have same color with the dog in the second picture?')
The response generated by the owl is 'Yes'.

LukeForeverYoung closed this as completed Feb 2, 2024

LukeForeverYoung reopened this Feb 7, 2024
