cls_token problem with image. #207

evertonipx · 2024-02-14T00:05:50Z

When I use only prompt text mPLUG-Owl2 works fine. But when I include an image have this error:

File "C:\py projects\IPXCopilot_OWLVersion\mplug_owl2\model\visual_encoder.py", line 117, in forward if self.cls_token : RuntimeError: Boolean value of Tensor with more than one value is ambiguous

If I change to: if self.cls_token is not None: I got this error:

File "C:\py projects\IPXCopilot_OWLVersion\mplug_owl2\model\visual_encoder.py", line 123, in forward embeddings = embeddings + get_abs_pos(self.position_embedding,embeddings.size(1)) RuntimeError: The size of tensor a (1024) must match the size of tensor b (1049600) at non-singleton dimension 2

Anyone with the same problem? Worked fine before the update

The text was updated successfully, but these errors were encountered:

findalexli · 2024-02-14T07:22:47Z

Following as well, getting this exact issue

jiaqixuac · 2024-02-18T04:53:38Z

It seems that the updated code does not deal with cls_token well.
See 54b508a
If modify if embeddings.shape[1] != self.num_patches: -> if self.cls_token is None and embeddings.shape[1] != self.num_patches:, it can work.

vateye · 2024-02-18T08:58:33Z

Fixed.

evertonipx changed the title ~~cls_token wproblem with image.~~ cls_token problem with image. Feb 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cls_token problem with image. #207

cls_token problem with image. #207

evertonipx commented Feb 14, 2024 •

edited

findalexli commented Feb 14, 2024

jiaqixuac commented Feb 18, 2024

vateye commented Feb 18, 2024

cls_token problem with image. #207

cls_token problem with image. #207

Comments

evertonipx commented Feb 14, 2024 • edited

findalexli commented Feb 14, 2024

jiaqixuac commented Feb 18, 2024

vateye commented Feb 18, 2024

evertonipx commented Feb 14, 2024 •

edited