
Add gen reg tests #689

Draft: gpucce wants to merge 23 commits into main

Conversation

@gpucce (Contributor) commented Oct 22, 2023

This adds regression tests for generative models.

@rwightman this should be almost done; however, there seems to be a regression error for CoCa that I had not noticed.
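A rough sketch of the shape such a regression test for generative models might take (not the actual test code added in this PR; the model name, reference-file path, and comparison details are illustrative):

import pytest
import torch
import open_clip


@pytest.mark.parametrize("model_name", ["coca_ViT-B-32"])
def test_generate_regression(model_name):
    # Fix the seed so both the random weights and the random input are
    # reproducible across branches.
    torch.manual_seed(0)
    model, _, _ = open_clip.create_model_and_transforms(model_name)
    model.eval()

    img = torch.randn(1, 3, 224, 224)
    with torch.no_grad():
        generated = model.generate(img)

    # "tests/expected/..." is a placeholder for a reference dump produced
    # with a previous release; the generated token ids should match it.
    expected = torch.load(f"tests/expected/{model_name}_generate.pt")
    torch.testing.assert_close(generated, expected)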

@gpucce gpucce marked this pull request as draft October 22, 2023 12:23
@rwightman (Collaborator)

@gpucce do you have any idea what might be causing it? What's the symptom, and by how much is it 'off'? There are numerical changes across versions of PyTorch, etc., so some difference is expected.

@gpucce (Contributor, Author) commented Oct 22, 2023

> @gpucce do you have any idea what might be causing it? What's the symptom, and by how much is it 'off'? There are numerical changes across versions of PyTorch, etc., so some difference is expected.

@rwightman I have kept testing, and I believe it might just be me still doing something wrong with the GitHub CI rather than a real issue. When I run small tests locally, everything comes out exactly equal. I will keep trying until I get the whole thing working.

@rwightman (Collaborator)

@gpucce have you run the same random inputs through the different towers and saved the results, to verify closeness within some float eps on the same env but with current main vs the previous release?

i.e. something along these lines:

import torch

# model and vocab_size refer to the model under test and its vocab size.
torch.manual_seed(0)
img = torch.randn(16, 3, 224, 224)
text = torch.randint(0, vocab_size, (16, 77))
outputs = model(img, text)
torch.save(outputs, "outputs_main.pt")  # torch.save needs a destination path; file name illustrative
...
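A minimal sketch of the comparison step this suggests, assuming the outputs were dumped to two files (names illustrative) on the previous release and on current main:

import torch

# Load the dumps produced by the two runs (file names are illustrative
# and match the sketch above).
ref_outputs = torch.load("outputs_release.pt")
new_outputs = torch.load("outputs_main.pt")

# Assuming the model returns a tuple of tensors, compare each element
# within a small float tolerance; assert_close raises with a summary of
# the largest difference if they diverge.
for ref, new in zip(ref_outputs, new_outputs):
    torch.testing.assert_close(new, ref, rtol=1e-4, atol=1e-5)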

@gpucce (Contributor, Author) commented Oct 22, 2023

> @gpucce have you run the same random inputs through the different towers and saved the results, to verify closeness within some float eps on the same env but with current main vs the previous release?
>
> i.e. something along these lines:
>
> torch.manual_seed(0)
> img = torch.randn(16, 3, 224, 224)
> text = torch.randint(0, vocab_size, (16, 77))
> outputs = model(img, text)
> torch.save(outputs, "outputs_main.pt")
> ...

@rwightman yeah, everything I do by hand seems to be fine. It's when building the proper regression tests that I get errors; I will check again later or tomorrow. I must be doing something dumb but can't figure out what.

@rwightman (Collaborator) commented Oct 22, 2023 via email

@rwightman (Collaborator)

FWIW, using your cat.jpg I get '<start_of_text>a cat sitting on its hind legs looking up . <end_of_text>' both on PT 2.1 w/ transformers 4.34 and the latest main branch, AND the same on PT 1.13, transformers 4.24, open_clip 2.16.2.
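For reference, a sketch of how a caption check like that can be reproduced with the CoCa generate API (the pretrained tag below is illustrative; substitute whichever checkpoint is under test):

import torch
from PIL import Image
import open_clip

# Pretrained tag is illustrative; use the checkpoint being tested.
model, _, transform = open_clip.create_model_and_transforms(
    "coca_ViT-B-32", pretrained="mscoco_finetuned_laion2b_s13b_b90k"
)
model.eval()

im = transform(Image.open("cat.jpg").convert("RGB")).unsqueeze(0)

with torch.no_grad():
    generated = model.generate(im)

# decode() turns token ids back into text, including the special
# <start_of_text>/<end_of_text> tokens quoted above.
print(open_clip.decode(generated[0]))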

@gpucce (Contributor, Author) commented Oct 22, 2023

@rwightman I think I found it: could it be that the new open_clip.tokenize generates sequences with length 76 in some cases?

@gpucce (Contributor, Author) commented Oct 22, 2023

> @rwightman I think I found it: could it be that the new open_clip.tokenize generates sequences with length 76 in some cases?

Specifically, open_clip.get_tokenizer("coca_ViT-B-32")("some text") has shape [1, 76] in the current version, but it had shape [1, 77] in v2.22.0.

@rwightman (Collaborator)

@gpucce I'd avoid using the singleton tokenizer by calling open_clip.tokenize(); use the factory to get one for your model.
But yeah, the CoCa configs say the context length is 76, so get_tokenizer will return a tokenizer that outputs 76. context_len is used by the tokenizer now, since we have multiple models with different context lengths.
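A quick sketch of the difference being described (expected shapes follow from the context lengths discussed above):

import open_clip

# Legacy module-level tokenizer: pads/truncates to the default 77 tokens.
legacy = open_clip.tokenize(["some text"])
print(legacy.shape)  # torch.Size([1, 77])

# Factory tokenizer: pads to the context length declared in the model
# config, which is 76 for the CoCa configs per the discussion above.
tokenizer = open_clip.get_tokenizer("coca_ViT-B-32")
tokens = tokenizer(["some text"])
print(tokens.shape)  # torch.Size([1, 76])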
