
MLEM-loaded model performs consistently worse #641

Open
rocco-fortuna opened this issue Mar 13, 2023 · 4 comments
Labels
bug (Something isn't working) · ml-framework (ML Framework support) · serialization (Dumping and loading Python objects)

Comments

@rocco-fortuna

rocco-fortuna commented Mar 13, 2023

I have a PyTorch text classification model whose architecture I cannot disclose. When the model is loaded with its native library, it consistently performs slightly better than the same model saved and then loaded with MLEM.
As detailed in the Discord discussion with @aguschin:

It's a PyTorch sequence classification model. I ran the eval four times on each of:

  1. the original model
  2. the mlem_model saved and loaded with:
# load the model with the PyTorch model class
model = MyModel.from_pretrained('./model_path')

# save
from mlem.api import save
save(model, "./checkpoints/v070_mlem")

# load
from mlem.api import load
mlem_model = load("./checkpoints/v070_mlem")

Each eval ran on 5k samples; the accuracies were:

  1. original:
  • 0.7868
  • 0.7874
  • 0.7844
  • 0.7864
  2. mlem_model:
  • 0.7778
  • 0.783
  • 0.7808
  • 0.7816

So almost the same, but consistently lower by about 0.6% on average.
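
Not from the thread, but one way to narrow this down: compare the parameters of the two models directly, which shows whether the MLEM round-trip changes the weights at all or whether the gap comes from the eval itself. model and mlem_model are the objects from the snippet above:

import torch

# compare the two state dicts tensor by tensor; any mismatch means
# the save/load round-trip altered the weights
orig_state = model.state_dict()
mlem_state = mlem_model.state_dict()

assert orig_state.keys() == mlem_state.keys()
for name in orig_state:
    if not torch.equal(orig_state[name], mlem_state[name]):
        print(f"weights differ at: {name}")

# train/eval mode is another possible source of a small, consistent gap:
# dropout and batch norm behave differently if one model is left in
# training mode after loading
print(model.training, mlem_model.training)

If the weights match exactly and both models are in eval mode, the gap is more likely in the evaluation pipeline than in serialization.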

@rocco-fortuna
Author

rocco-fortuna commented Mar 13, 2023

@aguschin mentioned:

I think we can [try] one of the PyTorch examples to see if this can be reproduced there. If that doesn't help us, we can try to dig deeper into some specifics.

Let me know if you need any additional info.
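
A minimal sketch of the reproduction @aguschin suggests, using a small stand-in network instead of the undisclosed classifier (the architecture here is an arbitrary assumption, and save/load are called exactly as in the snippet above):

import torch
from mlem.api import save, load

# tiny stand-in for the real sequence classifier
net = torch.nn.Sequential(
    torch.nn.Linear(16, 8),
    torch.nn.ReLU(),
    torch.nn.Linear(8, 2),
)
net.eval()

save(net, "./checkpoints/repro")
net2 = load("./checkpoints/repro")
net2.eval()

# if the round-trip is lossless, outputs should match exactly
x = torch.randn(4, 16)
with torch.no_grad():
    print(torch.equal(net(x), net2(x)))

If this prints True, the issue is unlikely to reproduce on simple models, and the specifics of the real architecture (custom layers, buffers, non-parameter state) become the next place to look.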

@aguschin
Contributor

@mike0sv do you have any ideas why this could be the case?

@mike0sv
Contributor

mike0sv commented Mar 14, 2023

Under the hood, MLEM saves and loads the model with torch.save and torch.load (or torch.jit.save and torch.jit.load). We do not do anything else with the model. Can you confirm that this logic is to blame by running something like this

# load the model with the PyTorch model class
model = MyModel.from_pretrained('./model_path')

# save
torch.save(model, "...")

# load
model = torch.load("...")

and running the evaluation?
If your model is a torch.jit.ScriptModule (isinstance(model, torch.jit.ScriptModule)), use torch.jit.save and torch.jit.load instead.
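
For that case, the same check would look roughly like this (the checkpoint path is a placeholder of my choosing):

import torch

# only applies if the model was scripted or traced
if isinstance(model, torch.jit.ScriptModule):
    torch.jit.save(model, "./checkpoints/jit_roundtrip.pt")
    model = torch.jit.load("./checkpoints/jit_roundtrip.pt")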

@rocco-fortuna
Author

rocco-fortuna commented Mar 16, 2023

That yielded:

  • 0.7832
  • 0.7862
  • 0.7888
  • 0.7892

Consistent with the original model's performance.
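
One caveat not raised in the thread: the accuracy drifts slightly between runs even for the original model, so some nondeterminism is present in the eval itself. A sketch of how the runs could be pinned down before comparing the two models (the seed value and the exact set of calls are assumptions):

import random

import numpy as np
import torch

def seed_everything(seed: int = 0) -> None:
    # pin the Python, NumPy, and torch RNGs so repeated evals are comparable
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)
    # raises at runtime for ops that have no deterministic kernel
    torch.use_deterministic_algorithms(True)

seed_everything()
model.eval()       # make sure dropout etc. are disabled in both models
mlem_model.eval()

With identical weights, eval mode, and seeds, the two models should produce identical accuracies; any remaining gap would point back at serialization.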

@aguschin added the bug (Something isn't working), ml-framework (ML Framework support), and serialization (Dumping and loading Python objects) labels and removed the question (Further information is requested) label on Apr 19, 2023.