
feat: add meta device initialization for pretrained models, 5x faster load times #501

Draft
wants to merge 3 commits into main
Conversation

ErwannMillon

Uses accelerate's init_empty_weights context manager to initialize models on the meta device (creates empty dummy tensors without allocating memory or spending time on random weight initialization).

Checks whether accelerate is installed and falls back to the default open_clip behavior if the package is unavailable, so accelerate remains an optional dependency.

Loads CLIP ViT-H-14 in about 4 seconds.
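
For reference, a minimal sketch of the guarded approach described above (not the actual diff from this PR; `create_model_fast`, `model_cls`, and `cfg` are placeholder names, and materializing the meta tensors with `load_state_dict(..., assign=True)` assumes a recent PyTorch):

```python
import contextlib

import torch

# Guard the accelerate import so it stays an optional dependency.
try:
    from accelerate import init_empty_weights
    _has_accelerate = True
except ImportError:
    _has_accelerate = False


def create_model_fast(model_cls, cfg, checkpoint_path):
    # Build the module on the meta device (no memory allocated, no random init)
    # when accelerate is available; otherwise build it normally.
    init_ctx = init_empty_weights() if _has_accelerate else contextlib.nullcontext()
    with init_ctx:
        model = model_cls(**cfg)

    state_dict = torch.load(checkpoint_path, map_location='cpu')
    # assign=True swaps the meta tensors for the loaded ones instead of copying
    # into them (meta tensors have no storage to copy into).
    model.load_state_dict(state_dict, assign=True)
    return model
```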

@rwightman
Collaborator

@ErwannMillon thanks for the PR. Most of the meta device code/logic is in torch itself; I believe (correct me if I'm wrong) the lines of code needed to implement the rest would be fewer than the accelerate import guards, so I'd rather just do it natively if possible.

@ErwannMillon
Author

Sure, just removed the accelerate dep
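
For what it's worth, a rough sketch of what the accelerate-free version could look like, using the `torch.device('meta')` context manager (assumes PyTorch 2.x; the names are placeholders and this isn't the exact code in the PR):

```python
import torch


def create_model_meta(model_cls, cfg, checkpoint_path):
    # Constructing the module under the meta device skips parameter allocation
    # and random initialization entirely.
    with torch.device('meta'):
        model = model_cls(**cfg)

    state_dict = torch.load(checkpoint_path, map_location='cpu')
    # assign=True materializes the meta parameters with the loaded tensors.
    model.load_state_dict(state_dict, assign=True)
    return model
```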

@ErwannMillon
Author

Not sure what's failing. This was just a quick PR I made because the feature was something I needed for my work. I don't have the time to dig into this right now, but it might be a good first issue for someone else.

@rwightman
Collaborator

@ErwannMillon k, it can definitely be useful, especially as the models get larger. Aside from the test failing, there are a few things I want to verify, and I could probably clean it up / compact it a bit. We can leave it as a draft for now?

There's also the possibility of doing it the way PyTorch intended, https://pytorch.org/docs/stable/generated/torch.nn.utils.skip_init.html#torch.nn.utils.skip_init .. but that requires modifying all models to accept device args and pass them through, which is a bit meh. Then again, the context manager approach is a bit of a glorious hack and has the possibility of breakage with changes in PyTorch.
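
For comparison, a hedged sketch of the skip_init route; the `CLIPModel` call is hypothetical and only works if the class forwards a `device` kwarg down to its submodules, which is exactly the modification mentioned above:

```python
import torch
import torch.nn as nn

# Works for built-in modules, which already accept device/dtype kwargs:
linear = torch.nn.utils.skip_init(nn.Linear, 512, 1024)

# A custom model would need the same treatment, e.g. (hypothetical):
# model = torch.nn.utils.skip_init(CLIPModel, **cfg)
# ...which requires CLIPModel.__init__ to accept and forward `device=`.
```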

@ErwannMillon
Author

Sure, no worries, thanks for taking the time to look at it.

@rwightman rwightman marked this pull request as draft April 20, 2023 15:30