
Supporting device_map = 'auto' similar to the one in .from_pretrained method from Huggingface #36

Open · Ahmed-Roushdy opened this issue Nov 16, 2023 · 3 comments

@Ahmed-Roushdy
The pruned model is saved with torch.save and loaded back with torch.load. I was wondering whether there is a way to use something like device_map='auto', as in Huggingface's .from_pretrained method.
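For context, a minimal sketch of the save/load flow described above (model shapes and the file path are illustrative, not from the repo). torch.load restores the whole module on a single device; it has no built-in counterpart to device_map='auto':

```python
import torch
import torch.nn as nn

# Toy stand-in for the pruned model; names and shapes are illustrative.
pruned = nn.Sequential(nn.Linear(16, 7), nn.ReLU(), nn.Linear(7, 2))
torch.save(pruned, "pruned_model.pt")

# torch.load places the entire module on one device (here CPU); there is
# no analogue of from_pretrained(..., device_map='auto') at this level.
loaded = torch.load("pruned_model.pt", map_location="cpu", weights_only=False)
print(type(loaded).__name__)
```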

@horseee (Owner) commented Nov 19, 2023

Hi. It would be challenging to do this, since the pruned model does not follow a uniform configuration: different modules end up with different dimensions, and different layers with different numbers of heads.
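One possible workaround (not something the repo provides): device placement can be computed from the instantiated nn.Module itself rather than from a config file, so non-uniform shapes left by pruning are not an obstacle. The accelerate library's infer_auto_device_map / dispatch_model utilities work this way; a hand-rolled sketch of the same idea, with illustrative device names and no actual GPU required to build the map:

```python
import torch.nn as nn

def build_device_map(model: nn.Module, devices: list) -> dict:
    """Greedily assign top-level submodules to the least-loaded device.

    A stand-in for device_map='auto': it inspects the live module tree,
    so layers with different (pruned) shapes are handled uniformly.
    """
    loads = {d: 0 for d in devices}
    device_map = {}
    for name, module in model.named_children():
        n_params = sum(p.numel() for p in module.parameters())
        target = min(loads, key=loads.get)  # least-loaded device so far
        device_map[name] = target
        loads[target] += n_params
    return device_map

# A toy "pruned" model with deliberately non-uniform layer widths.
model = nn.Sequential(nn.Linear(16, 7), nn.Linear(7, 13), nn.Linear(13, 2))
device_map = build_device_map(model, ["cuda:0", "cuda:1"])
print(device_map)  # {'0': 'cuda:0', '1': 'cuda:1', '2': 'cuda:1'}
```

The resulting dict has the same shape as the device_map that from_pretrained accepts; applying it (moving each submodule and routing activations) is the part accelerate's dispatch_model automates.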

@Ahmed-Roushdy (Author)

Thank you. Can you suggest any similar methods that would enable distributed training when using the train method of the Huggingface Trainer?
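Not an answer from the repo, but one hedged option: DistributedDataParallel wraps any instantiated nn.Module and never consults a config file, so a torch.load-ed pruned model can be passed to the Hugging Face Trainer and launched with torchrun for data-parallel training. A single-process, CPU-only sketch of the underlying wrapping (the env defaults below are only so the snippet runs standalone; torchrun sets them itself):

```python
import os
import torch
import torch.distributed as dist
import torch.nn as nn
from torch.nn.parallel import DistributedDataParallel as DDP

# Single-process defaults so this sketch runs standalone; under
# `torchrun --nproc_per_node=N` the launcher provides these values.
os.environ.setdefault("MASTER_ADDR", "127.0.0.1")
os.environ.setdefault("MASTER_PORT", "29500")
os.environ.setdefault("RANK", "0")
os.environ.setdefault("WORLD_SIZE", "1")

dist.init_process_group(backend="gloo")  # use "nccl" on GPUs

# Stand-in for the torch.load-ed pruned model from the issue.
model = nn.Sequential(nn.Linear(16, 7), nn.Linear(7, 2))
ddp_model = DDP(model)  # on GPUs: DDP(model, device_ids=[local_rank])

out = ddp_model(torch.randn(4, 16))
print(tuple(out.shape))

dist.destroy_process_group()
```

Note this only replicates the whole model per process (data parallelism); it does not shard a model that is too large for one device, which is what device_map='auto' addresses.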

@KBhandari11
I am interested in this as well. Is it possible to apply it to Llama2-70b, or to use some distributed method to prune the model?
