Add instructions to convert Hugging Face models to PyTorch #3523

iseeyuan · 2024-05-06T23:05:50Z

As titled. Users commonly download LLM models in safetensors format. This PR adds instructions and an example script to convert them to PyTorch format so that the export_llama script can accept them. The script leverages the utils from TorchTune.

Thanks @l3utterfly and @kartikayk for the discussions and suggestions!

More context in #3303
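
To make the idea of "converting" a checkpoint more concrete, here is a minimal sketch of the key-renaming part of such a conversion, using only the standard library. It is not the script added by this PR, which relies on TorchTune's utilities. The function name `rename_keys`, the table `_HF_TO_META`, and the handful of name patterns in it are illustrative assumptions, and placeholder strings stand in for tensors. A real conversion covers every parameter and may also reshape or permute weights, which this sketch omits.

```python
import re

# Illustrative subset of parameter-name patterns (assumed for this sketch,
# not taken from the PR). Left: Hugging Face-style name, right: Meta-style name.
_HF_TO_META = [
    (r"^model\.embed_tokens\.weight$", "tok_embeddings.weight"),
    (r"^model\.layers\.(\d+)\.self_attn\.q_proj\.weight$", r"layers.\1.attention.wq.weight"),
    (r"^model\.layers\.(\d+)\.self_attn\.k_proj\.weight$", r"layers.\1.attention.wk.weight"),
    (r"^model\.norm\.weight$", "norm.weight"),
    (r"^lm_head\.weight$", "output.weight"),
]


def rename_keys(state_dict):
    """Return a new dict whose keys follow the target naming scheme.

    Values are passed through untouched. Unknown keys raise KeyError so that
    a missing mapping is noticed instead of being silently dropped.
    """
    renamed = {}
    for key, value in state_dict.items():
        for pattern, replacement in _HF_TO_META:
            new_key, count = re.subn(pattern, replacement, key)
            if count:
                renamed[new_key] = value
                break
        else:
            raise KeyError(f"no mapping for {key!r}")
    return renamed


# Placeholder strings stand in for tensors in this hypothetical example.
hf_like = {
    "model.embed_tokens.weight": "E",
    "model.layers.0.self_attn.q_proj.weight": "Q0",
    "lm_head.weight": "O",
}
meta_like = rename_keys(hf_like)
```

Failing loudly on an unmapped key is a deliberate choice in this sketch: a checkpoint with silently missing weights is harder to debug than an early error.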

pytorch-bot · 2024-05-06T23:05:53Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/3523

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 12f514e with merge base 1b73db4:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2024-05-06T23:07:54Z

@iseeyuan has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2024-05-06T23:37:07Z

@iseeyuan has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

mergennachin · 2024-05-07T14:14:10Z

examples/models/llama2/README.md

+sd = convert_weights.tune_to_meta(sd['model'])
+
+print("saving checkpoint")
+torch.save(sd, "/destination/dir/checkpoint.pth")


nit: `/the/destination/dir/checkpoint.pth`

mergennachin · 2024-05-07T14:16:05Z

examples/models/llama2/README.md

@@ -117,6 +117,33 @@ You can export and run the original Llama3 8B model.

    Due to the larger vocabulary size of Llama3, we recommend quantizing the embeddings with `--embedding-quantize 4,32` to further reduce the model size.

+### Option D: Download models from Hugging Face and convert


and convert from safetensor format to state dict

facebook-github-bot · 2024-05-07T15:47:28Z

@iseeyuan has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

facebook-github-bot · 2024-05-08T01:51:13Z

@iseeyuan merged this pull request in 2c1e283.

Add instructions to convert Hugging Face safetensor models to PyTorch

bc70dd6

facebook-github-bot added the CLA Signed label May 6, 2024

iseeyuan linked an issue May 6, 2024 that may be closed by this pull request

How can I convert llama3 safetensors to the pth file needed to use with executorch? #3303

Closed

lint

deb1327

mergennachin self-requested a review May 7, 2024 14:13

mergennachin approved these changes May 7, 2024


comments

12f514e

facebook-github-bot closed this in 2c1e283 May 8, 2024

facebook-github-bot added the Merged label May 8, 2024
