
Convert HF Llama Checkpoints to Neox Checkpoints #994

Open
sxthunder opened this issue Jul 10, 2023 · 1 comment
Labels
feature request New feature or request

Comments

@sxthunder

Hello, I am excited that gpt-neox now supports the llama model. However, the script in tools/convert_raw_llama_weights_to_neox.py only supports the original llama weights. Considering the large number of users currently on Huggingface, would it be possible to provide a script for converting the Huggingface Llama model into NeoX?

In my experiments, training speed and memory usage in gpt-neox are much better than in other language model frameworks, even when training with LoRA. So I would like to train with gpt-neox if it supports the model.
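To illustrate what such a converter would involve, here is a minimal sketch of the core step: renaming the keys of a Huggingface-Llama-style state dict into a NeoX-style layout. The NeoX key patterns below are illustrative assumptions only; the real mapping would have to match what tools/convert_raw_llama_weights_to_neox.py (and the NeoX model definition) actually expect, and a full converter would also need to handle tensor-parallel sharding and any weight permutations.

```python
import re

# Illustrative renaming table (HF Llama pattern -> assumed NeoX-style pattern).
# These target names are placeholders for the sketch, not the layout used by
# tools/convert_raw_llama_weights_to_neox.py.
KEY_MAP = [
    (r"^model\.embed_tokens\.weight$", "embed_in.weight"),
    (r"^model\.layers\.(\d+)\.self_attn\.q_proj\.weight$", r"layers.\1.attention.query.weight"),
    (r"^model\.layers\.(\d+)\.mlp\.gate_proj\.weight$", r"layers.\1.mlp.gate.weight"),
    (r"^model\.norm\.weight$", "final_norm.weight"),
    (r"^lm_head\.weight$", "embed_out.weight"),
]

def convert_key(hf_key):
    """Map one HF Llama parameter name to the (assumed) NeoX name."""
    for pattern, replacement in KEY_MAP:
        new_key, n_subs = re.subn(pattern, replacement, hf_key)
        if n_subs:
            return new_key
    raise KeyError(f"no mapping for {hf_key!r}")

def convert_state_dict(hf_state_dict):
    """Rename every entry of an HF-style state dict; values pass through unchanged."""
    return {convert_key(k): v for k, v in hf_state_dict.items()}

# Toy state dict; in practice the values would be tensors obtained e.g. via
# LlamaForCausalLM.from_pretrained(...).state_dict() from transformers.
dummy = {
    "model.embed_tokens.weight": "E",
    "model.layers.0.self_attn.q_proj.weight": "Q0",
    "model.norm.weight": "N",
}
print(convert_state_dict(dummy))
```

The same table-driven approach is how the existing raw-weights converter is typically structured, so extending it to HF checkpoints is mostly a matter of swapping the source-side key patterns.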

@sxthunder sxthunder added the feature request New feature or request label Jul 10, 2023
@StellaAthena
Member

@haileyschoelkopf You were the one who ported LLaMA2, right? Do you think it would be easy to adapt that script to port LLaMA1?
