
Reproducing paper results #34

Open
grigorn opened this issue Nov 13, 2023 · 6 comments

grigorn commented Nov 13, 2023

I ran LLM-Pruner with the command specified in the README to prune LLaMA-7B:

python hf_prune.py --pruning_ratio 0.25 \
      --block_wise \
      --block_mlp_layer_start 4 --block_mlp_layer_end 30 \
      --block_attention_layer_start 4 --block_attention_layer_end 30 \
      --pruner_type taylor \
      --test_after_train \
      --device cpu  --eval_device cuda \
      --save_ckpt_log_name llama_prune

and I get the following results:

#Param before: 6738415616, #Param after: 5422977024, Ratio = 80.4785%
PPL after pruning: {'wikitext2': 19.96819234893607, 'ptb': 80.37625124290746}

The perplexities reported in Table 1 of the paper are 19.09 on WikiText2 and 34.21 on PTB. Is there any reason for the difference in these perplexities, especially on PTB? Thanks
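For context, the PPL numbers above come from the script's post-pruning evaluation. A minimal sketch of how a WikiText2 perplexity like this is typically computed (assuming the Hugging Face transformers and datasets APIs and a 2048-token window; the repo's own evaluation code may differ in stride and tokenization details):

import torch
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "yahma/llama-7b-hf"  # the base checkpoint used above; swap in the pruned model as needed
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float16).cuda().eval()

# Concatenate the WikiText-2 test split and score it in fixed-length windows.
test = load_dataset("wikitext", "wikitext-2-raw-v1", split="test")
enc = tokenizer("\n\n".join(test["text"]), return_tensors="pt")

window = 2048
nlls, n_tokens = [], 0
for begin in range(0, enc.input_ids.size(1), window):
    ids = enc.input_ids[:, begin:begin + window].cuda()
    if ids.size(1) < 2:
        break
    with torch.no_grad():
        loss = model(ids, labels=ids).loss      # mean NLL over ids.size(1) - 1 shifted tokens
    nlls.append(loss * (ids.size(1) - 1))       # undo the averaging to get a token-weighted sum
    n_tokens += ids.size(1) - 1

ppl = torch.exp(torch.stack(nlls).sum() / n_tokens)
print(f"wikitext2 PPL: {ppl.item():.2f}")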

horseee (Owner) commented Nov 14, 2023

Hi. Can I check which LLaMA-7B checkpoint you used? decapoda-research/llama-7b-hf, the one in my code, is not available at the moment, and I'm not sure whether that is the reason for the difference.

grigorn (Author) commented Nov 14, 2023

I am using 'yahma/llama-7b-hf'

horseee (Owner) commented Nov 19, 2023

Have you tried a copied version of decapoda-research/llama-7b-hf, e.g., https://huggingface.co/baffo32/decapoda-research-llama-7B-hf?

We will try that kind of checkpoint in the next few days to see whether the results are reproducible with the checkpoints that are still available.

grigorn (Author) commented Nov 20, 2023

With the checkpoint you specified, I could replicate the metrics. Do you know what the difference between the two is? I thought there is only one LLaMA, so the checkpoints should be the same.

horseee (Owner) commented Nov 20, 2023

I have no idea about this 😢.
My guess is that the possible reasons are: (1) the EOS token issue, or (2) the weights of the two checkpoints are slightly different.
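If it helps, hypothesis (2) can be ruled in or out by diffing the two checkpoints tensor by tensor. A rough sketch (assuming both checkpoints fit in CPU RAM, roughly 14 GB each in fp16):

import torch
from transformers import AutoModelForCausalLM

A = "yahma/llama-7b-hf"
B = "baffo32/decapoda-research-llama-7B-hf"

# Load both checkpoints on CPU and compare their state dicts tensor by tensor.
sd_a = AutoModelForCausalLM.from_pretrained(A, torch_dtype=torch.float16).state_dict()
sd_b = AutoModelForCausalLM.from_pretrained(B, torch_dtype=torch.float16).state_dict()

assert sd_a.keys() == sd_b.keys(), "parameter names differ"
diff = [k for k in sd_a if not torch.equal(sd_a[k], sd_b[k])]
print("tensors that differ:", diff or "none")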

grigorn (Author) commented Nov 23, 2023

I checked both the model and the tokenizer. The model weights and tokenizer.get_vocab() are the same, but the special tokens differ: for baffo32 all three special tokens are empty strings. Could this be the reason for the difference? If so, do you know which one is the "true" LLaMA?
[Screenshot comparing the two tokenizers' special tokens, 2023-11-23]
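For reference, the special-token check above can be reproduced with something like this (a sketch using only the standard AutoTokenizer API):

from transformers import AutoTokenizer

# Print the special tokens and vocab size of both checkpoints side by side.
for name in ("yahma/llama-7b-hf", "baffo32/decapoda-research-llama-7B-hf"):
    tok = AutoTokenizer.from_pretrained(name)
    print(name)
    print("  bos:", repr(tok.bos_token), "eos:", repr(tok.eos_token), "unk:", repr(tok.unk_token))
    print("  vocab size:", len(tok.get_vocab()))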
