param_first and param_mix result the same ppl #23

Kausal-Lei · 2023-09-01T11:21:32Z

I ran the following two commands, which differ only in the --taylor option (param_first vs. param_mix):
python hf_prune.py --pruning_ratio 0.62785 --block_wise --block_mlp_layer_start 0 --block_mlp_layer_end 32 --block_attention_layer_start 32 --block_attention_layer_end 32 --pruner_type taylor --base_model /mnt/petrelfs/xxx/llama2-7b --device cpu --eval_device cuda --taylor param_first --save_ckpt_log_name llama_prune --save_model --num_examples 128
python hf_prune.py --pruning_ratio 0.62785 --block_wise --block_mlp_layer_start 0 --block_mlp_layer_end 32 --block_attention_layer_start 32 --block_attention_layer_end 32 --pruner_type taylor --base_model /mnt/petrelfs/xxx/llama2-7b --device cpu --eval_device cuda --taylor param_mix --save_ckpt_log_name llama_prune --save_model --num_examples 128
However, both runs produce the same perplexity (ppl).

Kausal-Lei · 2023-09-01T11:52:51Z

It seems that the gradients are very small (e.g. 1e-5), so acc_grad is close to zero and has little effect on the result.
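
To illustrate the point above, here is a minimal sketch, not the repository's code. It assumes that acc_grad holds squared gradients and that param_mix subtracts half of a second-order term, w * acc_grad * w, from the first-order term w * grad. The function names and the random data are made up for illustration.

```python
import random

def first_order(w, g):
    # First-order Taylor term: weight times gradient.
    return w * g

def mixed_order(w, g, acc_grad):
    # Assumed form of the mixed score: first-order term minus half of a
    # second-order term that uses acc_grad in place of the Hessian diagonal.
    return w * g - 0.5 * w * acc_grad * w

random.seed(0)
n = 1000
weights = [random.uniform(-1.0, 1.0) for _ in range(n)]
grads = [random.uniform(-1e-5, 1e-5) for _ in range(n)]  # gradients around 1e-5
acc_grads = [g * g for g in grads]  # assumption: acc_grad is the squared gradient

first = [abs(first_order(w, g)) for w, g in zip(weights, grads)]
mixed = [abs(mixed_order(w, g, a)) for w, g, a in zip(weights, grads, acc_grads)]

# Largest relative gap between the two importance scores.
max_rel_diff = max(abs(f - m) / f for f, m in zip(first, mixed) if f > 0)
print(f"max relative difference: {max_rel_diff:.2e}")
```

Under these assumptions the second-order term is about |w * grad| / 2 times the size of the first-order term, so with gradients around 1e-5 the two scores differ by only a few parts per million. That would be too small to change which weights get pruned in most cases, which is consistent with both runs giving the same ppl.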
