Add Qwen2MoE #593

Open · wants to merge 3 commits into main

Conversation

bozheng-hit

Adding Qwen2MoE

This PR adds support for the upcoming Qwen2MoE models. For information about Qwen, please visit https://github.com/QwenLM/Qwen.

@@ -737,6 +737,21 @@ def get_checkpoints(model_name_or_path: str, extensions: List[str], possible_mod

return False, resolved_archive_file, true_model_basename


def get_moe_inside_layer_modules(inside_layer_modules, num_experts):
Collaborator:

Could you add type hints and a small description here?

Comment on lines +409 to +411
if hasattr(self.model.config, "num_experts"):
    inside_layer_modules = get_moe_inside_layer_modules(
        self.inside_layer_modules, self.model.config.num_experts
    )
Collaborator:


Why is self.inside_layer_modules not enough?

Author:


We cannot write out the names of all the parameters in our MoE model, because the number of experts can be large and different model sizes may use different numbers of experts.
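
For illustration only, a minimal sketch of what such a helper might look like with the type hints and description requested above; the `{expert_idx}` placeholder convention and the expansion logic are assumptions, not necessarily what this PR implements:

```python
from typing import List


def get_moe_inside_layer_modules(
    inside_layer_modules: List[List[str]], num_experts: int
) -> List[List[str]]:
    """Expand per-expert module name templates for an MoE model.

    Module names containing the (hypothetical) "{expert_idx}" placeholder
    are duplicated once per expert; all other names are kept unchanged.
    """
    expanded = []
    for group in inside_layer_modules:
        new_group: List[str] = []
        for name in group:
            if "{expert_idx}" in name:
                # One entry per expert, e.g. "mlp.experts.0.gate_proj", ...
                new_group.extend(
                    name.format(expert_idx=i) for i in range(num_experts)
                )
            else:
                new_group.append(name)
        expanded.append(new_group)
    return expanded
```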

Contributor:


I agree it is a bit clunky, but you could just list the highest number of experts you might have, and models with fewer experts would still work.

Author:


> I agree it is a bit clunky, but you could just list the highest number of experts you might have, and models with fewer experts would still work.

It would work, but I think the current code looks much better than writing hundreds of parameter names in the modeling file.

Contributor:


I don't disagree, but I would prefer a separate PR for something like this so that others could rely on it. I know that only makes for a slightly cleaner git history; I just prefer PRs to stay focused and make the minimum changes required. Feel free to ignore me, it's not that important.

LaaZa (Contributor) commented Mar 23, 2024

My review isn't going to help here: I can't merge, and I don't have the memory to even try this. Actually, I'm not even quite sure where the reference model is. Since this adds a new feature, would you consider making a tiny version of the model and creating a test for it?
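
For reference, a tiny randomly initialized Qwen2MoE checkpoint for such a test could be sketched roughly like this, assuming a transformers release that ships Qwen2MoeConfig (the exact config field names may differ between versions, and the dimensions here are arbitrary):

```python
from transformers import Qwen2MoeConfig, Qwen2MoeForCausalLM

# Deliberately tiny dimensions so the test fixture fits in CI memory.
config = Qwen2MoeConfig(
    vocab_size=1024,
    hidden_size=32,
    intermediate_size=64,
    moe_intermediate_size=32,
    shared_expert_intermediate_size=64,
    num_hidden_layers=2,
    num_attention_heads=4,
    num_key_value_heads=2,
    num_experts=4,
    num_experts_per_tok=2,
)
model = Qwen2MoeForCausalLM(config)  # random weights, quality irrelevant for a test
model.save_pretrained("tiny-qwen2moe")  # reusable fixture for a quantization test
```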

LaaZa (Contributor) commented Apr 22, 2024

> TypeError: qwen2_moe isn't supported yet.
>
> Name         Version    Build     Channel
> auto-gptq    0.7.1      pypi_0    pypi

Are you even using this PR? Also, you need at least transformers>=4.39.0.

wellcasa commented

No, I haven't used this PR yet. I saw this branch and I'm very happy; let's wait for it to be merged into the main branch. Qwen2MoE is very fast and the results are not bad. Great.

However, vLLM currently does not support Qwen2MoE int4, so I'm also waiting and hoping for support there.

LaaZa (Contributor) commented Apr 23, 2024

> No, I haven't used this PR yet. I saw this branch and I'm very happy; let's wait for it to be merged into the main branch. Qwen2MoE is very fast and the results are not bad. Great.
>
> However, vLLM currently does not support Qwen2MoE int4, so I'm also waiting and hoping for support there.

Then don't post errors here; we know it isn't supported until this is merged. You can say you want this, but posting random errors is not the way.

wellcasa commented

> No, I haven't used this PR yet. I saw this branch and I'm very happy; let's wait for it to be merged into the main branch. Qwen2MoE is very fast and the results are not bad. Great.
>
> However, vLLM currently does not support Qwen2MoE int4, so I'm also waiting and hoping for support there.

> Then don't post errors here; we know it isn't supported until this is merged. You can say you want this, but posting random errors is not the way.

Okay, sorry, I deleted it.
