
Bug in ORTSmoothQuant._adjust_weights() #1435

Open
pavelkochnev opened this issue Nov 30, 2023 · 5 comments

@pavelkochnev (Contributor)
I think there is a bug in ORTSmoothQuant._adjust_weights(). Part of this method is shown below:

    def _adjust_weights(self, scales):
        """Adjust the weights with scale.

        Args:
            scales (dict): The input scales
        """
        for tensor_name, nodes in self.tensors_to_node.items():
            for node_info in nodes:
                key = node_info[0] if self.scales_per_op else tensor_name
                if key not in scales:
                    continue
                input = node_info[1][1]
                node = self.model.input_name_to_nodes[input][0]
                weight = numpy_helper.to_array(
                    self.model.get_initializer(input),
                    base_dir=os.path.dirname(self.model.model_path) if self.model.model_path is not None else "",
                )

The error occurs on this line:

node = self.model.input_name_to_nodes[input][0]

self.model.input_name_to_nodes[input] can return a list with more than one element, but the code always takes the first one.

I think this line should look something like this:

node = self.model.get_node(node_info[0])

This line has the same issue, but it can simply be removed.

I also suggest checking the ORTSmoothQuant._get_smooth_scales() method: the same issue appears here and here.
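
To illustrate the failure mode, here is a minimal standalone sketch (not from the library; the model and the mapping are built by hand here, and the mapping is only assumed to be equivalent to what self.model.input_name_to_nodes provides). Two Conv nodes share one initializer, so the weight name maps to two consumer nodes, and indexing [0] silently drops the second:

    import numpy as np
    from onnx import TensorProto, helper, numpy_helper

    # Two Conv nodes consuming the same initializer "W" (a shared weight).
    shared_w = numpy_helper.from_array(np.ones((1, 1, 3, 3), dtype=np.float32), name="W")
    conv1 = helper.make_node("Conv", ["X", "W"], ["Y1"], name="conv1")
    conv2 = helper.make_node("Conv", ["X", "W"], ["Y2"], name="conv2")
    graph = helper.make_graph(
        [conv1, conv2],
        "shared_weight_demo",
        [helper.make_tensor_value_info("X", TensorProto.FLOAT, [1, 1, 8, 8])],
        [
            helper.make_tensor_value_info("Y1", TensorProto.FLOAT, None),
            helper.make_tensor_value_info("Y2", TensorProto.FLOAT, None),
        ],
        initializer=[shared_w],
    )

    # Rebuild the input-name-to-consumers mapping by hand.
    input_name_to_nodes = {}
    for node in graph.node:
        for inp in node.input:
            input_name_to_nodes.setdefault(inp, []).append(node)

    print([n.name for n in input_name_to_nodes["W"]])  # ['conv1', 'conv2']
    # ...[0] would pick conv1 and silently ignore conv2.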

@mengniwang95 (Collaborator)

Hi @pavelkochnev, thank you for your suggestions.

input = node_info[1][1]

is the weight name of the node.
If each node has its own weight, self.model.input_name_to_nodes[input] returns a list of length 1, and the subsequent processing is correct.
If more than one node shares a single weight tensor, the result will be wrong because that weight gets updated more than once.
Maybe adding a warning and handling for shared weights is what I need to do. 😃

@pavelkochnev (Contributor, Author)

@mengniwang95, thanks for the reply. I think you are right. I have examined several models, and in each of them every conv has its own weight. However, I still think that replacing the line would make the code more readable, and the code should raise an error when a model has shared weights (for now).

@mengniwang95 (Collaborator)

Hi @pavelkochnev, replacing the line would indeed make the code more readable.
But in more detail: get_node iterates over the nodes of the model until it finds the target, so its time complexity is O(n), while input_name_to_nodes[input] fetches the node from a dict by key, which is O(1).
That is why I use input_name_to_nodes[input] rather than get_node. What do you think about it?
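
For reference, here is a minimal sketch of the two lookup strategies being compared (simplified stand-ins, not the actual ONNXModel implementations):

    def get_node(graph, name):
        """O(n): scan every node until the name matches."""
        for node in graph.node:
            if node.name == name:
                return node
        return None

    def build_input_name_to_nodes(graph):
        """Built once up front; afterwards each lookup is an O(1) dict access."""
        mapping = {}
        for node in graph.node:
            for inp in node.input:
                mapping.setdefault(inp, []).append(node)
        return mapping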

@pavelkochnev (Contributor, Author)

@mengniwang95 You are right again. Would you like to add an additional method to the class?
Something like this:

    def _get_node_by_weight(self, weight_name):
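        """Return the single node that consumes weight_name; models with shared weights are rejected."""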
        nodes = self.model.input_name_to_nodes[weight_name]
        if len(nodes) == 1:
            return nodes[0]
        else:
            raise NotImplementedError("Models with shared weights not supported")
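
With such a helper, the call site in _adjust_weights() could become (hypothetical, based on the snippet above):

    node = self._get_node_by_weight(node_info[1][1])

which keeps the O(1) dict lookup while failing loudly on shared weights.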

@mengniwang95 (Collaborator)

Hi @pavelkochnev, sure, we will put this into our enhancement plan.
