Describe the issue

Running a model with a LayerNormalization op produces wrong results when all of the following conditions apply:
The following graph reproduces the issue:
This appears to be caused by a side effect of the Dnnl LayerNormalization kernel, which modifies its input tensor in place.
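The suspected failure mode can be sketched in plain NumPy (a hypothetical illustration, not the actual Dnnl kernel code): if the kernel writes the normalized values back into its input buffer, the downstream Add consumes the normalized tensor instead of the original input, so y = LayerNormalization(x) + x effectively becomes 2 * LayerNormalization(x).

```python
import numpy as np

def layer_norm(x, eps=1e-5):
    # Side-effect-free reference implementation.
    centered = x - x.mean(axis=-1, keepdims=True)
    return centered / np.sqrt(x.var(axis=-1, keepdims=True) + eps)

def layer_norm_inplace(x, eps=1e-5):
    # Hypothetical kernel that normalizes x in its own buffer,
    # mimicking the suspected side effect on the input.
    x -= x.mean(axis=-1, keepdims=True)
    x /= np.sqrt(x.var(axis=-1, keepdims=True) + eps)
    return x

x = np.random.rand(3, 3).astype(np.float32)

# Correct result: Add sees the original input.
expected = layer_norm(x) + x

# Buggy result: the input buffer is clobbered with the normalized
# values before Add reads it.
x_clobbered = x.copy()
y = layer_norm_inplace(x_clobbered)
actual = y + x_clobbered  # Add now sees the normalized tensor

assert not np.allclose(expected, actual)
```

This only bites when the LayerNormalization input has another consumer in the graph, which matches the minimal model below.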
To reproduce

The following snippet builds the minimal example model and compares the results of the CPU and oneDNN EPs:
```python
import onnx
import onnx.helper
import numpy as np
import onnxruntime


def create_layer_norm_model():
    # Define input and output names
    input_name = 'input'
    output_name = 'output'

    # Define input shape
    input_shape = ('batch_size', 3)

    # Create input tensor
    input_tensor = onnx.helper.make_tensor_value_info(
        input_name, onnx.TensorProto.FLOAT, input_shape)

    # Constant scale (ones)
    scale = onnx.helper.make_node(
        'Constant', inputs=[], outputs=['scale'],
        value=onnx.numpy_helper.from_array(np.ones((3,), dtype=np.float32))
    )

    # Constant bias (zeros)
    bias = onnx.helper.make_node(
        'Constant', inputs=[], outputs=['bias'],
        value=onnx.numpy_helper.from_array(np.zeros((3,), dtype=np.float32))
    )

    # Create layer normalization node
    layer_norm_node = onnx.helper.make_node(
        'LayerNormalization',
        inputs=[input_name, 'scale', 'bias'],
        outputs=[output_name],
        epsilon=1e-5
    )

    # Create addition node: reuses the LayerNormalization input
    add_node = onnx.helper.make_node(
        'Add',
        inputs=[input_name, output_name],
        outputs=[output_name + '_add']
    )

    # Create output tensor
    output_tensor = onnx.helper.make_tensor_value_info(
        output_name + '_add', onnx.TensorProto.FLOAT, input_shape)

    # Create graph (nodes listed in topological order)
    graph_def = onnx.helper.make_graph(
        nodes=[scale, bias, layer_norm_node, add_node],
        name='layer_norm_model',
        inputs=[input_tensor],
        outputs=[output_tensor]
    )

    # Create model
    return onnx.helper.make_model(
        graph_def,
        producer_name='onnx-example',
        doc_string="A model with input x and output y = LayerNormalization(x) + x"
    )


def test_model_against_cpu_and_dnnl_eps(model):
    # Generate dummy input
    input_data = np.random.rand(3, 3).astype(np.float32)

    # Create ONNX Runtime sessions with different execution providers
    sess_cpu = onnxruntime.InferenceSession(model, providers=['CPUExecutionProvider'])
    sess_dnnl = onnxruntime.InferenceSession(model, providers=['DnnlExecutionProvider'])

    # Run the model on both providers
    output_cpu = sess_cpu.run(['output_add'], {'input': input_data})
    output_dnnl = sess_dnnl.run(['output_add'], {'input': input_data})

    # Check if the outputs are the same
    np.testing.assert_allclose(output_cpu, output_dnnl, rtol=1e-03, atol=1e-05)


# Create the ONNX model
model = create_layer_norm_model()

# Save the model to a file
onnx.save(model, 'layer_norm_model.onnx')

# Test the model against the CPU and DNNL execution providers
test_model_against_cpu_and_dnnl_eps('layer_norm_model.onnx')
```
Urgency

No response

Platform

Linux

OS Version

Ubuntu 20.04.5 LTS

ONNX Runtime Installation

Built from Source

ONNX Runtime Version or Commit ID

737eb48

ONNX Runtime API

Python

Architecture

X64

Execution Provider

oneDNN

Execution Provider Library Version

No response
Opened a PR fixing the issue: #20624