[CUDA EP] Fix BeamSearch on T5 with sequence_as_input_ids (#20667) #20668

amancini-N · 2024-05-13T11:26:48Z

Description

Change the implementation of BeamSearch op when using CUDA EP: in case of T5 model, and in case the decoder input_ids are sequences, copy the sequences device-to-device instead of host-to-device

Motivation and Context

Fixes BeamSearch op returning wrong results on CUDA execution provider when sequence is used as input_ids #20667

tianleiwu · 2024-05-13T16:09:50Z

/azp run Windows ARM64 QNN CI Pipeline,Windows x64 QNN CI Pipeline,Windows CPU CI Pipeline,Windows GPU CI Pipeline,Windows GPU TensorRT CI Pipeline,ONNX Runtime Web CI Pipeline,Linux CPU CI Pipeline,Linux CPU Minimal Build E2E CI Pipeline,Linux GPU CI Pipeline,Linux GPU TensorRT CI Pipeline

tianleiwu · 2024-05-13T16:09:52Z

/azp run Linux OpenVINO CI Pipeline,Linux QNN CI Pipeline,MacOS CI Pipeline,orttraining-amd-gpu-ci-pipeline,orttraining-linux-ci-pipeline,orttraining-linux-gpu-ci-pipeline,orttraining-ortmodule-distributed,onnxruntime-binary-size-checks-ci-pipeline,Big Models,Android CI Pipeline

tianleiwu · 2024-05-13T16:09:53Z

/azp run iOS CI Pipeline,ONNX Runtime React Native CI Pipeline

azure-pipelines · 2024-05-13T16:10:11Z

Azure Pipelines successfully started running 2 pipeline(s).

azure-pipelines · 2024-05-13T16:10:31Z

Azure Pipelines successfully started running 10 pipeline(s).

azure-pipelines · 2024-05-13T16:10:35Z

Azure Pipelines successfully started running 10 pipeline(s).

tianleiwu · 2024-05-14T20:01:55Z

@amancini-N, could you take a look at those build and test errors in CI pipeline. Let me know if you need help to resolve them.

Fix BeamSearch on T5 with sequence_as_input_ids (microsoft#20667)

6d8dfe7

amancini-N mentioned this pull request May 13, 2024

BeamSearch op returning wrong results on CUDA execution provider when sequence is used as input_ids #20667

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CUDA EP] Fix BeamSearch on T5 with sequence_as_input_ids (#20667) #20668

[CUDA EP] Fix BeamSearch on T5 with sequence_as_input_ids (#20667) #20668

amancini-N commented May 13, 2024

tianleiwu commented May 13, 2024

tianleiwu commented May 13, 2024

tianleiwu commented May 13, 2024

azure-pipelines bot commented May 13, 2024

azure-pipelines bot commented May 13, 2024

azure-pipelines bot commented May 13, 2024

tianleiwu commented May 14, 2024

[CUDA EP] Fix BeamSearch on T5 with sequence_as_input_ids (#20667) #20668

Are you sure you want to change the base?

[CUDA EP] Fix BeamSearch on T5 with sequence_as_input_ids (#20667) #20668

Conversation

amancini-N commented May 13, 2024

Description

Motivation and Context

tianleiwu commented May 13, 2024

tianleiwu commented May 13, 2024

tianleiwu commented May 13, 2024

azure-pipelines bot commented May 13, 2024

azure-pipelines bot commented May 13, 2024

azure-pipelines bot commented May 13, 2024

tianleiwu commented May 14, 2024