
TensorRT 8.6.3.1 package on PyPI for Triton NVIDIA Inference Server version > 24.01 #7221

Open
aptmess opened this issue May 15, 2024 · 3 comments

Comments

@aptmess

aptmess commented May 15, 2024

Description

For Triton NVIDIA Inference Server versions newer than 24.01 (starting with 24.02), the supported TensorRT version is 8.6.3.1. I am using the tensorrt Python package in a script that converts ONNX weights to a TRT engine, but the latest version available on PyPI is 8.6.1.6. Because of this I can't use tensorrt_backend in Triton and get this error:

The engine plan file is not compatible with this version of TensorRT, 
       expecting library version 8.6.3.1 got 8.6.1.6, please rebuild.

Is it possible to upload this package version (8.6.3.1) to PyPI? Or how can I rewrite this script using other tools?

import tensorrt as trt

# Explicit-batch network flag, required when parsing ONNX models in TensorRT 8.x
explicit_batch_flag = 1 << int(trt.NetworkDefinitionCreationFlag.EXPLICIT_BATCH)
max_batch_size: int = 32
max_workspace_size: int = 1 << 30  # 1 GiB

logger = trt.Logger(trt.Logger.INFO)

with (
    trt.Builder(logger) as builder,
    builder.create_network(explicit_batch_flag) as network,
    trt.OnnxParser(network, logger) as parser,
):
    config = builder.create_builder_config()
    # Cap the scratch memory the builder may use during engine optimization
    config.set_memory_pool_limit(trt.MemoryPoolType.WORKSPACE, max_workspace_size)
    ...
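One other tool for the same conversion is the `trtexec` CLI that ships inside the NGC containers; running it in the container that matches the Triton release guarantees the engine is built with the bundled TensorRT version. A sketch, assuming Docker with GPU support is available (the image tag and the `model.onnx`/`model.plan` paths are placeholders, not from this thread):

```shell
# Sketch: build the engine with the TensorRT version bundled in the
# NGC container matching the Triton release (24.02 here).
docker run --rm --gpus all -v "$(pwd)":/workspace \
    nvcr.io/nvidia/tensorrt:24.02-py3 \
    trtexec --onnx=/workspace/model.onnx \
            --saveEngine=/workspace/model.plan \
            --memPoolSize=workspace:1024M
```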

Triton Information

24.02–24.04, official Docker images (no additional building)

To Reproduce

  1. pip install tensorrt==8.6.1.6
  2. Run the script that compiles the ONNX model to a TensorRT engine
  3. Run Triton with a version > 24.01 (works on 24.01, but not on 24.02)

Linked issue

Expected behavior

I can use the TensorRT backend with a matching library version.

@statiraju

@mc-nv can you take a look?

@mc-nv
Collaborator

mc-nv commented May 15, 2024

Triton is part of the NVIDIA Optimized Frameworks.

It's not possible to change a subset of the libraries released with the NVIDIA Optimized Frameworks on demand.
https://docs.nvidia.com/deeplearning/frameworks/support-matrix/index.html

We understand the request. If you want to use TensorRT 8.6.1 within the latest container, please feel free to modify the image for your own needs.
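Modifying the image could look like the following Dockerfile sketch. This is an assumption, not a confirmed recipe: the base tag is a placeholder, and pinning only the Python bindings may still mismatch the system `libnvinfer` libraries in the image.

```dockerfile
# Sketch only: pin a different tensorrt wheel inside a Triton image.
FROM nvcr.io/nvidia/tritonserver:24.04-py3

# Replace the bundled Python bindings; the native TensorRT libraries in the
# image would still need to match, so this is not guaranteed to work.
RUN pip install --no-cache-dir tensorrt==8.6.1.6
```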

@aptmess
Author

aptmess commented May 16, 2024

Thanks!

@mc-nv can you help me: how do I install tensorrt==8.6.1.6 in the Triton image?

I am trying to follow this guide (https://docs.nvidia.com/deeplearning/tensorrt/archives/tensorrt-861/install-guide/index.html#installing-debian), but I am getting errors like:

dpkg: error: cannot access archive '...': No such file or directory
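That dpkg error typically means the local-repo `.deb` was never downloaded to the path being passed to `dpkg -i`. A sketch of the sequence the linked install guide describes, assuming the local repo package has already been fetched from the NVIDIA developer site (the filename below is a placeholder and depends on the OS/CUDA variant you downloaded):

```shell
# Placeholder filename: substitute the nv-tensorrt-local-repo package you
# actually downloaded from developer.nvidia.com for TensorRT 8.6.1.
sudo dpkg -i nv-tensorrt-local-repo-${os}-8.6.1-cuda-12.0_1.0-1_amd64.deb
sudo cp /var/nv-tensorrt-local-repo-*/nv-tensorrt-local-*-keyring.gpg /usr/share/keyrings/
sudo apt-get update
sudo apt-get install tensorrt
```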
