Model Deployment Archive - Page 3 of 4

TensorRT – The "TensorRT was linked against cuBLAS/cuBLAS LT 11.6.1 but loaded cuBLAS/cuBLAS LT 11.5.4" error message

1 TensorRT was linked against cuBLAS/cuBLAS LT 11.6.1 but loaded cuBLAS/cuBLAS LT 11.5.4. When running model inference with TensorRT, you will sometimes see the following log message printed by TensorRT itself: TensorRT was linked a…

StubbornHuang
TensorRT
2023-03-02
854
0

TensorRT – The role of the workspace

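1 The role of the workspace in TensorRT. TensorRT provides the workspace as temporary storage for each network layer during execution. This space is shared in order to reduce GPU memory usage. When using the TensorRT SDK, the workspace requirement is determined by calling getWorkspaceSize() when constructing the Builder; if…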

StubbornHuang
TensorRT
2022-12-21
3,944
1

TensorRT – Using PreviewFeature::kFASTER_DYNAMIC_SHAPES_0805 can help improve performance and resolve potential functional issues

1 Problem. Today, while using the trtexec tool from TensorRT 8.5 to convert an onnx model with dynamic input dimensions, the following message appeared: Using PreviewFeature::kFASTER_DYNAMIC_SHAPES_0805 can help improve performance and resolve p…

StubbornHuang
TensorRT
2022-12-15
2,567
0

TensorRT – Unresolved external symbol "class sample::Logger sample::gLogger" error when using the C++ SDK

1 Unresolved external symbol "class sample::Logger sample::gLogger" error when compiling with the TensorRT C++ SDK. Today, while writing model deployment code with the TensorRT C++ SDK, my code contained this line: initLibNvInferPlugins(&sa…

StubbornHuang
TensorRT
2022-12-12
1,802
0

TensorRT – Using the Polygraphy tool to check whether the onnx model and the TensorRT model produce consistent inference results

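1 Why compare the inference results of the onnx model and the TensorRT model. When a model is deployed with TensorRT acceleration, the usual route is deep learning framework > onnx model > TensorRT model. During the conversion from the onnx model to the TensorRT model, some operators may lose precision, which affects the subsequent inference results,…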

StubbornHuang
TensorRT
2022-12-12
4,180
1

TensorRT – Installing the onnx_graphsurgeon library and an introduction to its API

1 onnx_graphsurgeon 1.1 Introduction to onnx_graphsurgeon. onnx_graphsurgeon is a convenient, easy-to-use library provided by NVIDIA for creating and modifying onnx models. Github: https://github.com/NVIDIA/TensorRT/tree/master/tool…

StubbornHuang
TensorRT
2022-12-05
5,042
1

TensorRT – "Slice_74 requires bool or uint8 I/O but node can not be handled by Myelin" error when converting an onnx model

1 The "Error Code 2: Internal Error (Slice_74 requires bool or uint8 I/O but node can not be handled by Myelin" error 1.1 Problem. Today, while converting an onnx model with TensorRT, the following error appeared: Converting the onnx…

StubbornHuang
TensorRT
2022-12-02
2,690
0

Python – "RuntimeError: Exporting the operator pad_sequence to ONNX opset version 13 is not supported" error when exporting a model to onnx

1 "RuntimeError: Exporting the operator pad_sequence to ONNX opset version 13 is not supported" error when exporting a model to onnx. Today, while exporting a model to onnx in pytorch, because I had used from torch.nn.uti…

StubbornHuang
Model Deployment
2022-12-01
2,731
1

TensorRT – "Could not locate zlibwapi.dll. Please make sure it is in your library path!" error when converting a model

1 "Could not locate zlibwapi.dll. Please make sure it is in your library path!" error when converting a model with TensorRT 1.1 Problem. Today, while converting an onnx model with TensorRT, the following error appeared: [12/01/2022-09:03:…

StubbornHuang
TensorRT
2022-12-01
2,075
0

Python – Loading onnx models and running inference with onnxruntime

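1 onnxruntime. ONNX Runtime is a cross-platform machine learning model accelerator that runs on different hardware and operating systems. It can load, run inference on, and accelerate onnx models exported from any machine learning framework. Using onnxruntime generally involves the following steps: export the model from the machine learning framework to onnx; use onnxruntime to…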

StubbornHuang
Model Deployment
2022-11-30
3,092
0

TensorRT – Extending the model input dimensions of the TensorRT C++ API by adding Dims5, Dims6, Dims7, Dims8

1 Model input dimensions supported by the TensorRT C++ API. In TensorRT 7.0 and later, we usually specify the input dimensions with statements like the following: const std::string input_name = "input"; const std::string output_name = "output"; con…

StubbornHuang
TensorRT
2021-10-19
3,075
0

TensorRT – Great news: starting with TensorRT 8.2 EA, the Einsum (Einstein summation) operator is supported

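1 TensorRT 8.2 EA supports the Einstein summation operator Einsum. TensorRT 8.2 Early Access, the new version NVIDIA released on October 6, 2021, finally supports the Einstein summation operator Einsum, which saves a good deal of time for those working on production model deployment. In TensorRT, the specifically supported On…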

StubbornHuang
TensorRT
2021-10-12
3,007
0
