Security Notice: Triton Inference Server - November 2023

Updated 12/18/2023 01:10 PM

This notice is regarding Triton Inference Server.

December 4, 2023:

Triton Inference Server is designed for flexibility and allows developers to create and deploy inferencing solutions in various ways. Triton Inference Server enables teams to deploy any AI model from multiple deep learning and machine learning frameworks, including TensorRT, TensorFlow, PyTorch, ONNX, OpenVINO, Python, RAPIDS FIL, and more.

NVIDIA has developed a Secure Deployment Considerations Guide to help our users make knowledgeable decisions that preemptively consider Secure Deployment. This guide provides best practices that users deploying Triton-based solutions should consider integrating to fortify the setup and deployment of their Model Repository.

Additionally, NVIDIA made the following items available in the Triton development branch on November 10, 2023, all of which are available in the release branch today, December 4, 2023.

Updated software that behaves as follows:
- Provides the ability to restrict the HTTP endpoint of the model load API
- Prevents the model load API configuration option from accessing directories outside the model directory

The latest release can be installed from the Triton Inference Server Release Github page or the NGC Triton Inference Server page.

Revision History

Revision	Date	Description
1.1	December 18, 2023	Release date updated to December 4, 2023
1.0	November 30, 2023	Initial release

Disclaimer

ALL NVIDIA INFORMATION, DESIGN SPECIFICATIONS, REFERENCE BOARDS, FILES, DRAWINGS, DIAGNOSTICS, LISTS, AND OTHER DOCUMENTS (TOGETHER AND SEPARATELY, “MATERIALS”) ARE BEING PROVIDED “AS IS.” NVIDIA MAKES NO WARRANTIES, EXPRESS, IMPLIED, STATUTORY, OR OTHERWISE WITH RESPECT TO THE MATERIALS, AND ALL EXPRESS OR IMPLIED CONDITIONS, REPRESENTATIONS AND WARRANTIES, INCLUDING ANY IMPLIED WARRANTY OR CONDITION OF TITLE, MERCHANTABILITY, SATISFACTORY QUALITY, FITNESS FOR A PARTICULAR PURPOSE AND NON-INFRINGEMENT, ARE HEREBY EXCLUDED TO THE MAXIMUM EXTENT PERMITTED BY LAW.

Information is believed to be accurate and reliable at the time it is furnished. However, NVIDIA Corporation assumes no responsibility for the consequences of use of such information or for any infringement of patents or other rights of third parties that may result from its use. No license is granted by implication or otherwise under any patent or patent rights of NVIDIA Corporation. Specifications mentioned in this publication are subject to change without notice. This publication supersedes and replaces all information previously supplied. NVIDIA Corporation products are not authorized for use as critical components in life support devices or systems without express written approval of NVIDIA Corporation.

NVIDIA SUPPORT

Security Notice: Triton Inference Server - November 2023

December 4, 2023:

Revision History

Disclaimer

Is this answer helpful?

Answers others found helpful

Live Chat

Chat online with one of our support agents

ASK US A QUESTION

Contact Support for assistance

800.797.6530