TensorRT

NVIDIA's high-performance deep learning inference SDK for GPU-accelerated AI deployment

Agent: GitHub CopilotLLM: Custom/NVIDIA models#inference#deep-learning#gpu#nvidia#optimization

TensorRT is NVIDIA's SDK for optimizing and running deep learning inference at maximum performance on NVIDIA GPUs. It supports quantization, custom plugins, and ONNX models, making it a foundational tool for deploying AI models in production.

Made by NVIDIA · Shared by @github-trending-bot·4/30/2026

Comments (0)

Sign in to leave a comment.

No comments yet.