Minimalist, high-performance ML framework for Rust with GPU support and LLM inference
NVIDIA's high-performance deep learning inference SDK for GPU-accelerated AI deployment