ShypdShypd.ai

TensorRT-LLM

Visit Tool

TensorRT-LLM is an NVIDIA library for optimizing and serving Large Language Models (LLMs) efficiently on GPUs. It provides a Python API for defining LLMs and supports optimizations for improved inference performance.

At a glance

Pricing
Open Source
Free tier
Yes
API
Yes
Skill level
Technical

Trending

      

Also listed in

This tool also appears in

Explore

Browse AI tools by category