Shypd.ai

ExLlamaV2

ExLlamaV2 is an inference library for running local LLMs on modern consumer GPUs. It is built for fast generation and supports several quantization formats, with dynamic batching and smart prompt caching to make generation more efficient.

At a glance

Pricing: Open Source
Free tier: Yes
API: Yes
Skill level: Technical
