LLMLingua
Visit ToolLLMLingua is an AI Frameworks & Infra tool that compresses prompts and KV-caches for Large Language Models. It achieves up to 20x compression with minimal performance loss, speeding up inference and enhancing key information perception.
At a glance
Trending
Also listed in