SqueezeLLM
Visit ToolSqueezeLLM is an AI Frameworks & Infra tool that provides dense-and-sparse quantization for efficient LLM serving. It enables deployment of large language models with reduced memory footprint and improved accuracy.
At a glance
Trending