Optillm
Visit ToolOptiLLM is an Open Source & Models tool that optimizes LLM inference for improved accuracy and performance. It acts as an OpenAI API-compatible proxy, implementing 20+ state-of-the-art techniques without requiring model training.
At a glance
Trending