Hqq
hqq is an open-source implementation of Half-Quadratic Quantization (HQQ), a technique that shrinks large machine learning models by quantizing their weights. It speeds up inference and reduces memory usage without requiring calibration data.
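To make "without calibration data" concrete, here is a minimal sketch of weight-only quantization in pure Python. It uses plain min-max round-to-nearest quantization, which is a simpler stand-in and not the half-quadratic solver that hqq implements. The function names, the 4-bit setting, and the sample weights are all assumptions chosen for illustration. The point is that the scale and zero-point come from the weights alone, with no input samples involved.

```python
# Illustrative sketch only: plain min-max round-to-nearest quantization.
# This is NOT the half-quadratic optimization that hqq performs; the names
# and values below are hypothetical.

def quantize_group(weights, nbits=4):
    """Map a group of float weights to nbits-wide integer codes.

    Only the weights themselves are used, so no calibration data is needed.
    """
    w_min, w_max = min(weights), max(weights)
    levels = (1 << nbits) - 1                 # 15 for 4-bit codes
    # Fall back to 1.0 if every weight in the group is identical.
    scale = (w_max - w_min) / levels or 1.0
    codes = [round((w - w_min) / scale) for w in weights]
    return codes, scale, w_min                # w_min acts as the zero-point


def dequantize_group(codes, scale, zero):
    """Reconstruct approximate float weights from integer codes."""
    return [c * scale + zero for c in codes]


weights = [-0.42, 0.13, 0.87, -0.05, 0.31, -0.66, 0.54, 0.02]
codes, scale, zero = quantize_group(weights, nbits=4)
restored = dequantize_group(codes, scale, zero)
max_error = max(abs(w - r) for w, r in zip(weights, restored))
```

Each weight is stored as a 4-bit code plus a shared scale and zero-point per group, which is where the memory saving comes from. The rounding error per weight is bounded by half the scale.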
At a glance
Trending