BitBLAS
Visit ToolBitBLAS is an open-source library that supports mixed-precision matrix multiplications on GPUs, especially for quantized LLM deployment. It enables efficient low-precision deep learning computing through hardware-aware tensor transformation.
At a glance
Trending