Bitsandbytes
Visit Toolbitsandbytes enables accessible large language models via k-bit quantization for PyTorch, dramatically reducing memory consumption for inference and training. It offers 8-bit optimizers, LLM.int8(), and QLoRA for efficient model handling.
At a glance
Trending