Llama2-Webui
Visit Toolllama2-webui allows users to run Llama 2 models locally with a Gradio web UI on various operating systems. It supports different Llama 2 models and backends, including 8-bit and 4-bit inference.
At a glance
Trending
llama2-webui allows users to run Llama 2 models locally with a Gradio web UI on various operating systems. It supports different Llama 2 models and backends, including 8-bit and 4-bit inference.
Trending
About
llama2-webui is an open-source tool designed for running Llama 2 models locally through a Gradio web UI. It offers broad compatibility, supporting all Llama 2 models (7B, 13B, 70B, GPTQ, GGML, GGUF, CodeLlama) and various backends like transformers, bitsandbytes (8-bit inference), AutoGPTQ (4-bit inference), and llama.cpp. The tool can be deployed on Linux, Windows, and Mac, utilizing either GPU or CPU resources. Developers can also leverage `llama2-wrapper` as a local Llama 2 backend for generative agents and applications, and it provides an OpenAI-compatible API for seamless integration with existing clients and libraries. Benchmarking scripts are included to evaluate performance on different devices.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending