Baichuan-7B
Baichuan-7B is an open-source 7-billion-parameter pretrained language model developed by BaiChuan-Inc. It supports both Chinese and English and was trained on approximately 1.2 trillion tokens with a 4096-token context window.
About
Baichuan-7B is a 7-billion-parameter pretrained language model developed by BaiChuan-Inc. Built on the Transformer architecture, it was trained on approximately 1.2 trillion tokens and supports both Chinese and English. The model has a context window of 4096 tokens and has shown strong results on standard Chinese and English benchmarks such as C-Eval and MMLU. Training was optimized for stability and throughput through efficient operators, operator splitting, mixed precision, and communication optimizations, which together yield high utilization of peak GPU compute. The model also uses a tokenizer optimized for compressing Chinese text and offers improved mathematical capabilities.
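To make "GPU peak compute utilization" concrete, here is a minimal sketch of how such a figure is commonly estimated. It assumes the widely used approximation of 6 FLOPs per parameter per training token (which ignores attention-specific cost) and is not taken from BaiChuan-Inc's own measurement method. The throughput and peak-FLOPs values are hypothetical placeholders, not reported numbers for Baichuan-7B.

```python
def training_flops_per_token(n_params: float) -> float:
    """Approximate training cost per token as 6 * N FLOPs.

    Assumption: roughly 2N FLOPs for the forward pass and 4N for the
    backward pass. Attention FLOPs are ignored in this rough estimate.
    """
    return 6.0 * n_params


def compute_utilization(
    n_params: float,
    tokens_per_sec_per_gpu: float,
    peak_flops_per_gpu: float,
) -> float:
    """Return the fraction of peak GPU compute actually achieved."""
    achieved_flops = training_flops_per_token(n_params) * tokens_per_sec_per_gpu
    return achieved_flops / peak_flops_per_gpu


# 7B parameters comes from the model description; the other two values
# are hypothetical and chosen only to illustrate the arithmetic.
N_PARAMS = 7e9
TOKENS_PER_SEC_PER_GPU = 4000.0   # hypothetical measured throughput
PEAK_FLOPS_PER_GPU = 312e12       # hypothetical peak for the accelerator

utilization = compute_utilization(
    N_PARAMS, TOKENS_PER_SEC_PER_GPU, PEAK_FLOPS_PER_GPU
)
print(f"Estimated utilization: {utilization:.1%}")
```

Substituting a measured per-GPU throughput and the accelerator's documented peak gives a comparable estimate for any training run of a dense model of this size.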
Pricing & Plans
Open Source
Free