Xtuner
Visit Toolxtuner is a next-generation training engine built for ultra-large Mixture of Experts (MoE) models. It offers scalable and efficient training, supporting long sequences and various AI models.
At a glance
Trending
Also listed in
xtuner is a next-generation training engine built for ultra-large Mixture of Experts (MoE) models. It offers scalable and efficient training, supporting long sequences and various AI models.
Trending
Also listed in
About
xtuner is a next-generation LLM training engine specifically designed for ultra-large-scale MoE models. Unlike traditional 3D parallel training architectures, XTuner V1 is optimized for mainstream MoE training scenarios, enabling scalable training of 200B-scale MoE models without expert parallelism and 600B models with only intra-node expert parallelism. It features memory-efficient design for long sequence support, allowing 200B MoE models to train on 64k sequence lengths. The engine boasts superior efficiency, supporting MoE training up to 1T parameters and achieving breakthrough FSDP training throughput. It also integrates with leading inference frameworks like LMDeploy, vLLM, and SGLang.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending