Megatron-DeepSpeed
Megatron-DeepSpeed is an open-source tool, listed under Open Source & Models, for training transformer language models at scale. It integrates DeepSpeed into Megatron-LM and supports models such as BERT and GPT-2 for large-scale AI research.
At a glance
Trending