PatrickStar
At a glance
PatrickStar is an open-source tool for parallel training of large language models (LLMs). It enables larger, faster, and greener pretrained models for NLP, democratizing AI for everyone.
About
PatrickStar is an open-source framework developed by Tencent for parallel training of large language models (LLMs), particularly for natural language processing (NLP) applications. It targets the high hardware requirements of training pretrained models (PTMs) by optimizing memory usage. Using chunk-based memory management and heterogeneous training, PatrickStar draws on both CPU and GPU memory, so users can train significantly larger models with fewer GPUs. It has demonstrated the ability to train models like GPT3-175B on a relatively small GPU cluster, making large-model training more accessible and cost-effective for a broader community.
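To make the idea of chunk-based memory management concrete, here is a minimal toy sketch in plain Python. It is not PatrickStar's code or API: the `Chunk` and `ChunkManager` classes, the `gpu_chunk_budget` parameter, and the least-recently-used eviction rule are all hypothetical names and choices made for illustration. The sketch only shows the general pattern of packing tensors into fixed-size chunks and moving whole chunks between CPU and GPU memory when the GPU budget is exceeded.

```python
from dataclasses import dataclass, field


@dataclass
class Chunk:
    """A fixed-capacity block holding several tensors (sizes in elements)."""
    capacity: int
    device: str = "cpu"
    tensors: dict = field(default_factory=dict)  # tensor name -> size

    @property
    def used(self) -> int:
        return sum(self.tensors.values())

    def fits(self, size: int) -> bool:
        return self.used + size <= self.capacity


class ChunkManager:
    """Hypothetical manager: packs tensors into chunks, swaps chunks CPU <-> GPU."""

    def __init__(self, chunk_capacity: int, gpu_chunk_budget: int):
        if gpu_chunk_budget < 1:
            raise ValueError("gpu_chunk_budget must be at least 1")
        self.chunk_capacity = chunk_capacity
        self.gpu_chunk_budget = gpu_chunk_budget  # max chunks resident on the GPU
        self.chunks = []
        self.location = {}  # tensor name -> index of the chunk that holds it
        self.gpu_lru = []   # GPU-resident chunk indices, least recently used first

    def register(self, name: str, size: int) -> None:
        """Append a tensor to the last chunk, opening a new chunk when it is full."""
        if size > self.chunk_capacity:
            raise ValueError(f"{name} does not fit in a single chunk")
        if not self.chunks or not self.chunks[-1].fits(size):
            self.chunks.append(Chunk(self.chunk_capacity))
        self.chunks[-1].tensors[name] = size
        self.location[name] = len(self.chunks) - 1

    def access(self, name: str) -> int:
        """Ensure the tensor's chunk is on the GPU, evicting another chunk if needed."""
        idx = self.location[name]
        if idx in self.gpu_lru:
            self.gpu_lru.remove(idx)  # already resident: refresh its recency below
        else:
            if len(self.gpu_lru) >= self.gpu_chunk_budget:
                evicted = self.gpu_lru.pop(0)  # assumption: simple LRU eviction
                self.chunks[evicted].device = "cpu"
            self.chunks[idx].device = "gpu"
        self.gpu_lru.append(idx)
        return idx


# Usage: three tensors packed into two chunks, with room for one chunk on the GPU.
manager = ChunkManager(chunk_capacity=100, gpu_chunk_budget=1)
for tensor_name, tensor_size in [("w1", 60), ("w2", 30), ("w3", 50)]:
    manager.register(tensor_name, tensor_size)

manager.access("w1")  # chunk 0 moves to the GPU
manager.access("w3")  # chunk 1 moves to the GPU, chunk 0 goes back to the CPU
print([chunk.device for chunk in manager.chunks])
```

Moving whole chunks rather than individual tensors keeps transfers large and contiguous, which is the general motivation for chunking. A real system would move actual tensor storage and choose what to evict based on measured memory pressure rather than a fixed chunk count.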
Pricing & Plans
Open Source
Free