Open-R1
Visit ToolOpen-r1 is an open-source coding & development tool that provides a full reproduction of DeepSeek-R1. It offers scripts for training, evaluating, and generating synthetic data for large language models.
At a glance
Trending
Open-r1 is an open-source coding & development tool that provides a full reproduction of DeepSeek-R1. It offers scripts for training, evaluating, and generating synthetic data for large language models.
Trending
About
Open-r1 is a fully open-source project dedicated to reproducing the DeepSeek-R1 model, providing a comprehensive framework for researchers and developers. The repository includes essential scripts for training models using techniques like Supervised Fine-Tuning (SFT) and Group Relative Policy Optimization (GRPO), as well as tools for evaluating model performance and generating synthetic data. It supports various hardware configurations and integrates with platforms like Hugging Face Hub and Weights and Biases. The project emphasizes community contribution and aims to build the missing pieces of the R1 pipeline, making advanced LLM development accessible and reproducible. It also features specialized functionalities like a code interpreter reward function for competitive programming tasks, supporting sandboxes like E2B and Morph.
Capabilities
Pricing & Plans
Open Source
Free
FAQs
Trending