Miles
Visit Toolmiles is an enterprise-facing reinforcement learning framework for post-training LLMs and VLMs. It emphasizes high-performance rollout, low precision training, and production stability.
At a glance
Trending
miles is an enterprise-facing reinforcement learning framework for post-training LLMs and VLMs. It emphasizes high-performance rollout, low precision training, and production stability.
Trending
About
miles is an enterprise-grade reinforcement learning framework specifically designed for the post-training phase of large language models (LLMs) and vision-language models (VLMs). It is developed as a fork of the 'slime' project, with which it co-evolves, indicating a focus on cutting-edge research and development. The framework's core strengths lie in its ability to facilitate high-performance rollout, enable efficient low-precision training, and ensure robust production stability for complex AI models. It is available as an open-source project on GitHub, promoting transparency and community contributions.
Capabilities
Pricing & Plans
open-source
Free
FAQs
Trending