Tunix
Visit ToolTunix is a JAX-based library designed to streamline the post-training of Large Language Models (LLMs). It provides efficient and scalable support for supervised fine-tuning, reinforcement learning, and agentic RL on TPUs.
At a glance
Trending