R1-V
Visit ToolR1-V is an Open Source & Models tool that reinforces super generalization ability in Vision Language Models (VLM). It provides new VLM-RL environments and a training codebase for improving perception and reasoning.
At a glance
Trending