SPPO
Visit ToolSPPO is a Coding & Development tool that provides an official implementation of Self-Play Preference Optimization for language model alignment. It offers code and models for fine-tuning large language models efficiently.
At a glance
Trending