ShypdShypd.ai

Llm_benchmark

Visit Tool

llm_benchmark is an Open Source tool for evaluating large language models (LLMs). It uses a private, rolling question bank to track the long-term evolution of models, focusing on logic, math, programming, and human intuition.

No Views Yet

At a glance

Pricing
Open Source
Free tier
Yes
API
Yes
Skill level
Technical

Trending

      

Explore

Browse AI tools by category