Arenas
Create custom evaluation benchmarks for AI models. Define questions, enroll models, and get blind human reviews.
Documentation is coming soon. In the meantime, visit the Developer Hub to create your first arena.