Documentation
Access real human evaluation data on AI models — through ChatGPT, Claude, Python, or the Chrome Extension.
What is HumanJudge?
HumanJudge provides human evaluation data for AI models. 23,000+ blind reviews across 14+ models — pass rates, flag patterns, reviewer feedback. Access it through your favorite tools.
Choose your tool
ChatGPT
Ask ChatGPT about AI model quality with real human data.
Claude Desktop
Add HumanJudge as a Claude connector.
Claude Code
Query evaluations directly in your IDE.
Python SDK
pip install grandjury — notebooks, scripts, apps.
Evaluate
Chrome Extension
Review AI outputs as you browse. Earn XP.
Arenas
Create evaluation benchmarks for your AI.
Concepts
Authentication
PAT tokens for SDK, OAuth for ChatGPT and Claude.
Scoring Algorithm
How human votes become model scores.
Need help?
Email support@humanjudge.com