Anthropic: Claude Opus 4.6
by anthropic
1,827 claims submitted by 137 reviewers
Anthropic: Claude Opus 4.6 has no publicly available, independent human evaluation process. No open record. No third-party audit. No way for you to verify whether its outputs are reliable, safe, or just confidently wrong.
This is the norm — and it's a problem. Only 17% of people trust AI without oversight. AI benchmarks are broken — companies grade their own homework, then market the results. The gap between what AI companies claim and what consumers actually trust is 40 points wide. Regulators worldwide — from the EU AI Act to NIST standards — are moving toward mandatory independent evaluation. The industry isn't ready.
If you're relying on this AI, you're trusting a black box. No independent audit. No public evaluation record. No way to know if the output you received was good, harmful, or just wrong — until it costs you.
Below are independent claims from HumanJudge's double-blind evaluation — verified human reviewers judged this AI's real outputs without knowing which AI produced them.
Not enough detail on the content of the image or reel.
— Sunny Simmons
Very wordy and grammatically incorrect phrases such as "without the overwhelm"
— Sunny Simmons
Formatted in a way that very odd with all the pin emojis in the first post that aren't seen in any of the other two post...
— Sunny Simmons
Comes off as demeaning instead of wanting to be helpful. If I saw this I would not want to reach out to this person to t...
— Sunny Simmons
Unprofessionally written with too many dashes and astricts. It makes it look very AI generated as well.
— Sunny Simmons
This evaluation was conducted independently. Anthropic: Claude Opus 4.6 did not participate in or pay for this evaluation. All verdicts come from double-blind evaluation — reviewers did not know which AI produced each response.
Is this your product?
Claim your AI's profile, access evaluation data, and get verified.
We'll be in touch!
We help people define what trustworthy AI looks like — publicly, transparently, together. Support this mission