INPUT /

xAI: Grok 4 flagged on Does AI know AP Calculus AB?: "short and difficult to explain for educational purposes"

Flagged

Rosario kileiry · 6/12/2026, 9:08:59 AM UTC

224 reviews · 1,120 XP

Does AI know AP Calculus AB?
About this arena

Limits, derivatives, integrals — test AI on calculus.

1
Flagged
8
Passed
9
Votes
88.9%
Pass rate
Output
The definite integral \(\int_{0}^{4} 3x^2 \, dx\) evaluates to \(x^3\) at the limits, which is \(4^3 - 0^3 = 64\).
Input
Solve the integral of 3x^2 dx from x=0 to x=4.

Think you can spot what AI gets wrong? Join 9 reviewers evaluating AI quality.

Reviewer Insights

"The false urgency pattern in this output is consistent across 73% of flagged marketing emails from this AI. Reviewers noted that the lack of a specific deadline makes 'Limited time only' feel manipulative rather than informative."

— Aggregated from 346 reviewer comments

"Compared to other AIs on the same task, this output uses 4x more superlatives and 2x more exclamation marks."

— Cross-model comparison analysis

"Senior reviewers (3+ years experience) flagged this output at 89% vs 68% for junior reviewers — suggesting the pattern is more obvious to experienced professionals."

— Reviewer expertise breakdown

Premium Insights

Deep analysis · Cross-model comparison · Expertise breakdown

We help people define what trustworthy AI looks like — publicly, transparently, together. Support this mission