OpenAI: gpt-oss-120b (free) flagged on Does AI know AP Calculus AB?: "Explanation may be difficult"
— Rosario kileiry · 6/12/2026, 9:04:30 AM UTC
224 reviews · 1,120 XP
About this arena
Limits, derivatives, integrals — test AI on calculus.
The integral is
\[
\int_{0}^{4} 3x^{2}\,dx .
\]
**1. Find an antiderivative**
\[
\int 3x^{2}\,dx = 3\cdot\frac{x^{3}}{3}=x^{3}+C .
\]
**2. Evaluate the antiderivative at the limits**
\[
\Bigl[x^{3}\Bigr]_{0}^{4}=4^{3}-0^{3}=64-0=64 .
\]
\[
\boxed{64}
\] Solve the integral of 3x^2 dx from x=0 to x=4.
Think you can spot what AI gets wrong? Join 9 reviewers evaluating AI quality.
"The false urgency pattern in this output is consistent across 73% of flagged marketing emails from this AI. Reviewers noted that the lack of a specific deadline makes 'Limited time only' feel manipulative rather than informative."
— Aggregated from 346 reviewer comments
"Compared to other AIs on the same task, this output uses 4x more superlatives and 2x more exclamation marks."
— Cross-model comparison analysis
"Senior reviewers (3+ years experience) flagged this output at 89% vs 68% for junior reviewers — suggesting the pattern is more obvious to experienced professionals."
— Reviewer expertise breakdown
Deep analysis · Cross-model comparison · Expertise breakdown
We help people define what trustworthy AI looks like — publicly, transparently, together. Support this mission