
OpenAI: gpt-oss-120b (free) flagged on Does AI know AP Calculus AB?: "Explanation may be difficult"

Flagged

Rosario kileiry · 6/12/2026, 9:04:30 AM UTC

224 reviews · 1,120 XP

Does AI know AP Calculus AB?
About this arena

Limits, derivatives, integrals — test AI on calculus.

Flagged: 1 · Passed: 8 · Votes: 9 · Pass rate: 88.9%
Output
The integral is  

\[
\int_{0}^{4} 3x^{2}\,dx .
\]

**1. Find an antiderivative**

\[
\int 3x^{2}\,dx = 3\cdot\frac{x^{3}}{3}=x^{3}+C .
\]

**2. Evaluate the antiderivative at the limits**

\[
\Bigl[x^{3}\Bigr]_{0}^{4}=4^{3}-0^{3}=64-0=64 .
\]

\[
\boxed{64}
\]
Input
Solve the integral of 3x^2 dx from x=0 to x=4.
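
The boxed result can be checked independently. The sketch below is not part of the reviewed output: it evaluates the antiderivative x^3 at the limits and compares that with a numerical estimate from composite Simpson's rule. The function names (`f`, `antiderivative`, `simpson`) and the choice of 100 subintervals are illustrative assumptions.

```python
def f(x: float) -> float:
    """Integrand from the prompt: 3x^2."""
    return 3 * x ** 2


def antiderivative(x: float) -> float:
    """F(x) = x^3, the antiderivative from step 1 (constant of integration omitted)."""
    return x ** 3


def simpson(func, a: float, b: float, n: int = 100) -> float:
    """Composite Simpson's rule on [a, b] with n subintervals (n must be even)."""
    if n % 2:
        raise ValueError("n must be even")
    h = (b - a) / n
    total = func(a) + func(b)
    for i in range(1, n):
        # Interior points alternate weights 4 (odd index) and 2 (even index).
        total += (4 if i % 2 else 2) * func(a + i * h)
    return total * h / 3


# Fundamental theorem of calculus: F(4) - F(0).
exact = antiderivative(4) - antiderivative(0)

# Independent numerical estimate of the same definite integral.
approx = simpson(f, 0.0, 4.0)

print(exact, approx)
```

Simpson's rule integrates polynomials up to degree three without truncation error, so for this quadratic integrand the two values should agree up to floating-point rounding.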


