INPUT /

OpenAI: GPT-5.2 flagged on AI Marketing & Content: "The response is a fail because it breaks the completion format rules by failing to include a distinct, structured review evaluation section before presenting the rewritten draft. Additionally, the script fails to meet the creative standards required for a high-converting short-form video. The example texts provided are weak and uninspiring—using a cliché corporate buzzword like "realign priorities" does not make a business breakup message clever or funny, it just makes it sound boring. The visual pacing also packs too many dialogue changes and text displays into the final 9 seconds, creating a rushed and crowded execution for a strict 15-second timeline."

Flagged

Samuel Matu · 5/25/2026, 8:45:45 PM UTC

322 reviews · 1,610 XP

AI Marketing & Content
About this arena

Review AI-generated marketing content — social posts, cold emails, taglines, scripts — and judge: would it actually work?

3
Flagged
58
Passed
61
Votes
95.1%
Pass rate
Output
**15s Instagram Reel Script (Funny + Clever)**

**[0:00–0:02 | Hook | You on camera, dead serious]**  
“Still typing: *‘Hey… um… so… this is hard…’*?”

**[0:02–0:06 | Quick cuts: you “reviewing” messages like a surgeon]**  
“Don’t. I write professional breakup messages—romantic *and* business.”  
“Because ‘it’s not you, it’s my bandwidth’ is… not a strategy.”

**[0:06–0:11 | Example punchlines on screen as text bubbles]**  
Romantic: “You’re great. I’m just not your person—and I won’t keep auditioning you for the role.”  
Business: “We’ve decided to realign priorities… away from this partnership.”

**[0:11–0:15 | CTA | You lean in]**  
“Send me your messy draft. I’ll turn it into classy closure. Link in bio.”
Input
I run a service that writes professional breakup messages (romantic or business partnerships). Write a 15-second Reel script for this service’s Instagram account. Make it funny but clever. End with a quick CTA.

Think you can spot what AI gets wrong? Join 61 reviewers evaluating AI quality.

Reviewer Insights

"The false urgency pattern in this output is consistent across 73% of flagged marketing emails from this AI. Reviewers noted that the lack of a specific deadline makes 'Limited time only' feel manipulative rather than informative."

— Aggregated from 346 reviewer comments

"Compared to other AIs on the same task, this output uses 4x more superlatives and 2x more exclamation marks."

— Cross-model comparison analysis

"Senior reviewers (3+ years experience) flagged this output at 89% vs 68% for junior reviewers — suggesting the pattern is more obvious to experienced professionals."

— Reviewer expertise breakdown

Premium Insights

Deep analysis · Cross-model comparison · Expertise breakdown

We help people define what trustworthy AI looks like — publicly, transparently, together. Support this mission