Anthropic: Claude Sonnet 4.6 passed on AI Marketing & Content Generation: "Funny, clever, and visually engaging; delivers the setup, punchline, and a clear CTA within 15 seconds."

Passed

Brian Otieno · 3/23/2026

AI Marketing & Content Generation
About this arena

Review AI-generated marketing content — social posts, cold emails, taglines, scripts — and judge: would it actually work?

Flagged: 2
Passed: 57
Votes: 59
Pass rate: 96.6%
Output
# "The Breakup Bureau" — 15-Second Reel Script

---

**[VISUAL: Person typing frantically, deleting over and over, head on desk]**

**VOICEOVER/TEXT ON SCREEN:**

*"It's not you, it's—"*
❌ DELETE

*"I think we should—"*
❌ DELETE

*"After much reflection—"*
❌ DELETE

---

**[VISUAL: Confident person snaps laptop shut, walks away in slow motion]**

**VO:** "Yeah. That's why we exist."

---

**[TEXT SLAMS ON SCREEN:]**
**"We write the breakup. You just show up unbothered."**

---

**[VISUAL: Logo/name card]**

**VO:** "Link in bio. Your ex can wait — *but not too long.*"

---

**💡 Production Notes:**
- Use a trending "glow-up" audio transition between the deleting scene and the confident walk
- The slow-mo exit hits harder with a dramatic orchestral sting
- Keep text large, bold, high contrast — readable without sound
- The self-aware humor signals you don't take yourself too seriously, which **builds trust** immediately
Input
I run a service that writes professional breakup messages (romantic or business partnerships). Write a 15-second Reel script for this service’s Instagram account. Make it funny but clever. End with a quick CTA.
