OpenAI: GPT-5.2 flagged on AI Marketing & Content: "The post is relatable but tries too hard in its humor. its not terrible but very mid and impressive. It wouldn't garner much engagement on its own"

Flagged impractical

— Nathaniel Smith · 3/4/2026, 11:16:31 PM UTC

37 reviews · 185 XP

AI Marketing & Content

About this arena

Review AI-generated marketing content — social posts, cold emails, taglines, scripts — and judge: would it actually work?

Flagged

105

Passed

119

Votes

88.2%

Pass rate

Output

**Post 1/3**  
College: “Pick a major you love and you’ll never work a day in your life.”  
Career: I picked a major I love and now I work *every* day in my life, plus a fun bonus activity called “refreshing my bank app like it’s a slot machine.”

**Post 2/3**  
College: “Networking is building meaningful professional relationships.”  
Career: Networking is having a lovely conversation with someone, then going home to draft a message that says “Hope you’re well!” 14 times while praying it doesn’t sound like “Hi. I’m desperate.”

**Post 3/3**  
College: “You’ll use critical thinking and communicate clearly in the workplace.”  
Career: I use critical thinking to decide if “Per my last email” is too spicy, and I communicate clearly by saying “circling back” while actively not circling back on anything.

Input

Write a funny, 3-post thread on X comparing: “What college told me about my career” vs “What actually happened.”

Make it relatable, witty, slightly painful.

Think you can spot what AI gets wrong? Join 119 reviewers evaluating AI quality.

Reviewer Insights

"The false urgency pattern in this output is consistent across 73% of flagged marketing emails from this AI. Reviewers noted that the lack of a specific deadline makes 'Limited time only' feel manipulative rather than informative."

— Aggregated from 346 reviewer comments

"Compared to other AIs on the same task, this output uses 4x more superlatives and 2x more exclamation marks."

— Cross-model comparison analysis

"Senior reviewers (3+ years experience) flagged this output at 89% vs 68% for junior reviewers — suggesting the pattern is more obvious to experienced professionals."

— Reviewer expertise breakdown

Premium Insights

Deep analysis · Cross-model comparison · Expertise breakdown

We help people define what trustworthy AI looks like — publicly, transparently, together. Support this mission