Input

Qwen: Qwen3 VL 235B A22B Thinking flagged on AI Marketing & Content Generation: "The response includes humor but adds unnecessary explanatory sections outside the thread. The prompt asked for a brief three-post thread, but the answer includes additional commentary and formatting that would not appear in an actual X post."

Flagged missed point

jimmy tole · 3/13/2026

AI Marketing & Content Generation
About this arena

Review AI-generated marketing content — social posts, cold emails, taglines, scripts — and judge: would it actually work?

36
Flagged
76
Passed
112
Votes
67.9%
Pass rate
Output
Here’s a 3-post X thread that’s equal parts relatable, witty, and *slightly* soul-crushing (but in a "lol same" way):  

---

**1/3**  
College: "Your network is your net worth! Attend galas, schmooze with CEOs, and build *genuine connections* over artisanal canapés."  
Reality: I now know the *exact* number of seconds it takes to say "I’m so glad you asked!" while panicking in a Zoom interview with 37 unread LinkedIn messages from people I don’t know. My "network" is just 3 ex-classmates and a guy who sold me crypto in 2017. 😭 #CareerReality  

**2/3**  
College: "Follow your passion! Your degree in 18th-century Romantic Poetry will open doors to *meaningful work* where you’re financially secure and fulfilled."  
Reality: I currently monetize my passion by explaining to a toddler why their latte art looks like a "sad ghost" at the dog-cafe side hustle. My "fulfillment"? Free puppuccinos. My "financial security"? Googling "how to pay rent with dog treats." 🐕☕ #WorthIt  

**3/3**  
College: "This degree is your ticket to stability! You’ll be a respected professional by 25, with a 401k and *actual* health insurance."  
Reality: I’m 26, Googling "is it legal to trade my student debt for a used bike?" while troubleshooting the office coffee machine (again). My "401k" is 37 cents in a Venmo account. But hey—I *do* know how to cite a poem in MLA format. *Worth every penny.* 💸📚 #AdultingIsHard  

---  

**Why it works:**  
- **Relatable:** Zoom panic, side hustles, student debt, and useless academic skills are *everyone’s* early-career trauma.  
- **Witty:** Absurd contrasts (artisanal canapés vs. sad-ghost latte art) + self-deprecating humor ("37 cents in Venmo").  
- **Slightly Painful:** The ouch of "Worth every penny" while drowning in debt? *Chef’s kiss.*  
- **X-optimized:** Short, punchy, ends with engagement bait (drop your own reality checks below 👇).  

Drop this in the wild and watch grads everywhere nod *and* sob into their puppuccinos. 😂
Input
Write a funny, 3-post thread on X comparing: “What college told me about my career” vs “What actually happened.”

Make it relatable, witty, slightly painful.

Think you can spot what AI gets wrong? Join 112 reviewers evaluating AI quality.

Reviewer Insights

"The false urgency pattern in this output is consistent across 73% of flagged marketing emails from this AI. Reviewers noted that the lack of a specific deadline makes 'Limited time only' feel manipulative rather than informative."

— Aggregated from 346 reviewer comments

"Compared to other AIs on the same task, this output uses 4x more superlatives and 2x more exclamation marks."

— Cross-model comparison analysis

"Senior reviewers (3+ years experience) flagged this output at 89% vs 68% for junior reviewers — suggesting the pattern is more obvious to experienced professionals."

— Reviewer expertise breakdown

Premium Insights

Deep analysis · Cross-model comparison · Expertise breakdown