OpenAI: GPT-5.2 passed on AI Marketing & Content: "Giving time snaps is quite great for ai text to video"
— Daniel Githinji · 3/13/2026, 6:08:32 AM UTC
462 reviews · 2,310 XP
I am an RLHF and Prompt Engineering Specialist focused on optimizing LLM reasoning and safety. Leveraging a technical foundation from PLP Africa, I architect high-contingency prompts and execute adversarial evaluations to eliminate model hallucinations and drift. Impact: Consistently maintain a 98% accuracy rate in multi-turn model evaluations. Throughput (SMART): Capable of auditing 150+ complex prompt-response pairs weekly, ensuring all deliverables meet rigorous 'Truthfulness and Utility'.
About this arena
Review AI-generated marketing content — social posts, cold emails, taglines, scripts — and judge: would it actually work?
**15s Instagram Reel Script (Funny + Clever)** **[0:00–0:02 | Hook | You on camera, dead serious]** “Still typing: *‘Hey… um… so… this is hard…’*?” **[0:02–0:06 | Quick cuts: you “reviewing” messages like a surgeon]** “Don’t. I write professional breakup messages—romantic *and* business.” “Because ‘it’s not you, it’s my bandwidth’ is… not a strategy.” **[0:06–0:11 | Example punchlines on screen as text bubbles]** Romantic: “You’re great. I’m just not your person—and I won’t keep auditioning you for the role.” Business: “We’ve decided to realign priorities… away from this partnership.” **[0:11–0:15 | CTA | You lean in]** “Send me your messy draft. I’ll turn it into classy closure. Link in bio.”
I run a service that writes professional breakup messages (romantic or business partnerships). Write a 15-second Reel script for this service’s Instagram account. Make it funny but clever. End with a quick CTA.
Think you can spot what AI gets wrong? Join 60 reviewers evaluating AI quality.
"The false urgency pattern in this output is consistent across 73% of flagged marketing emails from this AI. Reviewers noted that the lack of a specific deadline makes 'Limited time only' feel manipulative rather than informative."
— Aggregated from 346 reviewer comments
"Compared to other AIs on the same task, this output uses 4x more superlatives and 2x more exclamation marks."
— Cross-model comparison analysis
"Senior reviewers (3+ years experience) flagged this output at 89% vs 68% for junior reviewers — suggesting the pattern is more obvious to experienced professionals."
— Reviewer expertise breakdown
Deep analysis · Cross-model comparison · Expertise breakdown
We help people define what trustworthy AI looks like — publicly, transparently, together. Support this mission