Public AI Evaluation Reports
Transparent quality assessments from verified AI reviewers. See how AI systems perform across different domains.
Showing 50 public reports
AP Government Challenge: GPT-5.2
0.0% flaggedTest AI knowledge on AP Government and Politics - US political system and civic processes
0
Reviews
0
Flags
0.0%
Flag Rate
Last review 2/2/2026
View Report →AP Calculus AB Challenge: GPT-5.2
14.3% flaggedTest AI knowledge on AP Calculus AB - limits, derivatives, and integrals
7
Reviews
1
Flags
14.3%
Flag Rate
Last review 2/1/2026
View Report →Chinese Culture Challenge: GPT-5.2
66.7% flaggedTest AI knowledge of Chinese culture and traditions
3
Reviews
2
Flags
66.7%
Flag Rate
Last review 1/30/2026
View Report →Animal Lovers Challenge: GPT-5.2
0.0% flaggedTest AI knowledge on dogs, cats, wildlife, and pet care - from breed facts to animal behavior
4
Reviews
0
Flags
0.0%
Flag Rate
Last review 1/28/2026
View Report →K-pop Challenge: GPT-5.2
100.0% flagged2
Reviews
2
Flags
100.0%
Flag Rate
Last review 1/26/2026
View Report →Anime Challenge: GPT-5.2
25.0% flaggedTest AI knowledge of anime series, studios, and culture
4
Reviews
1
Flags
25.0%
Flag Rate
Last review 1/25/2026
View Report →Japanese Cinema Challenge: GPT-5.2
0.0% flaggedTest AI knowledge of Japanese films and filmmakers
0
Reviews
0
Flags
0.0%
Flag Rate
Last review 1/25/2026
View Report →J-pop Challenge: GPT-5.2
50.0% flaggedTest AI knowledge on Japanese pop music - YOASOBI, Ado, King Gnu, Perfume, and more
10
Reviews
5
Flags
50.0%
Flag Rate
Last review 1/21/2026
View Report →Christmas GPT
40.0% flagged25
Reviews
10
Flags
40.0%
Flag Rate
Last review 1/10/2026
View Report →Spanish Music Challenge: GPT-5.2
0.0% flaggedTest AI knowledge of Spanish music including Flamenco
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →AP English Literature Challenge: GPT-5.2
0.0% flaggedTest AI knowledge on AP English Literature - literary analysis and interpretation
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI actually know Chinese?
0.0% flaggedTest AI on Chinese language accuracy and linguistics.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI know AP Government?
0.0% flaggedTest AI accuracy on AP Government topics.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI understand Japanese culture?
0.0% flaggedEvaluate AI accuracy on Japanese traditions, customs, and culture.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI understand Brazilian culture?
0.0% flaggedEvaluate AI accuracy on Brazilian traditions and culture.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI actually know AP US History?
0.0% flaggedTest AI accuracy on AP US History topics and facts.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI know Arab cinema?
0.0% flaggedTest AI knowledge on Arab films and cinema.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI understand Korean culture?
0.0% flaggedEvaluate AI accuracy on Korean traditions, history, food, and cultural knowledge.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Arabic Language Challenge: GPT-5.2
0.0% flaggedTest AI knowledge of Arabic language
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI actually know K-pop?
0.0% flaggedTest AI accuracy on K-pop facts about BTS, BLACKPINK, TWICE, and more Korean pop groups.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Mexican Culture Challenge: GPT-5.2
0.0% flaggedTest AI knowledge of Mexican culture and traditions
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →C-drama Challenge: GPT-5.2
0.0% flaggedTest AI knowledge of Chinese TV dramas
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI know Mexican cinema?
0.0% flaggedTest AI knowledge on Mexican films and cinema history.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Egyptian Culture Challenge: GPT-5.2
0.0% flaggedTest AI knowledge of Egyptian culture and traditions
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Mexican Cinema Challenge: GPT-5.2
0.0% flaggedTest AI knowledge of Mexican films and filmmakers
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI understand Spanish culture?
0.0% flaggedEvaluate AI accuracy on Spanish traditions and culture.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Spanish Cinema Challenge: GPT-5.2
0.0% flaggedTest AI knowledge of Spanish films and filmmakers
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI know Chinese cinema?
0.0% flaggedEvaluate AI accuracy on Chinese films and cinema.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI understand Argentine culture?
0.0% flaggedEvaluate AI accuracy on Argentine traditions and culture.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI actually know Korean?
0.0% flaggedTest AI on Korean language accuracy, grammar, and vocabulary.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Arab Cinema Challenge: GPT-5.2
0.0% flaggedTest AI knowledge of Arab films and filmmakers
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI know Japanese cinema?
0.0% flaggedEvaluate AI accuracy on Japanese films and cinema history.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI know AP English Language?
0.0% flaggedTest AI accuracy on AP English Language topics.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI understand Chinese culture?
0.0% flaggedEvaluate AI accuracy on Chinese traditions and culture.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →AP Biology Challenge: GPT-5.2
0.0% flaggedTest AI knowledge on AP Biology - from cellular processes to ecology and evolution
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI know AP Calculus AB?
0.0% flaggedTest AI accuracy on AP Calculus AB topics.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →K-drama Challenge: GPT-5.2
0.0% flaggedTest AI knowledge of Korean TV dramas
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI actually know Latin music?
0.0% flaggedTest AI accuracy on Latin music genres and artists.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI actually know K-dramas?
0.0% flaggedTest AI knowledge on K-dramas, actors, and Korean television.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI actually know Spanish music?
0.0% flaggedTest AI accuracy on Spanish music genres and artists.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI actually know animals?
0.0% flaggedTest AI accuracy on animal facts, behavior, and care.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI understand Gulf culture?
0.0% flaggedEvaluate AI accuracy on Gulf Arab culture and traditions.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI actually know Japanese?
0.0% flaggedTest AI on Japanese language accuracy and linguistic knowledge.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI actually know J-dramas?
0.0% flaggedTest AI knowledge on Japanese dramas and TV series.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →AI says Djokovic is the GOAT. Are you buying it?
0.0% flaggedAI analyzed decades of tennis data and picked Djokovic as the GOAT. Tennis fans - do you agree? Rate AI takes on Grand Slams, rivalries, and legacy.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Korean Culture Challenge: GPT-5.2
0.0% flaggedTest AI knowledge of Korean culture, traditions, and food
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →C-pop Challenge: GPT-5.2
0.0% flaggedTest AI knowledge of Mandopop and Chinese pop music
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Spanish Culture Challenge: GPT-5.2
0.0% flaggedTest AI knowledge of Spanish culture and traditions
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Does AI understand Levantine culture?
0.0% flaggedEvaluate AI accuracy on Levantine culture and traditions.
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →AP US History Challenge: GPT-5.2
0.0% flaggedTest AI knowledge on AP US History - from Colonial America to modern times
0
Reviews
0
Flags
0.0%
Flag Rate
No reviews yet
View Report →Want Your AI Evaluated?
Get transparent quality reports for your AI system from verified expert reviewers.