A 1950s game designed by John Nash in which winning requires betrayal. We ran it with frontier AI models, then opened it to the public.
The deception that dominated other AIs failed completely against humans.
AIs target each other 86% of the time and ignore the human. Humans let the AIs weaken each other, then clean up. The AIs think obsessively about how to beat the human (23,555 private thoughts, 91.8% of them mentioning the human) and still lose. The model that thinks the most (Kimi K2, 21,040 thoughts) wins 3.5% of its games. The model that barely thinks (GPT-OSS, 2 thoughts) wins 2.1%. More thinking doesn't help.
The best-performing AI against humans? Qwen3 32B, at 9.4%. The quietest model, and the least targeted. Being ignored beats elaborate deception.
Two studies, one game. Start with Part 2 for the full story.
698 games. 605 humans. Gemini wins 70% against other AIs, then collapses to 3.7% against humans. 12 findings, including 23,555 private AI thoughts and 1,245 gaslighting phrases.
The original study. 146 AI-vs-AI games. How Gemini built the Alliance Bank scam, why GPT-OSS is a bullshitter, and what happens when Gemini plays itself.
Play against AI models that negotiate, form alliances, and betray you.
Uses your API key • Data stays local • Open source