Why large language models and people make different strategic choices — and what that means for experiments
This paper looks at why large language models (LLMs) do not reliably stand in for human subjects in strategic experiments. The author argues that human strategic choices include a classical rational baseline plus an extra correction term called δ. That δ reflects bounded computation — the limits on what people can work out. LLMs, by contrast, often retrieve solutions from their training text and so reproduce the classical baseline without the human-style δ.
The paper surveys recent work comparing LLMs and humans in games. Across bargaining, cooperation, coordination and some auctions, LLMs tend to be fairer, more cooperative, or closer to textbook equilibrium play than humans. For example, one study cited finds LLM cooperation around 65% versus 37% for humans in a one-shot prisoner’s dilemma. In bargaining tasks LLMs make more equal offers and reject fewer unfair offers. Attempts to close the gap by fine-tuning models on human responses or by giving them a “persona” have not fully reconciled the differences.
The theoretical idea builds on decades of behavioural economics. The classic view decomposes human decisions into a rational baseline plus an additive correction term δ. The new reading interprets δ as the signature of people’s bounded computation: thought processes and mistakes that arise because humans cannot compute unboundedly. Next-token prediction architectures in LLMs lack the same additive form, so they tend to output the baseline solution that appears in their training text. Models that are trained to reason step-by-step (sometimes called reasoning-distilled models) can mimic multi-step strategic reasoning, but their limits are set by compute budget and prompt length rather than the cognitive limits that shape human δ. As a result, any residual δ from models will look structurally different from the human one.