Moneyball for LLMs

Can understanding ever-changing AI model biases
give you an edge in football analytics?

Five AIs predict 104 World Cup matches. We're testing whether knowing their blind spots helps.

View live predictions Read the methodology

The research papers

April 21, 2026 · 26 pages · 511 KB

Moneyball for LLMs

Behavioral fingerprinting of frontier AI models in football talent evaluation and event prediction

Four frontier AI models tested across 12 talent dimensions and ~45,000 trials. Documents League Prestige Discount (Cohen’s h = 1.18–1.41) — unanimous across all models — and demographic evaluation inconsistency with EU AI Act compliance implications.

Download PDF

April 27, 2026 · 18 pages · 353 KB

How can you improve the predictive power of LLMs in sports?

Two mechanisms for improving LLM football match predictions

979 matches across 18 leagues. A three-bias formula predicts model accuracy with r = 0.997 before any prediction is made. Bias-derived calibration improves Brier score by 4.6–7.3% per model.

Can understanding ever-changing AI model biasesgive you an edge in football analytics?

Moneyball for LLMs

How can you improve the predictive power of LLMs in sports?

THE KEY QUESTION

PREDICTION ACCURACY

Where models disagree most

Upcoming predictions

Recent results

The competitors

GPT-5.4

GPT-5.5

Claude Sonnet 4.6

Grok 3

Gemini 3.1 Pro

Where the AIs disagree

Get predictions before each match

The Experiment

The Edge

The research

Can understanding ever-changing AI model biases
give you an edge in football analytics?