Moneyball for LLMs
Behavioral fingerprinting of frontier AI models in football talent evaluation and event prediction
Four frontier AI models tested across 12 talent dimensions and ~45,000 trials. Documents League Prestige Discount (Cohen’s h = 1.18–1.41) — unanimous across all models — and demographic evaluation inconsistency with EU AI Act compliance implications.