Superhuman AI Models Struggled to Predict Football Scores

In a recent examination of artificial intelligence capabilities, eight AI models were put to the test to predict outcomes in a virtual re-creation of the 2023-24 Premier League season. The analysis conducted by General Reasoning, a London-based AI startup, sought to determine whether these advanced models could successfully navigate the complexities of football betting.

The AI models, which included leading technologies from Google, OpenAI, and Anthropic, were provided with a starting bank of £100,000 alongside extensive historical data. This data encompassed past results, line-ups, and public betting odds. The models were tasked with developing strategies to maximise returns while managing risk throughout the season, requiring them to adapt to evolving game outcomes and identify edges in betting markets.

Among the participants, Anthropic’s Claude Opus 4.6 emerged as the best performer, demonstrating an average loss of 11 per cent over three attempts. Google’s Gemini 3.1 pro achieved a notable gain of approximately £33,000 during its most successful outing, yet subsequently faced bankruptcy in another simulation. Only two models, Claude Opus 4.6 and GPT-5.4, managed to avoid financial ruin across the three testing scenarios.

The findings from General Reasoning reveal a concerning trend; all evaluated AI models ultimately incurred losses during the season, with many facing significant ruin. The company’s report highlights a critical observation that existing AI frameworks excel in well-defined tasks with clear objectives. Open-ended, long-term scenarios prove significantly more challenging for these models.

Analysts noted that the ability of these AI systems to maintain coherent decision-making over extended periods was lacking. Many failed to act on their analyses or adjust strategies according to changing circumstances. The sophistication of the strategies employed by the models was found to be considerably lower compared to human decision-making processes, indicating substantial potential for enhancement in future AI developments.

This assessment suggests a vital need to shift evaluations toward more complex environments that challenge long-horizon and sequential decision-making capabilities. As AI technology evolves, its application within unpredictable domains like sports betting may prove to be an ongoing area for research and improvement.

Post Disclaimer

The following content has been published by Stockmark.IT. All information utilised in the creation of this communication has been gathered from publicly available sources that we consider reliable. Nevertheless, we cannot guarantee the accuracy or completeness of this communication.

This communication is intended solely for informational purposes and should not be construed as an offer, recommendation, solicitation, inducement, or invitation by or on behalf of the Company or any affiliates to engage in any investment activities. The opinions and views expressed by the authors are their own and do not necessarily reflect those of the Company, its affiliates, or any other third party.

The services and products mentioned in this communication may not be suitable for all recipients, by continuing to read this website and its content you agree to the terms of this disclaimer.

Our Socials

Recent Posts

Stockmark.1T logo with computer monitor icon from Stockmark.it
Loading Next Post...
Popular Now
Loading

Signing-in 3 seconds...

Signing-up 3 seconds...