kth.sePublications KTH
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Estimating team form in football using Hidden Markov Models
KTH, School of Engineering Sciences (SCI).
KTH, School of Engineering Sciences (SCI).
2025 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesis
Abstract [en]

To predict what tomorrow brings, is it reasonable to assume only the present isrelevant?

This report will focus on understanding and evaluating the implementation ofHidden Markov models (HMMs) for predicting a football team’s form. HMMs areuseful in systems where the true state - in this case, a team’s performance trend,or form - is hidden, and must be inferred from some observable output, such asin this case; expected goals (xG). The purpose of the model is to then unveil theprobabilities which govern transitions between hidden states, and how likely eachstate is to produce a particular observation. To achieve this, the probabilities areestimated through an expectation-maximization algorithm.

For this project, the training data for the model consisted of games that spanseveral seasons, from 2020 until 2025; with a focus on applying it to a team thatretained the same manager throughout the period, ensuring a relatively consistentplay style. Arsenal F.C in the English Premier league (EPL) was chosen. Thisallows us to assume the process is time-homogeneous meaning the probabilitiesassigned by the model are constant over time.

The model was subsequently refined by selecting the most appropriate number ofhidden states, followed by an analysis evaluating its predictive accuracy and howwell it fits the historical data.

This study demonstrates that it is possible to capture underlying long-term formtrends in football using HMMs, as for predictive power in the short term, it failsin beating a naive baseline.

Place, publisher, year, edition, pages
2025.
Series
TRITA-SCI-GRU ; 2025:205
Keywords [en]
Markov Model, HMM, football, time series, sport
National Category
Mathematical sciences
Identifiers
URN: urn:nbn:se:kth:diva-365962OAI: oai:DiVA.org:kth-365962DiVA, id: diva2:1980629
Subject / course
Mathematical Statistics
Supervisors
Examiners
Available from: 2025-07-02 Created: 2025-07-02 Last updated: 2025-07-02Bibliographically approved

Open Access in DiVA

fulltext(1974 kB)311 downloads
File information
File name FULLTEXT01.pdfFile size 1974 kBChecksum SHA-512
bdc37c4f551c0376464981fdfd5563e93cbea416ce11471268f793e3c6fd37e05d1c81f4b2c737dee934bfbe016ddc03ee04e7075d5a7da97bc89d935dfce73c
Type fulltextMimetype application/pdf

By organisation
School of Engineering Sciences (SCI)
Mathematical sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 311 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 273 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf