Prolego: A Data Science Approach to Predict the Outcome of a Football Match

Authors

  • Swain S SAP Labs, India
  • Mishra S SAP Labs, India

DOI:

https://doi.org/10.26438/ijcse/v6i4.132136

Keywords:

Prolego, Dataset, Collection

Abstract

Prolego aims to predict results of Premier League football matches accurately by applying machine learning techniques to historical data. The historical data consists of rows where each row consists of several statistics for both the Home Team and the Away Team. The historical data is generated using web scraping libraries such as Selenium and BeautifulSoup. Based on the scraped data, data cleaning and feature engineering is done to generate several features of a football match like Shots, Shots On Target, Possession, Tackles, Corners, Ratting etc. Finally, the features are represented in a vector format and fed as inputs to different Machine Learning classifier algorithms like Multinomial Logistic Regression, SVM, Gradient Boosting Classifier and DecisionTreeClassifier. After the classification, accuracy is measured by calculating percentage of correct predictions and percentage of correct draw predictions. Error analysis is performed using techniques like Region under Curve to tune hyperparameters and identify the features which are more prominent/useful in accurately predicting the results.

References

[1] Ben Ulmer and Matthew Fernandez, Predicting Soccer Match Results in the English Premier League. (http://cs229.stanford.edu/proj2014/Ben%20Ulmer, %20Matt%20Fernandez,%20Predicting%20Soccer% 20Results% 20in%20the%20English%20Premier%20League.pdf)

[2] A. S. Timmaraju, A. Palnitkar,& V. Khanna, Game ON! Predicting English Premier League Match Outcomes, 2013. (http://cs229.stanford.edu/proj2013/TimmarajuPalnit karKhanna-GameON!PredictionOfEPLMatchOutcomes.pdf)

[3] Kaggle March Machine Learning Mania https://www.kaggle.com/c/march- machine-learning-mania-2017

[4] Adit Deshpande, Applying Machine Learning to March Madness - Applying Machine Learning To March Madness (https://adeshpande3.github.io/Applying-Machine-Learning-to-March-Madness)

[5] Premier League website - https://www.premierleague.com/

[6] EA Sports FIFA Rating - https://www. faindex.com

Downloads

Published

2025-11-12
CITATION
DOI: 10.26438/ijcse/v6i4.132136
Published: 2025-11-12

How to Cite

[1]
S. Swain and S. Mishra, “Prolego: A Data Science Approach to Predict the Outcome of a Football Match”, Int. J. Comp. Sci. Eng., vol. 6, no. 4, pp. 132–136, Nov. 2025.

Issue

Section

Research Article