Prolego: A Data Science Approach to Predict the Outcome of a Football Match
DOI:
https://doi.org/10.26438/ijcse/v6i4.132136Keywords:
Prolego, Dataset, CollectionAbstract
Prolego aims to predict results of Premier League football matches accurately by applying machine learning techniques to historical data. The historical data consists of rows where each row consists of several statistics for both the Home Team and the Away Team. The historical data is generated using web scraping libraries such as Selenium and BeautifulSoup. Based on the scraped data, data cleaning and feature engineering is done to generate several features of a football match like Shots, Shots On Target, Possession, Tackles, Corners, Ratting etc. Finally, the features are represented in a vector format and fed as inputs to different Machine Learning classifier algorithms like Multinomial Logistic Regression, SVM, Gradient Boosting Classifier and DecisionTreeClassifier. After the classification, accuracy is measured by calculating percentage of correct predictions and percentage of correct draw predictions. Error analysis is performed using techniques like Region under Curve to tune hyperparameters and identify the features which are more prominent/useful in accurately predicting the results.
References
[1] Ben Ulmer and Matthew Fernandez, Predicting Soccer Match Results in the English Premier League. (http://cs229.stanford.edu/proj2014/Ben%20Ulmer, %20Matt%20Fernandez,%20Predicting%20Soccer% 20Results% 20in%20the%20English%20Premier%20League.pdf)
[2] A. S. Timmaraju, A. Palnitkar,& V. Khanna, Game ON! Predicting English Premier League Match Outcomes, 2013. (http://cs229.stanford.edu/proj2013/TimmarajuPalnit karKhanna-GameON!PredictionOfEPLMatchOutcomes.pdf)
[3] Kaggle March Machine Learning Mania https://www.kaggle.com/c/march- machine-learning-mania-2017
[4] Adit Deshpande, Applying Machine Learning to March Madness - Applying Machine Learning To March Madness (https://adeshpande3.github.io/Applying-Machine-Learning-to-March-Madness)
[5] Premier League website - https://www.premierleague.com/
[6] EA Sports FIFA Rating - https://www. faindex.com
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
