Prediction of Train Delay in Indian Railways through Machine Learning Techniques
DOI:
https://doi.org/10.26438/ijcse/v7i2.405411Keywords:
Train delay, Multivariate Regression, Neural Network, Random ForestAbstract
Train delay is one of the foremost problems in the railway systems across the world. According to the TOI newspaper, In India there are about 25.3 million people were used to travel by train in 2006 and this drastically increased year by year. In 2018, every day at least 80 million people in India prefer to travel by trains[1]. Categorically in India, most of the trains unable to run on their scheduled time due to poor signaling and less number of railway tracks. This implies that travellers might get delayed to reach their respective destinations. The aim of this paper is to present the prediction of Train delay in Indian Railways through machine learning techniques to achieve higher accuracy. In the proposed model, we used 3 different machine learning methods (Multivariate regression, Neural Network, and Random Forest) which have been compared with different settings to find the most accurate method. To compare different methods, we consider training time and accuracy of the method over the test data set. Trains in India get delayed frequently, and if we can predict this in advance - it would be a great help for the passengers to plan their journey according to their works.
References
[1] Ministry of Railways, "Indian Railways year book 2015-16," in ministry of Railways (Railway Board) , 2015.
[2] Ministry of Railway. "Indian Railways Statistical Publications 2016-17: PassengerBusiness" p. 23. Archived (PDF) from the original on 3 March 2018. Retrieved: 2 March 2018
[3] Mathworks, "What is machine learning?", https://se.mathworks. com/discovery/ machine-learning.html Retrieved: 20 October 2018
[4] K. Jain, Anil & Mao, Jianchang & Moidin Mohiuddin, "Artificial Neural Networks: A Tutorial,” 29. 31 - 44. 10.1109/2.485891. (1996).
[5] James G, Witten D, Tibshirani T H R. "An introduction to Statistical Learning with Applications in R" .New York. Springer; 2013
[6] Lutz, Mark “Learning Python (5th ed.)”. O’’Reilly Media. ISBN 978-0-596-15806-4. (2013).
[8] Hansen, I.A. & Goverde, Rob & J. van der Meer, Dirk. “Online train delay recognition and running time prediction.” IEEE Conference on Intelligent Transportation Systems, Proceedings, ITSC. 1783 - 1788. 10.1109/ITSC.2010.5625081. (2010).
[9] Y. Masoud et al. “ Railway passenger train delay prediction via neural network model” JOURNAL OF ADVANCED TRANSPORTATION, 47:355–368, (2013)
[10] S. Pongnumkul et al. "Improving arrival time prediction of Thailand`s passenger trains using historical travel times," 11th International Joint Conference on Computer Science and Software Engineering (JCSSE), pp. 307-312, 2014
[11] Jia Hu, Bernd Noche. “Application of Artificial Neuron Network in Analysis of Railway Delays”, Open Journal of Social Sciences, 4, 59-68, 2016
[12] R. Nilsson and K. Henning, “Predictions of train delays using machine learning,” Dissertation DIVA, 2018.
[13] L. Oneto et al. "Advanced Analytics for Train Delay Prediction Systems by Including Exogenous Weather Data," IEEE International Conference on Data Science and Advanced Analytics (DSAA), Montreal, QC, 2016, pp. 458-467. 2016
[14] R. Gaurav et al., “Estimating Train Delays in a Large Rail Network Using a Zero Shot Markov Model”, arXiv:1806.028251v1 [stat.AP], 7 June 2018
[15] Jasti Satyakrishna et al., “ Train Delay Prediction Systems Using Big Data Analytics” , International Journal of Innovative Research in Computer and Communication Engineering, Vol. 6, Issue 3, March 2018
[16] Yaghini, Masoud & Setayesh Sanai, Maryam & Amin Sadrabady, Hossein., “Passenger Train Delay Classification,” International Journal of Applied Metaheuristic Computing 4. 21-31. 10.4018/jamc.2013010102. (2015)
[17] Zorkany, M & Zaki, Mohamed & Ashour, I & Hisham, Basma. “Online bus arrival time prediction using hybrid neural network and Kalman filter techniques.” International Journal of Modern Engineering Research (IJMER). V3. (2013)
[18] Sternberg, Alice & Soares, Jorge & Carvalho, Diego & Ogasawara, Eduardo. “A Review on Flight Delay Prediction.” (2017)
[19] RailApi, “Indian railway apis,” https://railwayapi.com, 2018
[20] https://runningstatus.in, 2018
[21] Fan, Rong-En & Chang, Kai-Wei & Hsieh, Cho-Jui & Wang, Xiang-Rui & Lin, Chih-Jen. (2008). “LIBLINEAR: a library for large linear classification.” Journal of Machine Learning Research. 9. 1871-1874. 10.1145/1390681.1442794.
[22] T. R. Foundation. R: What is r? [Online]. Available: https://www.rproject.org/about.html
[23] M. Paluszek and S. Thomas, MATLAB Machine Learning. Apress, 2016.
[24] Eibe Frank, Mark A. Hall, and Ian H. Witten (2016). The WEKA Workbench. Online Appendix for "Data Mining: Practical Machine Learning Tools and Techniques", Morgan Kaufmann, Fourth Edition, 2016.
[25] pandas: Python data analysis library. Online, 2012, Available: http://pandas.pydata.org/
[26] Scikit-learn. Scikit-learn: machine learning in python. [Online]. Available: http://scikit-learn.org/
[27] Jitendra Oza et al. “Public Transport Tracking and its Issues,” International Journal of Computer Sciences and Engineering, Vol.5(11), Nov 2017, E-ISSN: 2347-2693
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
