Clustering The Duplicate Open Crash Reports Based on Call Stack Traces of Crash Reports
DOI:
https://doi.org/10.26438/ijcse/v6i9.207210Keywords:
Crash reports, clustering techniqueAbstract
A computer program such as software application that stops functioning properly is called software crash. Software crash is tedious problem in software development environment. Upon user permission, the crash report which contains the stack traces is sent to the developer or vendor. Software development team receives hundreds of crash reports from many deployment sites. There are many duplicate crash reports are generated, because many users submit the crash reports for the same problem. For analysing each crash reports, it may take more time. This motivates, to present the solution to analyse the crash reports and cluster the duplicate crash reports based on call stack similarities and store them into unique bucket, so that development resources can be optimized. In this paper, clustering the duplicate crash report of open source is proposed based on the similar information in the call stack. Hierarchical clustering technique is used to cluster the duplicate crash reports into unique bucket. Mozilla and Firefox open source crash reports are used for experiment and performance evaluation is done using purity determined the purity of clusters up to 80%. This method helps to increase the efficiency and reduce the number of developers along with an improved time to fix the bug.
References
Asha Ramaraddi Belahunashi, Pushpalatha M N,” A Survey on analysing the crash reports of software applications”, International Research Journal of Engineering and Technology , Volume 4, Issue 6, pp.1014-1017, June 2017.
Divya R S, Pushpalatha M N, “Software CrashLocator: Locating the Faulty Functions by Analyzing the Crash Stack Information in Crash Reports”, International Journal of Advanced Engineering, Management and Science (IJAEMS), Vol-2, Issue-5, pp.269-273, May- 2016
Rongxin Wu, Hongyu Zhang, Shing-Chi Cheung, and Sunghun Kim, “CrashLocator: Locating Crashing Faults Based on Crash Stacks”, ISSTA 2014 Proceedings of the 2014 International Symposium on Software Testing and Analysis, Pages 2014-214, 2014
Yingnong Dang, Rongxin Wu, Hongyu Zhang, Dongmei Zhang, and Peter Nobel, “Rebucket: a method for clustering duplicate crash reports based on call stack similarity”. In Proceedings of the 34th International Conference on Software Engineering, pages 1084– 1093. IEEE Press, 2012.
P. Runeson, M. Alexandersson, and O. Nyholm, “Detection of Duplicate Defect Reports Using Natural Language Processing”, in Proc. ICSE 2007, Minneapolis,USA, pp. 499-510, May 2007.
X. Wang, L. Zhang, T. Xie, J. Anvik, and J. Sun, "An approach to detecting duplicate bug reports using natural language and execution information", in Proc. ICSE’08, Leipzig, Germany, pp. 461-470, 2008
D. Kim, X. Wang, S. Kim, A. Zeller, S. Cheung, and S. Park, “Which crashes should i fix first? Predicting top crashes at an early stage to prioritize debugging efforts”, IEEE Transactions on Software Engineering, pp. 430-447, 2011.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
