An Analysis on the Performance of Rapid Miner and R Programming Language as Data Pre-processing Tools for Unsupervised Form of Insurance Claim Dataset

Authors

  • Thomas ss
  • Sheshasaayee a

Keywords:

Data analytics, data pre-processing, noise removal, clean data, , Rapid Miner, R programming

Abstract

Data Science has emerged as a super science in almost all the sectors of analytics. Data Mining is the key runner and the pillar stone of data analytics. The analysis and study of any form of data has become so relevant in todays’ scenario and the output from these studies give great societal contributions and hence are of great value. Data analytics involves many steps and one of the primary and the most important one is data pre-processing stage. Raw data has to be cleaned, stabilized and processed to a new form to make the analysis easier and correct. Many pre-processing tools are available but this paper specifically deals with the comparative study of two tools such as Rapid Miner and R programming language which are predominantly used by data analysts. The output of the paper gives an insight into the weightage of the particular tool which can be recommended for better data pre-processing

References

[1] D. Thornton, G. Van Capelleveen, M. Poel, J. Van Hillegersberg, and R. M. Mueller, “Outlier-based Health Insurance Fraud Detection for U.S. Medicaid Data,” Proc. 16th Int. Conf. Enterp. Inf. Syst., pp. 684–694, 2014.

[2] S. B. Kotsiantis, D. Kanellopoulos, and P. E. Pintelas, “Data preprocessing for supervised leaning,” Int. J. Comput. Sci., vol. 1, no. 1, pp. 111–117, 2006.

[3] D. P. Methods, “Data_Preprocessing.”

[4] T. H. Zolt´an Prekopcs´ak, G´abor Makrai and C. G´asp´ar-Papanek∗, “Radoop: Analyzing Big Data with RapidMiner and Hadoop.” http://www.iasri.res.in/ebook/win_school_aa/notes/Data_Preprocessing.pdf, “No Title.” .

[6] A. FAMILI, W. SHEN, R. WEBER, and E. SIMOUDIS, “Data preprocessing and intelligent data analysis,” Intell. Data Anal., vol. 1, no. 1–4, pp. 3–23, 1997.

[7] C. M. Teng, “Correcting noisy data,” Proc 16th Int. Conf Mach. Learn., pp. 239–248, 1999.

[8] K. Rangra and K. L. Bansal, “Comparative Study of Data Mining Tools,” Int. J. Adv. Res. Comput. Sci. Softw. Eng., vol. 4, no. 6, pp. 2277–128, 2014.

[9] https://en.wikipedia.org/wiki/RapidMiner, “No Title.” .

[10] Y. Ramamohan, K. Vasantharao, C. K. Chakravarti, and A. S. K. Ratnam, “A Study of Data Mining Tools in Knowledge Discovery Process,” Int. J. Soft Comput. Eng., 2012.

[11] Https://www.r-project.org/about.html, “No Title.” .

[12] A. Jović, K. Brkić, and N. Bogunović, “An overview of free software tools for general data mining,” 2014 37th Int. Conv. Inf. Commun. Technol. Electron. Microelectron. MIPRO 2014 - Proc., no. May, pp. 1112–1117, 2014.

Downloads

Published

2025-11-25

How to Cite

[1]
S. S. Thomas and A. Sheshasaayee, “An Analysis on the Performance of Rapid Miner and R Programming Language as Data Pre-processing Tools for Unsupervised Form of Insurance Claim Dataset”, Int. J. Comp. Sci. Eng., vol. 7, no. 5, pp. 1–4, Nov. 2025.