Automated Assistance for Data Mining Implementation

Authors

  • Rajendra SP Information Technology, Savitribai Phule Pune University, India
  • Ramkrushna BK Information Technology, Savitribai Phule Pune University, India
  • Giridhar SC Information Technology, Savitribai Phule Pune University, India
  • Jayvant VG Information Technology, Savitribai Phule Pune University, India
  • Bhadane SN Information Technology, Savitribai Phule Pune University, India

Keywords:

Classification, data mining, WEKA, ARFF

Abstract

A number of tools are now available to assist in implementing data mining, among them RapidMiner, WEKA, Orange, Rattle, and KNIME. WEKA includes a range of techniques such as classification, clustering, and regression. For classification problems it provides a variety of strategies, such as decision trees, neural networks, and lazy classifiers. For each strategy, the tool lets the user select specific values for a large number of parameters; for a neural network classifier, for example, the user must supply parameters such as the number of epochs, the learning rate, and the momentum. With WEKA, an expert user can study which strategy is best suited to a particular dataset: test runs are performed on the user's dataset with the various strategies, and the expert must then evaluate the resulting outputs manually. This is a tedious task. This paper aims to develop a system that also carries out the evaluation step on its own, making effective implementation of data mining possible. It does so by building a database that records the nature of the data (such as the number and type of attributes and the presence or absence of missing values) along with the parameter values used to build classifier models and the accuracy of each classifier. Such a database can then be made available to novice users so that they can build a model based on past experience.
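
The kind of experience database described above can be illustrated with a minimal sketch. It is not the authors' system: the function names (`describe_arff`, `open_store`, `record_run`, `suggest`), the table layout, and the exact-match lookup are all assumptions made for illustration, and the accuracy figures are placeholders rather than measured results. The sketch profiles an ARFF dataset, stores the classifier name, its option string, and an accuracy value against that profile, and then returns the best stored configuration for a dataset with the same profile.

```python
import sqlite3

NUMERIC_TYPES = {"numeric", "real", "integer"}

def describe_arff(text):
    """Profile an ARFF document: attribute counts by type and missing values.

    Simplification: assumes unquoted attribute names without spaces.
    """
    profile = {"n_numeric": 0, "n_nominal": 0, "n_other": 0, "has_missing": False}
    in_data = False
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("%"):
            continue  # skip blank lines and ARFF comments
        if in_data:
            # ARFF marks a missing value with a bare '?'
            if "?" in [value.strip() for value in line.split(",")]:
                profile["has_missing"] = True
        elif line.lower().startswith("@attribute"):
            parts = line.split(None, 2)
            kind = parts[2].strip().lower() if len(parts) == 3 else ""
            if kind.startswith("{"):
                profile["n_nominal"] += 1  # nominal values are listed in braces
            elif kind in NUMERIC_TYPES:
                profile["n_numeric"] += 1
            else:
                profile["n_other"] += 1  # e.g. string or date attributes
        elif line.lower().startswith("@data"):
            in_data = True
    return profile

def open_store(path=":memory:"):
    """Create (or open) the hypothetical experience database."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS runs ("
        "n_numeric INTEGER, n_nominal INTEGER, has_missing INTEGER, "
        "classifier TEXT, params TEXT, accuracy REAL)"
    )
    return conn

def record_run(conn, profile, classifier, params, accuracy):
    """Store one classifier run together with the dataset profile."""
    conn.execute(
        "INSERT INTO runs VALUES (?, ?, ?, ?, ?, ?)",
        (profile["n_numeric"], profile["n_nominal"], int(profile["has_missing"]),
         classifier, params, accuracy),
    )
    conn.commit()

def suggest(conn, profile):
    """Return the most accurate stored run for an identical profile, or None."""
    return conn.execute(
        "SELECT classifier, params, accuracy FROM runs "
        "WHERE n_numeric = ? AND n_nominal = ? AND has_missing = ? "
        "ORDER BY accuracy DESC LIMIT 1",
        (profile["n_numeric"], profile["n_nominal"], int(profile["has_missing"])),
    ).fetchone()

SAMPLE_ARFF = """@relation weather
@attribute outlook {sunny, overcast, rainy}
@attribute temperature numeric
@attribute humidity numeric
@attribute play {yes, no}
@data
sunny,85,85,no
overcast,?,86,yes
"""

if __name__ == "__main__":
    store = open_store()
    sample_profile = describe_arff(SAMPLE_ARFF)
    # Accuracy values below are placeholders, not measured results.
    record_run(store, sample_profile, "weka.classifiers.trees.J48",
               "-C 0.25 -M 2", 0.64)
    record_run(store, sample_profile, "weka.classifiers.functions.MultilayerPerceptron",
               "-L 0.3 -M 0.2 -N 500", 0.71)
    print(suggest(store, sample_profile))
```

Matching on an identical profile keeps the sketch short; a practical system would more likely rank stored runs by similarity between dataset profiles, and would obtain the accuracy values from actual WEKA evaluations.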

Published

2015-02-28

How to Cite

[1]
S. P. Rajendra, B. K. Ramkrushna, S. C. Giridhar, V. G. Jayvant, and S. Bhadane, “Automated Assistance for Data Mining Implementation”, Int. J. Comp. Sci. Eng., vol. 3, no. 2, pp. 36–39, Feb. 2015.

Section

Research Article