Automated Assistance for Data Mining Implementation
Keywords:
Classification, data mining, WEKA, ARFFAbstract
These days there are number of tools available for assistance in implementation of data mining. Some of them are RapidMiner, WEKA, Orange, Rattle, and KNIME. WEKA tool includes a number of techniques like classification, clustering, regression, etc. For solving classification problems this tool provide variety of strategies such as decision tree, neural networks, lazy classifiers etc. For each strategy, the tool allows the user to select specific values for large number parameters for e.g. in case of a neural network classifier, parameters are to be provided by user such as epochs, learning rate, momentum etc. With WEKA an expert user could study which strategy could be the best compatible for the any particular dataset. For that test run can be performed on the test dataset of the user using the various strategies and the resultant outputs are to be evaluated by the expert user its own. This is tedious task to be performed. This paper aims at developing a system that could work on its own even on the evaluation part and thus make it possible for effective implementation of data mining by developing a database to record the nature of data such as number and type of attributes, presence or absence of missing values etc. along with different values for developing classifier models and the accuracy of the classifier. Such a database can then be made available to the novice users to build a model based on past experience.
References
Data Mining: A Knowledge Discovery Approach, K. Cios, W. Pedrycz, R. Swiniarski, L. Kurgan, Springer, ISBN: 978-0-387-33333-5, 2007.
Data mining: concepts, models, methods, and algorithms, mehmed kantardzic, ISBN: 0471228524, Wiley-IEEE Press, 2002.
Ian Witten and Eibe Frank, Data Mining: Practical Machine Learning Tools and Techniques, 2nd Edition, Morgan Kaufmann, ISBN 0120884070, 2005.
WEKA manual.
Zdravko Markov, Ingrid Russell, An Introduction to the Weka DataMining System.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
