Decision Trees for Mining Data Streams Based on the Gaussian Approximation
Keywords:
Data steam, decision trees, information gain, Gaussian approximationAbstract
Decision Trees are considered to be one of the most popular approaches for representing classifiers. Researchers from various disciplines such as statistics, machine learning, pattern recognition, and data mining have dealt with the issue of growing a decision tree from available data. The key point of constructing the decision tree is to determine the best attribute to split the considered node. Several methods to solve this problem were presented so far. However, they are either wrongly mathematically justified or time-consuming. The primary comparison parameters are time and accuracy. Also shown efforts made for handling the change in the concept and they are compared in terms of memory, technique and accuracy. Our method ensures, with a high probability set by the user, that the best attribute chosen in the considered node using a finite data sample is the same as it would be in the case of the whole data stream.
References
Caiyan Dai and Ling Chen, "An Algorithm for Mining Frequent Closed Itemsets with Density from Data Streams", International Journal of Computer Sciences and Engineering, Volume-04, Issue-02, Page No (40-48), Feb -2016, E-ISSN: 2347-2693
P. Argentiero, R. Chin and P. Beaudet, "An automated approach to the design of decision tree classifiers," IEEE Trans. Pattern Anal. Mach. Intell. PAMI-4, 51-57 (1982).
P. Fletcher and M.j.D. Powell,"A rapid decent method for minimization," Computer Journal, Vol.6, ISS.2, 163-168 (1963).
Rudolf Ahlsmede and Ingo Wegeru, Search problems, Wiley-Interscience, 1987.
K.S. Fu, Sequential methods in pattern recognition and machine learning, Academic press, 1998.
D. E. Gustafson, S. B. Gelfand, and S. K. Mitter, “ A nonparametric multiclass partitioning methods for classification,” in proc. 5th int. conf. pattern Recognition, 654-659 (1980).
E. G. Henrichon,Jr. and K. S. Fu, "A nonparametric partitioning procedure for pattern classification," IEEE Trans. Computer., Vol. C-18, 604-624,(1969).
G. R. Dattatreya and V. V. S. Sarma,"Bayesian and decision tree approaches for pattern recognition including feature measurement costs," IEEE Trans. Pattern Anal. Mach. Intell. Vol. PAMI-3, 293-298, (1981).
R. L. P. Chang and T. Pavlidis, "Fuzzy decision tree algorithms," IEEE Trans Syst. Man Cybernet., vol. SMC-7, 28-35 (1977)
J. Aczel and J. Daroczy, On measures of information and their characterizations, New York: Academic, 1975.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
