Decision Trees for Mining Data Streams Based on the Gaussian Approximation

Authors

  • Babu S Department of CSA, SCSVMV University ,Enathur, Kanchipuram, India
  • Fathima G Department of CSA, SCSVMV University ,Enathur, Kanchipuram, India

Keywords:

Data steam, decision trees, information gain, Gaussian approximation

Abstract

Decision Trees are considered to be one of the most popular approaches for representing classifiers. Researchers from various disciplines such as statistics, machine learning, pattern recognition, and data mining have dealt with the issue of growing a decision tree from available data. The key point of constructing the decision tree is to determine the best attribute to split the considered node. Several methods to solve this problem were presented so far. However, they are either wrongly mathematically justified or time-consuming. The primary comparison parameters are time and accuracy. Also shown efforts made for handling the change in the concept and they are compared in terms of memory, technique and accuracy. Our method ensures, with a high probability set by the user, that the best attribute chosen in the considered node using a finite data sample is the same as it would be in the case of the whole data stream.

References

Caiyan Dai and Ling Chen, "An Algorithm for Mining Frequent Closed Itemsets with Density from Data Streams", International Journal of Computer Sciences and Engineering, Volume-04, Issue-02, Page No (40-48), Feb -2016, E-ISSN: 2347-2693

P. Argentiero, R. Chin and P. Beaudet, "An automated approach to the design of decision tree classifiers," IEEE Trans. Pattern Anal. Mach. Intell. PAMI-4, 51-57 (1982).

P. Fletcher and M.j.D. Powell,"A rapid decent method for minimization," Computer Journal, Vol.6, ISS.2, 163-168 (1963).

Rudolf Ahlsmede and Ingo Wegeru, Search problems, Wiley-Interscience, 1987.

K.S. Fu, Sequential methods in pattern recognition and machine learning, Academic press, 1998.

D. E. Gustafson, S. B. Gelfand, and S. K. Mitter, “ A nonparametric multiclass partitioning methods for classification,” in proc. 5th int. conf. pattern Recognition, 654-659 (1980).

E. G. Henrichon,Jr. and K. S. Fu, "A nonparametric partitioning procedure for pattern classification," IEEE Trans. Computer., Vol. C-18, 604-624,(1969).

G. R. Dattatreya and V. V. S. Sarma,"Bayesian and decision tree approaches for pattern recognition including feature measurement costs," IEEE Trans. Pattern Anal. Mach. Intell. Vol. PAMI-3, 293-298, (1981).

R. L. P. Chang and T. Pavlidis, "Fuzzy decision tree algorithms," IEEE Trans Syst. Man Cybernet., vol. SMC-7, 28-35 (1977)

J. Aczel and J. Daroczy, On measures of information and their characterizations, New York: Academic, 1975.

Downloads

Published

2025-11-11

How to Cite

[1]
S. Babu and G. Fathima, “Decision Trees for Mining Data Streams Based on the Gaussian Approximation”, Int. J. Comp. Sci. Eng., vol. 4, no. 3, pp. 35–38, Nov. 2025.

Issue

Section

Review Article