A Novel Data AggregationTechnique for Removing Redundant Data in Hadoop

Authors

Uday Shankar SV Department of ISE, SJB Institute of Technology, India
Anvesh Naik Department of ISE, SJB Institute of Technology, India
Manoj CK Department of ISE, SJB Institute of Technology, India
Praveen B Department of ISE, SJB Institute of Technology, India
Yadush BR Department of ISE, SJB Institute of Technology, India

Keywords:

herewe are grouping the frequent itemsetand remove the redundant data

Abstract

Hadoop is the software framework which was developed by Apache Software Foundation.Hadoop framework is written in java with purpose to handle large amount of data. Hadoop manages huge volume of data.Hadoop runs the task under the MapReduce algorithm. MapReduce is a programming model suitable for processing of huge data. MapReduce framework has two phase, map phase and reduce phase.a mapredce job is usually splits the input data set into independent chunks,which is done by map phase.the framework sorts the output of the map which are input to reduce framework. To running frequent itemset require more resource and time consuming. To overcome this problem here we implementing the nobel data aggregation technique.

References

[1] Y. Xun, J. Zhang, and X. Qin, “Fidoop: Parallel mining of frequent itemsets using mapreduce,” IEEE Transactions on Systems,Man ,and Cybernetics: Systems, doi: 10.1109/TSMC.2015.2437327, 2015.

[2] J. Leskovec, A. Rajaraman, and J. D. Ullman, Mining of massive datasets. Cambridge University Press, 2014.

[3] M. Liroz-Gistau, R. Akbarinia, D. Agrawal, E. Pacitti, and P. Valduriez,“Data partitioning for minimizing transferred data in mapreduce,” in Data Management in Cloud, Grid and P2P Systems. Springer,2013.

[4] T. Kirsten, L. Kolb, M. Hartung, A. Groß, H. K¨opcke, and E. Rahm,“Data partitioning for parallel entity matching,” Proceedings of theVLDB Endowment, vol. 3, no. 2, 2010.

Downloads

PDF ⁰

Published

2025-11-26

How to Cite

[1]

U. S. SV, A. Naik, M. CK, P. B, and Y. BR, “A Novel Data AggregationTechnique for Removing Redundant Data in Hadoop”, Int. J. Comp. Sci. Eng., vol. 7, no. 15, pp. 270–271, Nov. 2025.

Download Citation

Issue

Vol. 7 No. 15 (2019): IJCSE Special Issue May Edition

Section

Research Article

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.

A Novel Data AggregationTechnique for Removing Redundant Data in Hadoop

Authors

Keywords:

Abstract

References

Downloads

Published

How to Cite

Issue

Section

License

Make a Submission

Journal Information

UGC Gazette Regulation

Join Editorial Board

Information

Current Issue

Keywords