Performance evaluation of Hadoop Distributed File System
Keywords:
BigData, HDFS, MapReduce, Namenode, Datanode, Jobtracker, TasktrackerAbstract
Huge amounts of data are required to build internet search engines and therefore large number of machines to process this entire data. The Apache Hadoop is a framework that allows for the distributed processing of large data sets across clusters of machines. The Hadoop having two modules: 1. Hadoop distributed file system and 2. MapReduce. The Hadoop distributed file system is different from the local normal file system. The hdfs can be implemented as single node cluster and multi node cluster. The large datasets are processed more efficiently by the multi node clusters. By increasing number of nodes the data will be processed faster than the fewer nodes.
References
Linthala Srinithya, Dr. G. Venkata Rami Reddy, “Performance Evaluation of Hadoop Distributed file System and Local File System” in IJSR ISSN: 2319-7064, Volume 3 Issue 9, September 2014.
Liu Liu, Jiangtao Yin, Lixin Gao, “Efficient Social Network Data Query Processing on MapReduce” ACM August 16, 2013.
E. Dede, M. Govindaraju, D. Gunter, R. Canon, L. Ramakrishnan,”Performance Evaluation of a MongoDB and Hadoop Platform for Scientific Data Analysis”.
Christos Doulkeridis, Kjetil Norvag, “A Survey of Large-Scale Analytical Query Processing in MapReduce”.
Stephen Kaisler, Frank Armour, J. Alberto Espinosa, William Money, “Big Data: Issues and Challenges Moving Forward”.
Hadoop The Definitive Guide,©2012, Tom White.
K.Udhaya Malar,D.Ragupathi and G.M.Prabhu, "The Hadoop Dispersed File system: Balancing Movability And Performance", IJCSE, Volume-2, Issue-9, september-2014.
Apache Hadoop. http://hadoop.apache.org/ Tuesday, June 23, 2015.
Hadoop multinode cluster configuration, http://hashprompt.blogspot.in/2014/06/multi-node-hadoop-cluster-on-ubuntu-1404.html, Wednesday, August 19, 2015.
Downloads
Published
How to Cite
Issue
Section
License

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors contributing to this journal agree to publish their articles under the Creative Commons Attribution 4.0 International License, allowing third parties to share their work (copy, distribute, transmit) and to adapt it, under the condition that the authors are given credit and that in the event of reuse or distribution, the terms of this license are made clear.
