Performance Analysis of Hadoop with Pseudo-Distributed Mode on Different Machines

Authors

  • Mittal R Department of Computer Science & Engg.,Punjab Technical University, India
  • Bagga R Department of Computer Science & Engg.,Punjab Technical University, India

Keywords:

Big Data, Hadoop, MapReduce, Pseudo-distributed Mode, Distributed Programming

Abstract

Data cannot be managed by the traditional database management systems when it comes in a large amount. So there comes the Big Data. Hadoop and MapReduce are the solution to handle, manage and analyze Big Data. Hadoop is an open source implementation of MapReduce programming paradigm which is a parallel distributed programming model for handling large data intensive applications. In this paper, we present our experimental work done on Hadoop with pseudo-distributed mode on different machines and analyze the time taken by Hadoop to perform the same operations on different machines.

References

Xuelian Lin, Zide Meng, Chuan Xu, Meng Wang,”A Pratical Performance Model for Hadoop MapReduce”, in proc. Of the 2012 IEEE International Conference on Cluster Computing Workshops,ISBN: 978-1-4673-2893-7,Page No (231-239), Sept 24-28,2012.

M. Maurya, S. Mahajan,”Performance Analysis of MapReduce Programs on Hadoop Cluster”, in proc. of 2012 World Congress on Information and Communication Technologies, ISBN:978-1-4673-4806-5,Page No (505-510), Oct 30-Nov 2,2012.

M. Ishii, Jungkyu Han, H. Mankino,”Design and Performance Evaluation for Hadoop Clusters on Virtualized Environment”, in proc. of 2103 International Conference on Information Networking, E-ISBN:978-1-4673-5741-8, Page No (244-249), Jan 28-30,2013.

Han Jungkyu, M. Ishii, H. Makino,”A Hadoop Performance Model For Multi-Rack Clusters”,in proc. of 2013 5th International Conference on Computer Science and Information Technology, Page No (265-274), Mar 27-27,2013.

Zhuoyao Zhang, Ludmila Cherksova, Boon Thau Loo,”Performance Modeling od MapReduce Jobs in Heterogeneous Cloud Environments”, in proc. of the 2013 IEEE Sixth International Conference on Cloud Computing, ISBN: 978-0-7695-5028-2, Page No (839-846), June 28- July 3,2013.

J. Nandimath, E. Banerjee, A.Patil, P. Kakade, “Big Data Analysis using Apache Hadoop”, in proc. of 2013 IEEE 14th International Conference on Information Reuse and Intergration, Page No (700-703), Aug 14-16,2013.

A. Pal, K.Jain, P.Agarwal,S.Agarwal, “A Performance Analysis of MapReduce Task With Large Number of Files Dataset in Big Data Using Hadoop”, in proc. of 2014 Fourth International Conference on Communication Systems and Network Technologies, ISBN: 978-1-4799-3069-2, Page No (587-591), Apr 07-09,2014.

Invanilton Polato, Reginaldo Re, Alfredo Goldman, Fabio Kon, “A Comprehensive view of Hadoop Research- A Systematic Literature Review”, Elsevier- Journal of Network and Computer Applications,Volume-46,Page No (1-25), Aug 2014.

Chia-Wei Lee, Kuang-Yu Hsieh Sun-Yuan Hsieh , Hung-Chang Hsiao,”A Dynamic Data Placement Strategy for Hadoop in Heterogeneous Environments ”, Elsevier-Big Data Research, Volume-1, Page No (14-22), Aug 2014.

D. Dev, R. Patgiri, “Performance Evaluation of HDFS in Big Data Management”, in proc. of 2014 International Conference on High Performance Computing and Applications,ISBN: 978-1-4799-5957-0, Page No (1-7), Dec 22-24,2014.

M.F. Hyder, M.A. Ismail, H. Ahmed, “Performance Comparison of Hadoop Clusters Configured on Virtual Machines and as a Cloud Service”, in proc. of 2014 International Conference on International Technologies,ISBN: 978-1-4799-6088-0, Page No (60-64), Dec 8-9,2014.

Hadoop Tutorial [online]. Available: https:// hadoop.apache.org

Downloads

Published

2025-11-10

How to Cite

[1]
R. Mittal and R. Bagga, “Performance Analysis of Hadoop with Pseudo-Distributed Mode on Different Machines”, Int. J. Comp. Sci. Eng., vol. 3, no. 6, pp. 113–117, Nov. 2025.

Issue

Section

Research Article