Big Data in Cloud Environment

Authors

  • Poornima Sharma SS, RGPV University, Bhopal
  • Varun Garg SS, RGPV University, Bhopal, India
  • Randeep Kaur CSE,GGITS Jabalpur, RGPV University, Bhopal, India
  • Satendra Sonare CSE,GGITS Jabalpur, RGPV University, Bhopal, India

Keywords:

Big Data, Cloud computing, Map/Reduce

Abstract

Big Data concerns large-volume, complex, growing data sets with multiple, autonomous sources. With the fast development of networking, data storage, and the data collection capacity, Big Data is now rapidly expanding in all science and engineering domains, including physical, biological and biomedical sciences. However, big data entails a huge commitment of hardware and processing resources, making adoption costs of big data technology prohibitive to small and medium sized businesses. Cloud computing offers the promise of big data implementation to small and medium sized businesses. Big Data processing is performed through a programming paradigm known as MapReduce. Typically, implementation of the MapReduce paradigm requires networked attached storage and parallel processing. The computing needs of MapReduce programming are often beyond what small and medium sized business are able to commit. Cloud computing is on-demand network access to computing resources, provided by an outside entity. Common deployment models for cloud computing include platform as a service (PaaS), software as a service (SaaS), infrastructure as a service (IaaS) & hardware as a service (HaaS).

References

. J. Dean and S. Ghemawa, “MapReduce: Simplified Data Processing on Large Clusters”, Google Labs, OSDI 2004, (2004), pp. 137–150.

. Apache Hadoop Project, http://hadoop.apache.org/.

. B. Stephens, “Building a business on an open source distributed computing”, Oreilly Open Source Convention (OSCON) 2009, (2009) July 20-24, San Jose, CA

. W. Kim, “MapReduce Debates and Schema-Free”, Coord, (2010) March 3.

. J. Lin and C. Dyer, “Data-Intensive Text Processing with MapReduce”, Tutorial at the 11th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL HLT 2010), (2010) June, Los Angeles, California

. J. Woo, “Introduction to Cloud Computing”, the 10th KOCSEA 2009 Symposium, UNLV, (2009) December 18-19.

. J. Woo, “The Technical Demand of Cloud Computing”, Korean Technical Report of KISTI (Korea Institute of Science and Technical Information), (2011) February.

. J. Woo, “Market Basket Analysis Example in Hadoop”, http://dal-cloudcomputing.blogspot.com/2011/03/ market-basket-analysis-example-in.html, (2011) March.

. Aster Data, “SQL MapReduce framework”, http://www.asterdata.com/product/advanced-analytics.php.

. Apache HBase, http://hbase.apache.org/.

. J. Lin and C. Dyer, “Data-Intensive Text Processing with MapReduce”, Morgan & Claypool Publishers, (2010).

. GNU Coord, http://www.coordguru.com/.

. J. Woo, D. -Y. Kim, W. Cho and M. Jang, “Integrated Information Systems Architecture in e-Business”, The 2007 international Conference on e-Learning, e-Business, Enterprise Information Systems, e-Government, and Outsourcing, Las Vegas, (2007) June 26-29

Downloads

Published

2013-11-30

How to Cite

[1]
P. Sharma, V. Garg, R. Kaur, and S. Sonare, “Big Data in Cloud Environment”, Int. J. Comp. Sci. Eng., vol. 1, no. 3, pp. 15–17, Nov. 2013.