Survey on Efficient Resource Utilization using Hadoop Cluster for Big Data Processing

Poonam Mahajan; Manish Patel; Amol Agarwal; Nikhil Raut; Devendra Gadekar

doi:10.17148/IJARCCE.2016.51117

← Back to VOLUME 5, ISSUE 11, NOVEMBER 2016

Survey on Efficient Resource Utilization using Hadoop Cluster for Big Data Processing

Poonam Mahajan, Manish Patel, Amol Agarwal, Nikhil Raut, Devendra Gadekar

DOI: 10.17148/IJARCCE.2016.51117

Abstract: Hadoop is a framework which is used to store and process large amount of data. Hadoop cluster is designed to analyze and store a huge amount of data. As day by day amount of data stored and processed is increasing rapidly, so we need such an optimal scheduling algorithm to meet the requirement. Job Scheduling is an important parameter to achieve high performance in Hadoop cluster. Hadoop scheduler is pluggable module used for resource allocation. Majorly used schedulers are FIFO, Fair and Capacity scheduler. In this paper we compare and classify parameter such as Average Response Time, Average waiting time and Fairness of various scheduling algorithm in different environment.

Keywords: Hadoop, MapReduce, Scheduling.

Downloads: Download PDF|DOI: 10.17148/IJARCCE.2016.51117

How to Cite:

[1] Poonam Mahajan, Manish Patel, Amol Agarwal, Nikhil Raut, Devendra Gadekar, “Survey on Efficient Resource Utilization using Hadoop Cluster for Big Data Processing,” International Journal of Advanced Research in Computer and Communication Engineering (IJARCCE), DOI: 10.17148/IJARCCE.2016.51117