Unknown

Dataset Information

0

Parallel MapReduce: Maximizing Cloud Resource Utilization and Performance Improvement Using Parallel Execution Strategies.


ABSTRACT: MapReduce is the preferred cloud computing framework used in large data analysis and application processing. MapReduce frameworks currently in place suffer performance degradation due to the adoption of sequential processing approaches with little modification and thus exhibit underutilization of cloud resources. To overcome this drawback and reduce costs, we introduce a Parallel MapReduce (PMR) framework in this paper. We design a novel parallel execution strategy of Map and Reduce worker nodes. Our strategy enables further performance improvement and efficient utilization of cloud resources execution of Map and Reduce functions to utilize multicore environments available with computing nodes. We explain in detail makespan modeling and working principle of the PMR framework in the paper. Performance of PMR is compared with Hadoop through experiments considering three biomedical applications. Experiments conducted for BLAST, CAP3, and DeepBind biomedical applications report makespan time reduction of 38.92%, 18.00%, and 34.62% considering the PMR framework against Hadoop framework. Experiments' results prove that the PMR cloud computing platform proposed is robust, cost-effective, and scalable, which sufficiently supports diverse applications on public and private cloud platforms. Consequently, overall presentation and results indicate that there is good matching between theoretical makespan modeling presented and experimental values investigated.

SUBMITTER: Al-Absi AA 

PROVIDER: S-EPMC6207866 | biostudies-literature | 2018

REPOSITORIES: biostudies-literature

altmetric image

Publications

Parallel MapReduce: Maximizing Cloud Resource Utilization and Performance Improvement Using Parallel Execution Strategies.

Al-Absi Ahmed Abdulhakim AA   Al-Sammarraie Najeeb Abbas NA   Shaher Yafooz Wael Mohamed WM   Kang Dae-Ki DK  

BioMed research international 20181017


MapReduce is the preferred cloud computing framework used in large data analysis and application processing. MapReduce frameworks currently in place suffer performance degradation due to the adoption of sequential processing approaches with little modification and thus exhibit underutilization of cloud resources. To overcome this drawback and reduce costs, we introduce a Parallel MapReduce (<i>PMR</i>) framework in this paper. We design a novel parallel execution strategy of Map and Reduce worke  ...[more]

Similar Datasets

| S-EPMC4709609 | biostudies-literature
| S-EPMC6290780 | biostudies-literature
| S-EPMC6480938 | biostudies-literature
| S-EPMC5547731 | biostudies-other
| S-EPMC3532373 | biostudies-literature
| S-EPMC7959626 | biostudies-literature
| S-EPMC4978928 | biostudies-literature
| S-EPMC4246436 | biostudies-literature
| S-EPMC6931271 | biostudies-literature
| S-EPMC8096093 | biostudies-literature