Multi-objective scheduling of MapReduce jobs in big data processing

Data generation has increased drastically over the past few years due to the rapid development of Internet-based technologies. This period has been called the big data era. Big data offer an emerging paradigm shift in data exploration and utilization. The MapReduce computational paradigm is a well-k...

Full description

Saved in:

Bibliographic Details
Main Authors:	Hashem, Ibrahim Abaker Targio, Anuar, Nor Badrul, Marjani, Mohsen, Gani, Abdullah, Sangaiah, Arun Kumar, Sakariyah, Adewole Kayode
Format:	Article
Published:	Springer 2018
Subjects:	QA75 Electronic computers. Computer science
Online Access:	http://eprints.um.edu.my/21924/ https://doi.org/10.1007/s11042-017-4685-y
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Universiti Malaya

id	my.um.eprints.21924
record_format	eprints
spelling	my.um.eprints.219242019-08-08T08:10:52Z http://eprints.um.edu.my/21924/ Multi-objective scheduling of MapReduce jobs in big data processing Hashem, Ibrahim Abaker Targio Anuar, Nor Badrul Marjani, Mohsen Gani, Abdullah Sangaiah, Arun Kumar Sakariyah, Adewole Kayode QA75 Electronic computers. Computer science Data generation has increased drastically over the past few years due to the rapid development of Internet-based technologies. This period has been called the big data era. Big data offer an emerging paradigm shift in data exploration and utilization. The MapReduce computational paradigm is a well-known framework and is considered the main enabler for the distributed and scalable processing of a large amount of data. However, despite recent efforts toward improving the performance of MapReduce, scheduling MapReduce jobs across multiple nodes has been considered a multi-objective optimization problem. This problem can become increasingly complex when virtualized clusters in cloud computing are used to execute a large number of tasks. This study aims to optimize MapReduce job scheduling based on the completion time and cost of cloud service models. First, the problem is formulated as a multi-objective model. The model consists of two objective functions, namely, (i) completion time and (ii) cost minimization. Second, a scheduling algorithm using earliest finish time scheduling that considers resource allocation and job scheduling in the cloud is proposed. Lastly, experimental results show that the proposed scheduler exhibits better performance than other well-known schedulers, such as FIFO and Fair. Springer 2018 Article PeerReviewed Hashem, Ibrahim Abaker Targio and Anuar, Nor Badrul and Marjani, Mohsen and Gani, Abdullah and Sangaiah, Arun Kumar and Sakariyah, Adewole Kayode (2018) Multi-objective scheduling of MapReduce jobs in big data processing. Multimedia Tools and Applications, 77 (8). pp. 9979-9994. ISSN 1380-7501 https://doi.org/10.1007/s11042-017-4685-y doi:10.1007/s11042-017-4685-y
institution	Universiti Malaya
building	UM Library
collection	Institutional Repository
continent	Asia
country	Malaysia
content_provider	Universiti Malaya
content_source	UM Research Repository
url_provider	http://eprints.um.edu.my/
topic	QA75 Electronic computers. Computer science
spellingShingle	QA75 Electronic computers. Computer science Hashem, Ibrahim Abaker Targio Anuar, Nor Badrul Marjani, Mohsen Gani, Abdullah Sangaiah, Arun Kumar Sakariyah, Adewole Kayode Multi-objective scheduling of MapReduce jobs in big data processing
description	Data generation has increased drastically over the past few years due to the rapid development of Internet-based technologies. This period has been called the big data era. Big data offer an emerging paradigm shift in data exploration and utilization. The MapReduce computational paradigm is a well-known framework and is considered the main enabler for the distributed and scalable processing of a large amount of data. However, despite recent efforts toward improving the performance of MapReduce, scheduling MapReduce jobs across multiple nodes has been considered a multi-objective optimization problem. This problem can become increasingly complex when virtualized clusters in cloud computing are used to execute a large number of tasks. This study aims to optimize MapReduce job scheduling based on the completion time and cost of cloud service models. First, the problem is formulated as a multi-objective model. The model consists of two objective functions, namely, (i) completion time and (ii) cost minimization. Second, a scheduling algorithm using earliest finish time scheduling that considers resource allocation and job scheduling in the cloud is proposed. Lastly, experimental results show that the proposed scheduler exhibits better performance than other well-known schedulers, such as FIFO and Fair.
format	Article
author	Hashem, Ibrahim Abaker Targio Anuar, Nor Badrul Marjani, Mohsen Gani, Abdullah Sangaiah, Arun Kumar Sakariyah, Adewole Kayode
author_facet	Hashem, Ibrahim Abaker Targio Anuar, Nor Badrul Marjani, Mohsen Gani, Abdullah Sangaiah, Arun Kumar Sakariyah, Adewole Kayode
author_sort	Hashem, Ibrahim Abaker Targio
title	Multi-objective scheduling of MapReduce jobs in big data processing
title_short	Multi-objective scheduling of MapReduce jobs in big data processing
title_full	Multi-objective scheduling of MapReduce jobs in big data processing
title_fullStr	Multi-objective scheduling of MapReduce jobs in big data processing
title_full_unstemmed	Multi-objective scheduling of MapReduce jobs in big data processing
title_sort	multi-objective scheduling of mapreduce jobs in big data processing
publisher	Springer
publishDate	2018
url	http://eprints.um.edu.my/21924/ https://doi.org/10.1007/s11042-017-4685-y
_version_	1643691700164493312

Multi-objective scheduling of MapReduce jobs in big data processing

Similar Items