DEVELOPMENT OF DYNAMIC RESOURCES SCHEDULING ON APACHE SPARK ON TOP OF KUBERNETES
Apache Spark is a large-scale distributed data processing framework. It distributes work across several computers that together form a cluster, which makes processing scalable. Kubernetes makes distributed processing easier...
Main Author: | Nugroho, Fajar |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/43641 |
Institution: | Institut Teknologi Bandung |
id |
id-itb.:43641 |
---|---|
spelling |
id-itb.:43641 2019-09-27T15:16:42Z DEVELOPMENT OF DYNAMIC RESOURCES SCHEDULING ON APACHE SPARK ON TOP OF KUBERNETES Nugroho, Fajar Indonesia Final Project Apache Spark, Kubernetes, distributed computation, dynamic resource allocation. INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/43641 text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Apache Spark is a large-scale distributed data processing framework. It distributes work across several computers that together form a cluster, which makes processing scalable. Kubernetes makes distributed processing easier, and Apache Spark already supports running on Kubernetes. However, computing resources are still allocated statically. Static allocation has several limitations, including suboptimal system utilization and a number of executors that cannot adapt to the resources available. This project therefore develops dynamic allocation of Apache Spark computational resources on Kubernetes, with the aim of increasing both system utilization and Apache Spark performance. Dynamic resource allocation was implemented successfully, and its performance varied across cases. In the experiments conducted, Apache Spark on Kubernetes with dynamic resource allocation performed worse than with static resource allocation. In cases where executor pods fail often, however, Apache Spark on Kubernetes with dynamic resource allocation performed better. |
format |
Final Project |
author |
Nugroho, Fajar |
title |
DEVELOPMENT OF DYNAMIC RESOURCES SCHEDULING ON APACHE SPARK ON TOP OF KUBERNETES |
url |
https://digilib.itb.ac.id/gdl/view/43641 |