IMPLEMENTATION OF CLOUD NATIVE INFRASTRUCTURE DESIGN FOR AI/ML EXPERIMENTS USING KUBERNETES
This research focuses on the design implementation of cloud-native infrastructure for AI/ML experiments using Kubernetes. The main goal of this research is to build an infrastructure that can support AI/ML experiments with high flexibility and scalability. Kubernetes was chosen as the container o...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/82270 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:82270 |
---|---|
spelling |
id-itb.:822702024-07-07T04:43:56ZIMPLEMENTATION OF CLOUD NATIVE INFRASTRUCTURE DESIGN FOR AI/ML EXPERIMENTS USING KUBERNETES Anindhita Chandra, Indira Indonesia Final Project cloud native, Kubernetes, AI/ML, cloud native infrastructure, accessibility, scalability, resources optimalization INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/82270 This research focuses on the design implementation of cloud-native infrastructure for AI/ML experiments using Kubernetes. The main goal of this research is to build an infrastructure that can support AI/ML experiments with high flexibility and scalability. Kubernetes was chosen as the container orchestration platform due to its ability to manage dynamic and complex workloads. The design implementation involves configuring a Kubernetes cluster consisting of several nodes, including master and worker nodes equipped with GPUs and CPUs. Additional configurations include a storage class for storage management, load balancer settings for network traffic distribution, and a monitoring platform using Prometheus and Grafana to monitor system performance. Kubeflow is also integrated as the main framework to facilitate the management of AI/ML experiments. This process ensures that the infrastructure can be operated and optimized according to user needs. Testing was conducted to evaluate the performance and efficiency of the built infrastructure. Accessibility testing involved several usage scenarios with various devices, including PCs, laptops, and phones. Additionally, resource usage testing was carried out with various scenarios, involving multiple users accessing and running AI/ML workloads with different configurations. Analysis of the test results shows that the built cloud-native infrastructure has several key advantages. The system not only supports dynamic scalability but also improves resource usage efficiency. The use of container technology and Kubernetes orchestration allows for real-time addition or reduction of resources. This technology is crucial for AI/ML experiments that require high computation. Additionally, the implemented monitoring platform enables continuous performance monitoring, facilitating the identification and resolution of potential issues. This research successfully demonstrates that the design and implementation of cloud-native infrastructure using Kubernetes can significantly improve efficiency and effectiveness in managing AI/ML workloads. This infrastructure not only supports various computational needs but also provides the flexibility and scalability required for a dynamic research environment. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
This research focuses on the design implementation of cloud-native infrastructure
for AI/ML experiments using Kubernetes. The main goal of this research is to build
an infrastructure that can support AI/ML experiments with high flexibility and
scalability. Kubernetes was chosen as the container orchestration platform due to
its ability to manage dynamic and complex workloads.
The design implementation involves configuring a Kubernetes cluster consisting of
several nodes, including master and worker nodes equipped with GPUs and CPUs.
Additional configurations include a storage class for storage management, load
balancer settings for network traffic distribution, and a monitoring platform using
Prometheus and Grafana to monitor system performance. Kubeflow is also
integrated as the main framework to facilitate the management of AI/ML
experiments. This process ensures that the infrastructure can be operated and
optimized according to user needs.
Testing was conducted to evaluate the performance and efficiency of the built
infrastructure. Accessibility testing involved several usage scenarios with various
devices, including PCs, laptops, and phones. Additionally, resource usage testing
was carried out with various scenarios, involving multiple users accessing and
running AI/ML workloads with different configurations.
Analysis of the test results shows that the built cloud-native infrastructure has
several key advantages. The system not only supports dynamic scalability but also
improves resource usage efficiency. The use of container technology and
Kubernetes orchestration allows for real-time addition or reduction of resources.
This technology is crucial for AI/ML experiments that require high computation.
Additionally, the implemented monitoring platform enables continuous
performance monitoring, facilitating the identification and resolution of potential
issues.
This research successfully demonstrates that the design and implementation of
cloud-native infrastructure using Kubernetes can significantly improve efficiency
and effectiveness in managing AI/ML workloads. This infrastructure not only
supports various computational needs but also provides the flexibility and
scalability required for a dynamic research environment. |
format |
Final Project |
author |
Anindhita Chandra, Indira |
spellingShingle |
Anindhita Chandra, Indira IMPLEMENTATION OF CLOUD NATIVE INFRASTRUCTURE DESIGN FOR AI/ML EXPERIMENTS USING KUBERNETES |
author_facet |
Anindhita Chandra, Indira |
author_sort |
Anindhita Chandra, Indira |
title |
IMPLEMENTATION OF CLOUD NATIVE INFRASTRUCTURE DESIGN FOR AI/ML EXPERIMENTS USING KUBERNETES |
title_short |
IMPLEMENTATION OF CLOUD NATIVE INFRASTRUCTURE DESIGN FOR AI/ML EXPERIMENTS USING KUBERNETES |
title_full |
IMPLEMENTATION OF CLOUD NATIVE INFRASTRUCTURE DESIGN FOR AI/ML EXPERIMENTS USING KUBERNETES |
title_fullStr |
IMPLEMENTATION OF CLOUD NATIVE INFRASTRUCTURE DESIGN FOR AI/ML EXPERIMENTS USING KUBERNETES |
title_full_unstemmed |
IMPLEMENTATION OF CLOUD NATIVE INFRASTRUCTURE DESIGN FOR AI/ML EXPERIMENTS USING KUBERNETES |
title_sort |
implementation of cloud native infrastructure design for ai/ml experiments using kubernetes |
url |
https://digilib.itb.ac.id/gdl/view/82270 |
_version_ |
1822997625175015424 |