IMPLEMENTATION OF CLOUD NATIVE INFRASTRUCTURE DESIGN FOR AI/ML EXPERIMENTS USING KUBERNETES

Bibliographic Details
Main Author: Anindhita Chandra, Indira
Format: Final Project
Language: Indonesian
Online Access: https://digilib.itb.ac.id/gdl/view/82270
Institution: Institut Teknologi Bandung
id id-itb.:82270
spelling id-itb.:82270 2024-07-07T04:43:56Z
keywords cloud native, Kubernetes, AI/ML, cloud native infrastructure, accessibility, scalability, resource optimization
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
description This research covers the design and implementation of a cloud-native infrastructure for AI/ML experiments using Kubernetes. The main goal is to build an infrastructure that supports AI/ML experiments with high flexibility and scalability. Kubernetes was chosen as the container orchestration platform because of its ability to manage dynamic and complex workloads. The implementation involves configuring a Kubernetes cluster of several nodes, including master and worker nodes equipped with GPUs and CPUs. Additional configuration includes a storage class for storage management, load balancer settings for distributing network traffic, and a monitoring platform built on Prometheus and Grafana to track system performance. Kubeflow is integrated as the main framework for managing AI/ML experiments. This process ensures that the infrastructure can be operated and optimized according to user needs.
Testing was conducted to evaluate the performance and efficiency of the resulting infrastructure. Accessibility testing covered several usage scenarios on different devices, including PCs, laptops, and phones. Resource usage testing covered scenarios in which multiple users accessed the system and ran AI/ML workloads with different configurations.
Analysis of the test results shows that the cloud-native infrastructure has several key advantages. The system supports dynamic scalability and also improves the efficiency of resource usage. Container technology and Kubernetes orchestration allow resources to be added or removed in real time, which is crucial for AI/ML experiments that require heavy computation. The monitoring platform enables continuous performance monitoring, which makes potential issues easier to identify and resolve.
This research demonstrates that designing and implementing cloud-native infrastructure with Kubernetes can significantly improve efficiency and effectiveness in managing AI/ML workloads. The infrastructure supports a range of computational needs and provides the flexibility and scalability required for a dynamic research environment.
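As an illustration of the kind of GPU-backed workload the abstract describes, the following is a minimal sketch, not the author's configuration. It builds a Kubernetes Pod manifest that requests GPUs for a single experiment run. The function name `gpu_pod_manifest`, the container name, the label, the image name, and the default CPU and memory values are all hypothetical; the resource name `nvidia.com/gpu` assumes the cluster's worker nodes run the NVIDIA device plugin, which the record does not confirm.

```python
import json


def gpu_pod_manifest(name: str, image: str, gpus: int = 1,
                     cpu: str = "2", memory: str = "8Gi") -> dict:
    """Build a Pod manifest (as a dict) for one experiment run.

    All names and default values here are illustrative assumptions.
    """
    if gpus < 0:
        raise ValueError("gpus must be non-negative")

    limits = {"cpu": cpu, "memory": memory}
    if gpus:
        # Assumption: GPUs are exposed through the NVIDIA device plugin,
        # which advertises them under this extended resource name.
        limits["nvidia.com/gpu"] = gpus

    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name, "labels": {"app": "ml-experiment"}},
        "spec": {
            # Run the experiment once; do not restart it when it exits.
            "restartPolicy": "Never",
            "containers": [{
                "name": "trainer",
                "image": image,
                "resources": {"limits": limits},
            }],
        },
    }


if __name__ == "__main__":
    manifest = gpu_pod_manifest("demo-train", "example.org/trainer:latest")
    # kubectl accepts JSON manifests as well as YAML.
    print(json.dumps(manifest, indent=2))
```

The printed JSON could be saved to a file and submitted with `kubectl apply -f`. Setting `gpus=0` yields a CPU-only Pod, which mirrors the mix of GPU and CPU worker nodes mentioned in the abstract.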