Mobile deployment of stable diffusion models

The rapid development of Generative Artificial Intelligence (Gen AI) in recent years has posed significant challenges in terms of both cost and power consumption since training and inference processes require intensive GPU computation. Moreover, the use of GPUs in cloud-based platforms for Gen AI in...

Full description

Saved in:
Bibliographic Details
Main Author: Tran, Que An
Other Authors: Dusit Niyato
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/181081
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-181081
record_format dspace
spelling sg-ntu-dr.10356-1810812024-11-13T22:45:13Z Mobile deployment of stable diffusion models Tran, Que An Dusit Niyato College of Computing and Data Science DNIYATO@ntu.edu.sg Computer and Information Science The rapid development of Generative Artificial Intelligence (Gen AI) in recent years has posed significant challenges in terms of both cost and power consumption since training and inference processes require intensive GPU computation. Moreover, the use of GPUs in cloud-based platforms for Gen AI inference raises privacy concerns such as the potential leakage of user prompts to external service providers, which might lead to legal complications afterwards. This project investigates the potential use of Gen AI by optimizing and benchmarking the performance of the Stable Diffusion model on Android devices. By using the NCNN framework, the project enables Text-to-Image and Image-to-Image generation on mobile platforms without relying on cloud-based resources, addressing concerns such as cost, power consumption, and privacy. The project aims to develop an Android application that is capable of efficiently running the Stable Diffusion model. The implementation focuses on minimizing computational load and enhancing inference speed. By reusing prompts with text embedding models, the project refines the Distributed Diffusion optimization technique, contributing significantly to the integration of Stable Diffusion on mobile devices. The project also explores using cloud-based infrastructure for multi-model text-to-video generation. These efforts showcase Stable Diffusion's capabilities and suggest future possibilities for AI applications on mobile devices. The detailed implementation for this project is available at GitHub. Bachelor's degree 2024-11-13T22:45:13Z 2024-11-13T22:45:13Z 2024 Final Year Project (FYP) Tran, Q. A. (2024). Mobile deployment of stable diffusion models. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181081 https://hdl.handle.net/10356/181081 en SCSE23-0825 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
spellingShingle Computer and Information Science
Tran, Que An
Mobile deployment of stable diffusion models
description The rapid development of Generative Artificial Intelligence (Gen AI) in recent years has posed significant challenges in terms of both cost and power consumption since training and inference processes require intensive GPU computation. Moreover, the use of GPUs in cloud-based platforms for Gen AI inference raises privacy concerns such as the potential leakage of user prompts to external service providers, which might lead to legal complications afterwards. This project investigates the potential use of Gen AI by optimizing and benchmarking the performance of the Stable Diffusion model on Android devices. By using the NCNN framework, the project enables Text-to-Image and Image-to-Image generation on mobile platforms without relying on cloud-based resources, addressing concerns such as cost, power consumption, and privacy. The project aims to develop an Android application that is capable of efficiently running the Stable Diffusion model. The implementation focuses on minimizing computational load and enhancing inference speed. By reusing prompts with text embedding models, the project refines the Distributed Diffusion optimization technique, contributing significantly to the integration of Stable Diffusion on mobile devices. The project also explores using cloud-based infrastructure for multi-model text-to-video generation. These efforts showcase Stable Diffusion's capabilities and suggest future possibilities for AI applications on mobile devices. The detailed implementation for this project is available at GitHub.
author2 Dusit Niyato
author_facet Dusit Niyato
Tran, Que An
format Final Year Project
author Tran, Que An
author_sort Tran, Que An
title Mobile deployment of stable diffusion models
title_short Mobile deployment of stable diffusion models
title_full Mobile deployment of stable diffusion models
title_fullStr Mobile deployment of stable diffusion models
title_full_unstemmed Mobile deployment of stable diffusion models
title_sort mobile deployment of stable diffusion models
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/181081
_version_ 1816858988533252096