Semantic communications and generative artificial intelligence for user-centric metaverse services

Metaverse, the next frontier of the Internet, promises an immersive and intelligent world powered by advanced networks. At its core, the success of Metaverse services relies on the ability to deliver efficiency, adaptability, and satisfaction to users. Semantic Communications (SemCom) and Generative...

Full description

Saved in:
Bibliographic Details
Main Author: Du, Hongyang
Other Authors: Dusit Niyato
Format: Thesis-Doctor of Philosophy
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/175895
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-175895
record_format dspace
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Engineering
Generative artificial intelligence
Semantic communications
Metaverse
Network optimization
spellingShingle Computer and Information Science
Engineering
Generative artificial intelligence
Semantic communications
Metaverse
Network optimization
Du, Hongyang
Semantic communications and generative artificial intelligence for user-centric metaverse services
description Metaverse, the next frontier of the Internet, promises an immersive and intelligent world powered by advanced networks. At its core, the success of Metaverse services relies on the ability to deliver efficiency, adaptability, and satisfaction to users. Semantic Communications (SemCom) and Generative Artificial Intelligence (GenAI) are fundamental to supporting these requirements, which serve as the backbone for communications and computing paradigms. SemCom enables efficient, context-aware interactions essential for a seamless user experience, while GenAI drives the dynamic, personalized network management solutions generation to support Metaverse. In this thesis, I explore the pivotal roles of SemCom and GenAI in shaping user-centric Metaverse through innovations like Device-toDevice (D2D) information sharing, wireless sensing, and attention-aware resource allocation. These dimensions are explored in depth through subsequent chapters, showing how they collectively contribute to a more efficient, adaptable, and satisfying Metaverse experience. In Chapter 3, the focus is enhancing the efficiency of Mixed Reality (MR) technologies in Metaverse, specifically by addressing the computational constraints of Headset-Mounted Devices (HMDs). To mitigate these limitations, I introduce a full-duplex D2D SemCom strategy to reduce reliance on intensive computation through efficient information exchange. This innovative approach facilitates users sharing AI-generated content and semantic information, streamlining computational processes and enhancing the spatial coherence of computation outputs. I rigorously evaluate the performance of the full-duplex D2D communications using generalized small-scale fading models, focusing on achievable data rates and bit error probabilities. Furthermore, the chapter introduces a novel contract theoretic AI-generated incentive mechanism to enhance semantic information exchange, which, as demonstrated through numerical analysis, surpasses traditional deep reinforcement learning algorithms, including proximal policy optimization and soft actor-critic algorithms. The outcomes underscore the effectiveness and practicality of our contributions to advancing SemCom and GenAI for Metaverse. In Chapter 4, I explore the enhancement of adaptability in SemCom and GenAI for various wireless sensing tasks, such as user localization and activity detection, which are critical for user-avatar synchronization in Metaverse. Here, I introduce a novel paradigm: inverse SemCom. This approach does not extract semantic information from messages but encodes task-related source messages into a hypersource message, optimizing data transmission and storage. This chapter further details the development of an inverse semantic-aware wireless sensing framework for Metaverse, comprising three specialized algorithms for data sampling, Reconfigurable Intelligent Surface (RIS)-aided encoding, and GenAI-aided self-supervised decoding. A novel feature of this framework is the innovative RIS hardware, capable of encoding multiple signal spectrums into a single MetaSpectrum, utilizing a semantic hash sampling method for heightened encoding efficiency. Complementing this, a GenAI-aided self-supervised learning method is introduced to decode these MetaSpectrums precisely. Empirical evidence highlights a notable reduction in data volume and enhancement in accuracy for sensing tasks, underscoring the significant role of this approach in bolstering the adaptability and efficiency of SemCom for a more synchronized and responsive Metaverse. In Chapter 5, I study the significance of maximizing user satisfaction in Metaverse by enhancing personalized, immersive experiences, an endeavor traditionally constrained by the limitations of Ultra-Reliable and Low-Latency Communications (URLLC). To this end, I propose the evolution of URLLC into neXt-generation URLLC (xURLLC), incorporating personalized resource allocation to boost the Quality of Experience (QoE) significantly. This chapter elaborates on developing an optimal contract design framework, examining the interplay between Metaverse Service Providers (MSP) and network Infrastructure Providers (InP) to maximize user QoE while aligning with provider incentives. A pivotal introduction in this chapter is Meta-Immersion, an innovative metric that quantifies QoE by integrating objective performance indicators and subjective user perceptions. Furthermore, an attention-aware rendering capacity allocation scheme is developed to enhance QoE further within xURLLC by leveraging the associations between user attention and the attributes of AI-generated virtual Metaverse objects. Empirical validations using a user-object-attention dataset demonstrate that our approach can substantially improve QoE by an average of 20.1% over traditional URLLC. This represents a considerable advance in fostering a more engaging Metaverse, improving user satisfaction. In summary, this thesis constructs a comprehensive framework that positions SemCom and GenAI as supporting techniques in the evolution of Metaverse. Efficiency (Chapter 3) reduces latency and resource consumption, while adaptability (Chapter 4) ensures responsiveness to technological advancements and user preferences, collectively achieving a user-centric experience enhancement (Chapter 5). Thus, the methodologies and models introduced throughout this thesis collectively advance the theoretical understanding and practical applications of immersive Internet environments. Additionally, I explore potential future research directions, bringing insights for ongoing innovation and research in this dynamic and evolving domain.
author2 Dusit Niyato
author_facet Dusit Niyato
Du, Hongyang
format Thesis-Doctor of Philosophy
author Du, Hongyang
author_sort Du, Hongyang
title Semantic communications and generative artificial intelligence for user-centric metaverse services
title_short Semantic communications and generative artificial intelligence for user-centric metaverse services
title_full Semantic communications and generative artificial intelligence for user-centric metaverse services
title_fullStr Semantic communications and generative artificial intelligence for user-centric metaverse services
title_full_unstemmed Semantic communications and generative artificial intelligence for user-centric metaverse services
title_sort semantic communications and generative artificial intelligence for user-centric metaverse services
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/175895
_version_ 1800916282174865408
spelling sg-ntu-dr.10356-1758952024-06-03T06:51:19Z Semantic communications and generative artificial intelligence for user-centric metaverse services Du, Hongyang Dusit Niyato Soong Boon Hee Interdisciplinary Graduate School (IGS) Energy Research Institute @ NTU (ERI@N) DNIYATO@ntu.edu.sg, EBHSOONG@ntu.edu.sg Computer and Information Science Engineering Generative artificial intelligence Semantic communications Metaverse Network optimization Metaverse, the next frontier of the Internet, promises an immersive and intelligent world powered by advanced networks. At its core, the success of Metaverse services relies on the ability to deliver efficiency, adaptability, and satisfaction to users. Semantic Communications (SemCom) and Generative Artificial Intelligence (GenAI) are fundamental to supporting these requirements, which serve as the backbone for communications and computing paradigms. SemCom enables efficient, context-aware interactions essential for a seamless user experience, while GenAI drives the dynamic, personalized network management solutions generation to support Metaverse. In this thesis, I explore the pivotal roles of SemCom and GenAI in shaping user-centric Metaverse through innovations like Device-toDevice (D2D) information sharing, wireless sensing, and attention-aware resource allocation. These dimensions are explored in depth through subsequent chapters, showing how they collectively contribute to a more efficient, adaptable, and satisfying Metaverse experience. In Chapter 3, the focus is enhancing the efficiency of Mixed Reality (MR) technologies in Metaverse, specifically by addressing the computational constraints of Headset-Mounted Devices (HMDs). To mitigate these limitations, I introduce a full-duplex D2D SemCom strategy to reduce reliance on intensive computation through efficient information exchange. This innovative approach facilitates users sharing AI-generated content and semantic information, streamlining computational processes and enhancing the spatial coherence of computation outputs. I rigorously evaluate the performance of the full-duplex D2D communications using generalized small-scale fading models, focusing on achievable data rates and bit error probabilities. Furthermore, the chapter introduces a novel contract theoretic AI-generated incentive mechanism to enhance semantic information exchange, which, as demonstrated through numerical analysis, surpasses traditional deep reinforcement learning algorithms, including proximal policy optimization and soft actor-critic algorithms. The outcomes underscore the effectiveness and practicality of our contributions to advancing SemCom and GenAI for Metaverse. In Chapter 4, I explore the enhancement of adaptability in SemCom and GenAI for various wireless sensing tasks, such as user localization and activity detection, which are critical for user-avatar synchronization in Metaverse. Here, I introduce a novel paradigm: inverse SemCom. This approach does not extract semantic information from messages but encodes task-related source messages into a hypersource message, optimizing data transmission and storage. This chapter further details the development of an inverse semantic-aware wireless sensing framework for Metaverse, comprising three specialized algorithms for data sampling, Reconfigurable Intelligent Surface (RIS)-aided encoding, and GenAI-aided self-supervised decoding. A novel feature of this framework is the innovative RIS hardware, capable of encoding multiple signal spectrums into a single MetaSpectrum, utilizing a semantic hash sampling method for heightened encoding efficiency. Complementing this, a GenAI-aided self-supervised learning method is introduced to decode these MetaSpectrums precisely. Empirical evidence highlights a notable reduction in data volume and enhancement in accuracy for sensing tasks, underscoring the significant role of this approach in bolstering the adaptability and efficiency of SemCom for a more synchronized and responsive Metaverse. In Chapter 5, I study the significance of maximizing user satisfaction in Metaverse by enhancing personalized, immersive experiences, an endeavor traditionally constrained by the limitations of Ultra-Reliable and Low-Latency Communications (URLLC). To this end, I propose the evolution of URLLC into neXt-generation URLLC (xURLLC), incorporating personalized resource allocation to boost the Quality of Experience (QoE) significantly. This chapter elaborates on developing an optimal contract design framework, examining the interplay between Metaverse Service Providers (MSP) and network Infrastructure Providers (InP) to maximize user QoE while aligning with provider incentives. A pivotal introduction in this chapter is Meta-Immersion, an innovative metric that quantifies QoE by integrating objective performance indicators and subjective user perceptions. Furthermore, an attention-aware rendering capacity allocation scheme is developed to enhance QoE further within xURLLC by leveraging the associations between user attention and the attributes of AI-generated virtual Metaverse objects. Empirical validations using a user-object-attention dataset demonstrate that our approach can substantially improve QoE by an average of 20.1% over traditional URLLC. This represents a considerable advance in fostering a more engaging Metaverse, improving user satisfaction. In summary, this thesis constructs a comprehensive framework that positions SemCom and GenAI as supporting techniques in the evolution of Metaverse. Efficiency (Chapter 3) reduces latency and resource consumption, while adaptability (Chapter 4) ensures responsiveness to technological advancements and user preferences, collectively achieving a user-centric experience enhancement (Chapter 5). Thus, the methodologies and models introduced throughout this thesis collectively advance the theoretical understanding and practical applications of immersive Internet environments. Additionally, I explore potential future research directions, bringing insights for ongoing innovation and research in this dynamic and evolving domain. Doctor of Philosophy 2024-05-09T00:01:50Z 2024-05-09T00:01:50Z 2024 Thesis-Doctor of Philosophy Du, H. (2024). Semantic communications and generative artificial intelligence for user-centric metaverse services. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/175895 https://hdl.handle.net/10356/175895 10.32657/10356/175895 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University