From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models

In this thesis, I delve into the realm of noise and information, exploring the application and capabilities of randomized neural networks in discriminative tasks, as well as the utilization of diffusion models in generative tasks. I begin by investigating the inherent randomness in neural networks,...

Full description

Saved in:
Bibliographic Details
Main Author: Hu, Minghui
Other Authors: Arokiaswami Alphones
Format: Thesis-Doctor of Philosophy
Language:English
Published: Nanyang Technological University 2024
Subjects:
Online Access:https://hdl.handle.net/10356/177388
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-177388
record_format dspace
spelling sg-ntu-dr.10356-1773882024-06-03T06:51:19Z From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models Hu, Minghui Arokiaswami Alphones Ponnuthurai Nagaratnam Suganthan School of Electrical and Electronic Engineering EAlphones@ntu.edu.sg, EPNSugan@ntu.edu.sg Computer and Information Science Randomized neural networks Diffusion models RVFL Generative models In this thesis, I delve into the realm of noise and information, exploring the application and capabilities of randomized neural networks in discriminative tasks, as well as the utilization of diffusion models in generative tasks. I begin by investigating the inherent randomness in neural networks, and how this randomness can be harnessed to perform discriminative tasks with high accuracy. I then transition to the domain of generative tasks, where I employ diffusion models to generate high-quality data from noise. The primary innovations include: 1. Part I: Randomized Neural Networks for Discriminative Tasks - The introduction of an unsupervised learning approach and self-distillation for Randomised neural networks, which has improved the efficiency of utilizing limited data resources and has also enhanced the overall capabilities of the model. - The development of an ensemble deep RVFL network for regression tasks, incorporating techniques such as boosting factors, skip connections, and an ensemble scheme for improved predictive power. - More structures for Randomised Neural Networks, including Automated Layer-wise Solution, Adaptive Ensemble, Deep Reservoir variants and Noise Elimination. Our broader objective is to extend the utility of RVFL networks across various domains and applications. 2. Part II: Diffusion Models for Generative Tasks - The design of a Vector Quantized Discrete Diffusion Model (VQ-DDM) for efficient and high-fidelity image generation, which employs a two-stage process involving discrete VAE and diffusion model for latent code distribution fitting. - The introduction of a Unified Discrete Diffusion model (UniD3) for simultaneous vision-language generation, which constructs a joint probability distribution by mixing discrete image and text tokens. - The proposal of methods to improve the controllability of text-conditional diffusion models, including a Generalized ControlNet for multi-modal input and a plug-and-play module for low frequence control. The findings of this thesis demonstrate the potential of randomised neural networks and diffusion models in handling complex machine learning tasks, offering new insights into the interplay between noise and information. The innovative approaches presented in this thesis open up new research directions and have the potential to significantly impact the field of machine learning. Doctor of Philosophy 2024-05-24T12:52:02Z 2024-05-24T12:52:02Z 2024 Thesis-Doctor of Philosophy Hu, M. (2024). From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/177388 https://hdl.handle.net/10356/177388 10.32657/10356/177388 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Computer and Information Science
Randomized neural networks
Diffusion models
RVFL
Generative models
spellingShingle Computer and Information Science
Randomized neural networks
Diffusion models
RVFL
Generative models
Hu, Minghui
From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models
description In this thesis, I delve into the realm of noise and information, exploring the application and capabilities of randomized neural networks in discriminative tasks, as well as the utilization of diffusion models in generative tasks. I begin by investigating the inherent randomness in neural networks, and how this randomness can be harnessed to perform discriminative tasks with high accuracy. I then transition to the domain of generative tasks, where I employ diffusion models to generate high-quality data from noise. The primary innovations include: 1. Part I: Randomized Neural Networks for Discriminative Tasks - The introduction of an unsupervised learning approach and self-distillation for Randomised neural networks, which has improved the efficiency of utilizing limited data resources and has also enhanced the overall capabilities of the model. - The development of an ensemble deep RVFL network for regression tasks, incorporating techniques such as boosting factors, skip connections, and an ensemble scheme for improved predictive power. - More structures for Randomised Neural Networks, including Automated Layer-wise Solution, Adaptive Ensemble, Deep Reservoir variants and Noise Elimination. Our broader objective is to extend the utility of RVFL networks across various domains and applications. 2. Part II: Diffusion Models for Generative Tasks - The design of a Vector Quantized Discrete Diffusion Model (VQ-DDM) for efficient and high-fidelity image generation, which employs a two-stage process involving discrete VAE and diffusion model for latent code distribution fitting. - The introduction of a Unified Discrete Diffusion model (UniD3) for simultaneous vision-language generation, which constructs a joint probability distribution by mixing discrete image and text tokens. - The proposal of methods to improve the controllability of text-conditional diffusion models, including a Generalized ControlNet for multi-modal input and a plug-and-play module for low frequence control. The findings of this thesis demonstrate the potential of randomised neural networks and diffusion models in handling complex machine learning tasks, offering new insights into the interplay between noise and information. The innovative approaches presented in this thesis open up new research directions and have the potential to significantly impact the field of machine learning.
author2 Arokiaswami Alphones
author_facet Arokiaswami Alphones
Hu, Minghui
format Thesis-Doctor of Philosophy
author Hu, Minghui
author_sort Hu, Minghui
title From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models
title_short From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models
title_full From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models
title_fullStr From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models
title_full_unstemmed From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models
title_sort from noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models
publisher Nanyang Technological University
publishDate 2024
url https://hdl.handle.net/10356/177388
_version_ 1806059866253426688