From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models
In this thesis, I delve into the realm of noise and information, exploring the application and capabilities of randomized neural networks in discriminative tasks, as well as the utilization of diffusion models in generative tasks. I begin by investigating the inherent randomness in neural networks,...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Doctor of Philosophy |
Language: | English |
Published: |
Nanyang Technological University
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/177388 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-177388 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1773882024-06-03T06:51:19Z From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models Hu, Minghui Arokiaswami Alphones Ponnuthurai Nagaratnam Suganthan School of Electrical and Electronic Engineering EAlphones@ntu.edu.sg, EPNSugan@ntu.edu.sg Computer and Information Science Randomized neural networks Diffusion models RVFL Generative models In this thesis, I delve into the realm of noise and information, exploring the application and capabilities of randomized neural networks in discriminative tasks, as well as the utilization of diffusion models in generative tasks. I begin by investigating the inherent randomness in neural networks, and how this randomness can be harnessed to perform discriminative tasks with high accuracy. I then transition to the domain of generative tasks, where I employ diffusion models to generate high-quality data from noise. The primary innovations include: 1. Part I: Randomized Neural Networks for Discriminative Tasks - The introduction of an unsupervised learning approach and self-distillation for Randomised neural networks, which has improved the efficiency of utilizing limited data resources and has also enhanced the overall capabilities of the model. - The development of an ensemble deep RVFL network for regression tasks, incorporating techniques such as boosting factors, skip connections, and an ensemble scheme for improved predictive power. - More structures for Randomised Neural Networks, including Automated Layer-wise Solution, Adaptive Ensemble, Deep Reservoir variants and Noise Elimination. Our broader objective is to extend the utility of RVFL networks across various domains and applications. 2. Part II: Diffusion Models for Generative Tasks - The design of a Vector Quantized Discrete Diffusion Model (VQ-DDM) for efficient and high-fidelity image generation, which employs a two-stage process involving discrete VAE and diffusion model for latent code distribution fitting. - The introduction of a Unified Discrete Diffusion model (UniD3) for simultaneous vision-language generation, which constructs a joint probability distribution by mixing discrete image and text tokens. - The proposal of methods to improve the controllability of text-conditional diffusion models, including a Generalized ControlNet for multi-modal input and a plug-and-play module for low frequence control. The findings of this thesis demonstrate the potential of randomised neural networks and diffusion models in handling complex machine learning tasks, offering new insights into the interplay between noise and information. The innovative approaches presented in this thesis open up new research directions and have the potential to significantly impact the field of machine learning. Doctor of Philosophy 2024-05-24T12:52:02Z 2024-05-24T12:52:02Z 2024 Thesis-Doctor of Philosophy Hu, M. (2024). From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/177388 https://hdl.handle.net/10356/177388 10.32657/10356/177388 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Computer and Information Science Randomized neural networks Diffusion models RVFL Generative models |
spellingShingle |
Computer and Information Science Randomized neural networks Diffusion models RVFL Generative models Hu, Minghui From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models |
description |
In this thesis, I delve into the realm of noise and information, exploring the application and capabilities of randomized neural networks in discriminative tasks, as well as the utilization of diffusion models in generative tasks.
I begin by investigating the inherent randomness in neural networks, and how this randomness can be harnessed to perform discriminative tasks with high accuracy. I then transition to the domain of generative tasks, where I employ diffusion models to generate high-quality data from noise.
The primary innovations include:
1. Part I: Randomized Neural Networks for Discriminative Tasks
- The introduction of an unsupervised learning approach and self-distillation for Randomised neural networks, which has improved the efficiency of utilizing limited data resources and has also enhanced the overall capabilities of the model.
- The development of an ensemble deep RVFL network for regression tasks, incorporating techniques such as boosting factors, skip connections, and an ensemble scheme for improved predictive power.
- More structures for Randomised Neural Networks, including Automated Layer-wise Solution, Adaptive Ensemble, Deep Reservoir variants and Noise Elimination. Our broader objective is to extend the utility of RVFL networks across various domains and applications.
2. Part II: Diffusion Models for Generative Tasks
- The design of a Vector Quantized Discrete Diffusion Model (VQ-DDM) for efficient and high-fidelity image generation, which employs a two-stage process involving discrete VAE and diffusion model for latent code distribution fitting.
- The introduction of a Unified Discrete Diffusion model (UniD3) for simultaneous vision-language generation, which constructs a joint probability distribution by mixing discrete image and text tokens.
- The proposal of methods to improve the controllability of text-conditional diffusion models, including a Generalized ControlNet for multi-modal input and a plug-and-play module for low frequence control.
The findings of this thesis demonstrate the potential of randomised neural networks and diffusion models in handling complex machine learning tasks, offering new insights into the interplay between noise and information. The innovative approaches presented in this thesis open up new research directions and have the potential to significantly impact the field of machine learning. |
author2 |
Arokiaswami Alphones |
author_facet |
Arokiaswami Alphones Hu, Minghui |
format |
Thesis-Doctor of Philosophy |
author |
Hu, Minghui |
author_sort |
Hu, Minghui |
title |
From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models |
title_short |
From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models |
title_full |
From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models |
title_fullStr |
From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models |
title_full_unstemmed |
From noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models |
title_sort |
from noise to information: discriminative tasks based on randomized neural networks and generative tasks based on diffusion models |
publisher |
Nanyang Technological University |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/177388 |
_version_ |
1806059866253426688 |