Causal view of generalization
Causal reasoning, an essential cognitive ability in human intelligence, allows us to generalize past learning to solve present problems. Unfortunately, while machine learning prospers over the past decade by training powerful deep neural networks (DNN) on massive data, it still lacks the generalizat...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Doctor of Philosophy |
Language: | English |
Published: |
Nanyang Technological University
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/172269 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-172269 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1722692024-01-04T06:32:51Z Causal view of generalization Yue, Zhongqi Hanwang Zhang School of Computer Science and Engineering hanwangzhang@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Causal reasoning, an essential cognitive ability in human intelligence, allows us to generalize past learning to solve present problems. Unfortunately, while machine learning prospers over the past decade by training powerful deep neural networks (DNN) on massive data, it still lacks the generalization ability like us humans. Inspired by the important role of causality in human generalization, we take a causal view of machine generalization. We reveal that the spurious correlation in the training data is a confounder that prevents generalization, which can only be eliminated by causal intervention. In this thesis, we study three categories of causal intervention and contribute practical implementations to improve generalization: 1) backdoor adjustment, 2) invariant learning, and 3) learning disentangled representation. The proposed practical implementations are extensively evaluated by standard benchmarks and demonstrate state-of-the-art generalization performance in Few-Shot Learning, Unsupervised Domain Adaptation, Semi-Supervised Learning, Zero-Shot Learning, Open-Set Recognition, and Unsupervised Representation Learning. Doctor of Philosophy 2023-12-04T09:00:12Z 2023-12-04T09:00:12Z 2023 Thesis-Doctor of Philosophy Yue, Z. (2023). Causal view of generalization. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/172269 https://hdl.handle.net/10356/172269 10.32657/10356/172269 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Yue, Zhongqi Causal view of generalization |
description |
Causal reasoning, an essential cognitive ability in human intelligence, allows us to generalize past learning to solve present problems. Unfortunately, while machine learning prospers over the past decade by training powerful deep neural networks (DNN) on massive data, it still lacks the generalization ability like us humans. Inspired by the important role of causality in human generalization, we take a causal view of machine generalization. We reveal that the spurious correlation in the training data is a confounder that prevents generalization, which can only be eliminated by causal intervention. In this thesis, we study three categories of causal intervention and contribute practical implementations to improve generalization: 1) backdoor adjustment, 2) invariant learning, and 3) learning disentangled representation. The proposed practical implementations are extensively evaluated by standard benchmarks and demonstrate state-of-the-art generalization performance in Few-Shot Learning, Unsupervised Domain Adaptation, Semi-Supervised Learning, Zero-Shot Learning, Open-Set Recognition, and Unsupervised Representation Learning. |
author2 |
Hanwang Zhang |
author_facet |
Hanwang Zhang Yue, Zhongqi |
format |
Thesis-Doctor of Philosophy |
author |
Yue, Zhongqi |
author_sort |
Yue, Zhongqi |
title |
Causal view of generalization |
title_short |
Causal view of generalization |
title_full |
Causal view of generalization |
title_fullStr |
Causal view of generalization |
title_full_unstemmed |
Causal view of generalization |
title_sort |
causal view of generalization |
publisher |
Nanyang Technological University |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/172269 |
_version_ |
1787590719555239936 |