Causal view of generalization

Causal reasoning, an essential cognitive ability in human intelligence, allows us to generalize past learning to solve present problems. Unfortunately, while machine learning prospers over the past decade by training powerful deep neural networks (DNN) on massive data, it still lacks the generalizat...

Full description

Saved in:

Bibliographic Details
Main Author:	Yue, Zhongqi
Other Authors:	Hanwang Zhang
Format:	Thesis-Doctor of Philosophy
Language:	English
Published:	Nanyang Technological University 2023
Subjects:	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Online Access:	https://hdl.handle.net/10356/172269
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-172269
record_format	dspace
spelling	sg-ntu-dr.10356-1722692024-01-04T06:32:51Z Causal view of generalization Yue, Zhongqi Hanwang Zhang School of Computer Science and Engineering hanwangzhang@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Causal reasoning, an essential cognitive ability in human intelligence, allows us to generalize past learning to solve present problems. Unfortunately, while machine learning prospers over the past decade by training powerful deep neural networks (DNN) on massive data, it still lacks the generalization ability like us humans. Inspired by the important role of causality in human generalization, we take a causal view of machine generalization. We reveal that the spurious correlation in the training data is a confounder that prevents generalization, which can only be eliminated by causal intervention. In this thesis, we study three categories of causal intervention and contribute practical implementations to improve generalization: 1) backdoor adjustment, 2) invariant learning, and 3) learning disentangled representation. The proposed practical implementations are extensively evaluated by standard benchmarks and demonstrate state-of-the-art generalization performance in Few-Shot Learning, Unsupervised Domain Adaptation, Semi-Supervised Learning, Zero-Shot Learning, Open-Set Recognition, and Unsupervised Representation Learning. Doctor of Philosophy 2023-12-04T09:00:12Z 2023-12-04T09:00:12Z 2023 Thesis-Doctor of Philosophy Yue, Z. (2023). Causal view of generalization. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/172269 https://hdl.handle.net/10356/172269 10.32657/10356/172269 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Yue, Zhongqi Causal view of generalization
description	Causal reasoning, an essential cognitive ability in human intelligence, allows us to generalize past learning to solve present problems. Unfortunately, while machine learning prospers over the past decade by training powerful deep neural networks (DNN) on massive data, it still lacks the generalization ability like us humans. Inspired by the important role of causality in human generalization, we take a causal view of machine generalization. We reveal that the spurious correlation in the training data is a confounder that prevents generalization, which can only be eliminated by causal intervention. In this thesis, we study three categories of causal intervention and contribute practical implementations to improve generalization: 1) backdoor adjustment, 2) invariant learning, and 3) learning disentangled representation. The proposed practical implementations are extensively evaluated by standard benchmarks and demonstrate state-of-the-art generalization performance in Few-Shot Learning, Unsupervised Domain Adaptation, Semi-Supervised Learning, Zero-Shot Learning, Open-Set Recognition, and Unsupervised Representation Learning.
author2	Hanwang Zhang
author_facet	Hanwang Zhang Yue, Zhongqi
format	Thesis-Doctor of Philosophy
author	Yue, Zhongqi
author_sort	Yue, Zhongqi
title	Causal view of generalization
title_short	Causal view of generalization
title_full	Causal view of generalization
title_fullStr	Causal view of generalization
title_full_unstemmed	Causal view of generalization
title_sort	causal view of generalization
publisher	Nanyang Technological University
publishDate	2023
url	https://hdl.handle.net/10356/172269
_version_	1787590719555239936

Causal view of generalization

Similar Items