Improving machine learning methods for solving non-stationary conditions based on data availability, time urgency, and types of change

Supervised learning algorithms do not work well when the deployment condition is dissimilar to the training condition. Such non-stationary conditions include covariate shifts and concept shifts. Importance weighted learning (IWL) is used to handle a one-time covariate shift but not frequent shifts a...

Full description

Saved in:

Bibliographic Details
Main Author:	Goh, Chun Fan
Other Authors:	Seet Gim Lee, Gerald
Format:	Thesis-Doctor of Philosophy
Language:	English
Published:	Nanyang Technological University 2021
Subjects:	Engineering::Mechanical engineering::Robots Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Online Access:	https://hdl.handle.net/10356/147041
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-147041
record_format	dspace
spelling	sg-ntu-dr.10356-1470412023-03-11T17:49:06Z Improving machine learning methods for solving non-stationary conditions based on data availability, time urgency, and types of change Goh, Chun Fan Seet Gim Lee, Gerald School of Mechanical and Aerospace Engineering Robotics Research Centre MGLSEET@ntu.edu.sg Engineering::Mechanical engineering::Robots Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Supervised learning algorithms do not work well when the deployment condition is dissimilar to the training condition. Such non-stationary conditions include covariate shifts and concept shifts. Importance weighted learning (IWL) is used to handle a one-time covariate shift but not frequent shifts and concept shifts. While forgetting addresses concept shifts, it is wasteful in discarding previously learned models. To address these shortfalls, this thesis proposes looking into the three stages of supervised learning and devised pre-learning methods, which deal with data and feature selection; in-learning methods, which modify the learning process; and post-learning methods, which modify the prediction process to compensate for shifts in conditions. The ﬁrst in-learning method is a transfer learning-based technique that utilizes a limited amount of test data to train further the prediction model pre-trained on general training data. This technique boosted the accuracy of a vocal emotion recognizer by 10%. For applications that require a timely response, we employed a post-learning strategy in the form of local learning. It handles multiple covariate shifts and improves the prediction accuracy in one vocal emotion recognition instance from 88.8% to 93.2%. Local learning also allows the use of feature augmentation to convert a more diﬃcult concept-shift problem into an easier covariate-shift problem. The resulting controller outperforms PID controllers in water shooting control. When data are abundant, we leverage pre-learning methods such as condition-speciﬁc learning, to avoid non-stationary conditions altogether. Using this technique, we developed a semi-automatic snore labeling software that produces good accuracy (0.93 F1-score) and cuts labeling time from hours to minutes. Alternatively, we use deep learning methods to learn features that are robust to shifts. In our ablation study, we showed that features extracted from very deep networks and recurrent networks result in a more accurate and robust snore classiﬁcation. With the advance of computer simulation, unlimited artiﬁcial data can be generated to better approximate and cover possible test conditions. We tested this idea in teaching a double-hull welding robot to climb down safely from a high wall through reinforcement learning and achieved a 90% success rate. Finally, from these applications, we distilled a method selection guideline based on data availability, time urgency, and type of shift. Doctor of Philosophy 2021-03-19T07:17:16Z 2021-03-19T07:17:16Z 2021 Thesis-Doctor of Philosophy Goh, C. F. (2021). Improving machine learning methods for solving non-stationary conditions based on data availability, time urgency, and types of change. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/147041 https://hdl.handle.net/10356/147041 10.32657/10356/147041 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Mechanical engineering::Robots Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle	Engineering::Mechanical engineering::Robots Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Goh, Chun Fan Improving machine learning methods for solving non-stationary conditions based on data availability, time urgency, and types of change
description	Supervised learning algorithms do not work well when the deployment condition is dissimilar to the training condition. Such non-stationary conditions include covariate shifts and concept shifts. Importance weighted learning (IWL) is used to handle a one-time covariate shift but not frequent shifts and concept shifts. While forgetting addresses concept shifts, it is wasteful in discarding previously learned models. To address these shortfalls, this thesis proposes looking into the three stages of supervised learning and devised pre-learning methods, which deal with data and feature selection; in-learning methods, which modify the learning process; and post-learning methods, which modify the prediction process to compensate for shifts in conditions. The ﬁrst in-learning method is a transfer learning-based technique that utilizes a limited amount of test data to train further the prediction model pre-trained on general training data. This technique boosted the accuracy of a vocal emotion recognizer by 10%. For applications that require a timely response, we employed a post-learning strategy in the form of local learning. It handles multiple covariate shifts and improves the prediction accuracy in one vocal emotion recognition instance from 88.8% to 93.2%. Local learning also allows the use of feature augmentation to convert a more diﬃcult concept-shift problem into an easier covariate-shift problem. The resulting controller outperforms PID controllers in water shooting control. When data are abundant, we leverage pre-learning methods such as condition-speciﬁc learning, to avoid non-stationary conditions altogether. Using this technique, we developed a semi-automatic snore labeling software that produces good accuracy (0.93 F1-score) and cuts labeling time from hours to minutes. Alternatively, we use deep learning methods to learn features that are robust to shifts. In our ablation study, we showed that features extracted from very deep networks and recurrent networks result in a more accurate and robust snore classiﬁcation. With the advance of computer simulation, unlimited artiﬁcial data can be generated to better approximate and cover possible test conditions. We tested this idea in teaching a double-hull welding robot to climb down safely from a high wall through reinforcement learning and achieved a 90% success rate. Finally, from these applications, we distilled a method selection guideline based on data availability, time urgency, and type of shift.
author2	Seet Gim Lee, Gerald
author_facet	Seet Gim Lee, Gerald Goh, Chun Fan
format	Thesis-Doctor of Philosophy
author	Goh, Chun Fan
author_sort	Goh, Chun Fan
title	Improving machine learning methods for solving non-stationary conditions based on data availability, time urgency, and types of change
title_short	Improving machine learning methods for solving non-stationary conditions based on data availability, time urgency, and types of change
title_full	Improving machine learning methods for solving non-stationary conditions based on data availability, time urgency, and types of change
title_fullStr	Improving machine learning methods for solving non-stationary conditions based on data availability, time urgency, and types of change
title_full_unstemmed	Improving machine learning methods for solving non-stationary conditions based on data availability, time urgency, and types of change
title_sort	improving machine learning methods for solving non-stationary conditions based on data availability, time urgency, and types of change
publisher	Nanyang Technological University
publishDate	2021
url	https://hdl.handle.net/10356/147041
_version_	1761781749570863104

Improving machine learning methods for solving non-stationary conditions based on data availability, time urgency, and types of change

Similar Items