The study of Rashomon effects on machine learning : a case study on breast cancer

The Rashomon effect is a theory that suggests the presence of multiple uncorrelated observations and explanations that can be made for a single observation. This theory has been translated into a popular machine learning method: Random Forests which uses bootstrapping (bagging) algorithms to create...

Full description

Saved in:
Bibliographic Details
Main Author: Wee, Yu Hui
Other Authors: Goh Wen Bin Wilson
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/150018
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-150018
record_format dspace
spelling sg-ntu-dr.10356-1500182023-02-28T18:08:14Z The study of Rashomon effects on machine learning : a case study on breast cancer Wee, Yu Hui Goh Wen Bin Wilson School of Biological Sciences wilsongoh@ntu.edu.sg Science::Biological sciences::Genetics The Rashomon effect is a theory that suggests the presence of multiple uncorrelated observations and explanations that can be made for a single observation. This theory has been translated into a popular machine learning method: Random Forests which uses bootstrapping (bagging) algorithms to create a set of uncorrelated decision trees that together make the decision (prediction) of the final result. In this study, we will be using 3 ER breast cancer datasets as a case study and we look at the results of the selection of each individual tree in the forest using the standard random forest algorithms and when bootstrapping of the attributes was removed. We found that most forests converged into a few highly correlate gene signatures which dominates the prediction and masks the errors of non-accurate models. Besides, because the random forest algorithm can generate highly accurate with a group of and non-predictive signatures, we need to be careful when using random forest machine models for prediction in the field of cancer biology. Bachelor of Science in Biological Sciences 2021-06-11T06:20:02Z 2021-06-11T06:20:02Z 2021 Final Year Project (FYP) Wee, Y. H. (2021). The study of Rashomon effects on machine learning : a case study on breast cancer. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/150018 https://hdl.handle.net/10356/150018 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Science::Biological sciences::Genetics
spellingShingle Science::Biological sciences::Genetics
Wee, Yu Hui
The study of Rashomon effects on machine learning : a case study on breast cancer
description The Rashomon effect is a theory that suggests the presence of multiple uncorrelated observations and explanations that can be made for a single observation. This theory has been translated into a popular machine learning method: Random Forests which uses bootstrapping (bagging) algorithms to create a set of uncorrelated decision trees that together make the decision (prediction) of the final result. In this study, we will be using 3 ER breast cancer datasets as a case study and we look at the results of the selection of each individual tree in the forest using the standard random forest algorithms and when bootstrapping of the attributes was removed. We found that most forests converged into a few highly correlate gene signatures which dominates the prediction and masks the errors of non-accurate models. Besides, because the random forest algorithm can generate highly accurate with a group of and non-predictive signatures, we need to be careful when using random forest machine models for prediction in the field of cancer biology.
author2 Goh Wen Bin Wilson
author_facet Goh Wen Bin Wilson
Wee, Yu Hui
format Final Year Project
author Wee, Yu Hui
author_sort Wee, Yu Hui
title The study of Rashomon effects on machine learning : a case study on breast cancer
title_short The study of Rashomon effects on machine learning : a case study on breast cancer
title_full The study of Rashomon effects on machine learning : a case study on breast cancer
title_fullStr The study of Rashomon effects on machine learning : a case study on breast cancer
title_full_unstemmed The study of Rashomon effects on machine learning : a case study on breast cancer
title_sort study of rashomon effects on machine learning : a case study on breast cancer
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/150018
_version_ 1759858172694102016