A robust and interpretable feature selection pipeline

A feature selection pipeline that removes redundant and irrelevant features without resulting in a significant drop in performance is investigated in this work. The novel pipeline frameworks consider the combined effect of redundancy minimisation through Principal Feature Analysis (PFA) algorithms...

Full description

Saved in:
Bibliographic Details
Main Author: Krishnan, Nithya
Other Authors: A S Madhukumar
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/148109
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-148109
record_format dspace
spelling sg-ntu-dr.10356-1481092021-04-23T14:23:08Z A robust and interpretable feature selection pipeline Krishnan, Nithya A S Madhukumar School of Computer Science and Engineering ASMadhukumar@ntu.edu.sg Engineering::Computer science and engineering A feature selection pipeline that removes redundant and irrelevant features without resulting in a significant drop in performance is investigated in this work. The novel pipeline frameworks consider the combined effect of redundancy minimisation through Principal Feature Analysis (PFA) algorithms and relevant feature selection through Causality-Based(Causality-based) methods. These independent methods and pipeline frameworks undergo a comprehensive evaluation upon diverse datasets using a variety of evaluation metrics. It is demonstrated that such methods can significantly decrease the number of features while maintaining a less than proportional drop in performance. The pipelines are also built to be interpretable, with the user being able to know which features are removed at each stage of the pipeline and the reasons for doing so. Pipeline frameworks which incorporate Causality-based methods followed by PFA methods are also computationally efficient and do not take a considerable amount of time. These frameworks also improve upon the performance of the independent PFA and Causality-based methods used, providing a promising tool for interpretable and robust feature selection. Bachelor of Engineering Science (Computer Science) 2021-04-23T14:23:08Z 2021-04-23T14:23:08Z 2021 Final Year Project (FYP) Krishnan, N. (2021). A robust and interpretable feature selection pipeline. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/148109 https://hdl.handle.net/10356/148109 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
spellingShingle Engineering::Computer science and engineering
Krishnan, Nithya
A robust and interpretable feature selection pipeline
description A feature selection pipeline that removes redundant and irrelevant features without resulting in a significant drop in performance is investigated in this work. The novel pipeline frameworks consider the combined effect of redundancy minimisation through Principal Feature Analysis (PFA) algorithms and relevant feature selection through Causality-Based(Causality-based) methods. These independent methods and pipeline frameworks undergo a comprehensive evaluation upon diverse datasets using a variety of evaluation metrics. It is demonstrated that such methods can significantly decrease the number of features while maintaining a less than proportional drop in performance. The pipelines are also built to be interpretable, with the user being able to know which features are removed at each stage of the pipeline and the reasons for doing so. Pipeline frameworks which incorporate Causality-based methods followed by PFA methods are also computationally efficient and do not take a considerable amount of time. These frameworks also improve upon the performance of the independent PFA and Causality-based methods used, providing a promising tool for interpretable and robust feature selection.
author2 A S Madhukumar
author_facet A S Madhukumar
Krishnan, Nithya
format Final Year Project
author Krishnan, Nithya
author_sort Krishnan, Nithya
title A robust and interpretable feature selection pipeline
title_short A robust and interpretable feature selection pipeline
title_full A robust and interpretable feature selection pipeline
title_fullStr A robust and interpretable feature selection pipeline
title_full_unstemmed A robust and interpretable feature selection pipeline
title_sort robust and interpretable feature selection pipeline
publisher Nanyang Technological University
publishDate 2021
url https://hdl.handle.net/10356/148109
_version_ 1698713640289435648