Automating feature engineering in machine Learning

Feature engineering is a vital part of machine learning that transforms massive raw data into the applicable feature set, and many algorithms of feature engineering have been proposed to promote the efficiency and accuracy of this process. Moreover, automated feature engineering has also been resear...

Full description

Saved in:
Bibliographic Details
Main Author: Zhu, Haoyu
Other Authors: Mao Kezhi
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2020
Subjects:
Online Access:https://hdl.handle.net/10356/141857
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-141857
record_format dspace
spelling sg-ntu-dr.10356-1418572023-07-07T17:46:17Z Automating feature engineering in machine Learning Zhu, Haoyu Mao Kezhi School of Electrical and Electronic Engineering Agency for Science, Technology and Research (A*STAR) Yang Feng EKZMao@ntu.edu.sg Engineering::Electrical and electronic engineering Feature engineering is a vital part of machine learning that transforms massive raw data into the applicable feature set, and many algorithms of feature engineering have been proposed to promote the efficiency and accuracy of this process. Moreover, automated feature engineering has also been researched. For example, the AutoLearn feature learning algorithm automatedly generates new features by the linear relationship among features and then performs feature selection process on the new feature set [1]. This project is focusing on feature engineering, specifically feature selection, to develop a unified function that integrates different types of feature selection methods for the automation by hyperparameter optimization. The goals of this project are to utilize a unified function that can implement all types of feature selection algorithms and to automate this process as many hyperparameters could be tuned automatically. And explorations on feature selection algorithms and classification models were carried for designing the unified function. Automated feature selection was also tested on wrapper methods by utilizing the Bayesian optimization method. As the deliverable of this project, the unified function was capable of implementing any kind of feature selection algorithm and outputting feature set that potentially can improve the classification performance. Experiments using the unified function on different datasets showed the potentials to eradicate manual efforts in selecting a subset of useful features. Bachelor of Engineering (Electrical and Electronic Engineering) 2020-06-11T05:24:30Z 2020-06-11T05:24:30Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/141857 en B1083-191 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Electrical and electronic engineering
spellingShingle Engineering::Electrical and electronic engineering
Zhu, Haoyu
Automating feature engineering in machine Learning
description Feature engineering is a vital part of machine learning that transforms massive raw data into the applicable feature set, and many algorithms of feature engineering have been proposed to promote the efficiency and accuracy of this process. Moreover, automated feature engineering has also been researched. For example, the AutoLearn feature learning algorithm automatedly generates new features by the linear relationship among features and then performs feature selection process on the new feature set [1]. This project is focusing on feature engineering, specifically feature selection, to develop a unified function that integrates different types of feature selection methods for the automation by hyperparameter optimization. The goals of this project are to utilize a unified function that can implement all types of feature selection algorithms and to automate this process as many hyperparameters could be tuned automatically. And explorations on feature selection algorithms and classification models were carried for designing the unified function. Automated feature selection was also tested on wrapper methods by utilizing the Bayesian optimization method. As the deliverable of this project, the unified function was capable of implementing any kind of feature selection algorithm and outputting feature set that potentially can improve the classification performance. Experiments using the unified function on different datasets showed the potentials to eradicate manual efforts in selecting a subset of useful features.
author2 Mao Kezhi
author_facet Mao Kezhi
Zhu, Haoyu
format Final Year Project
author Zhu, Haoyu
author_sort Zhu, Haoyu
title Automating feature engineering in machine Learning
title_short Automating feature engineering in machine Learning
title_full Automating feature engineering in machine Learning
title_fullStr Automating feature engineering in machine Learning
title_full_unstemmed Automating feature engineering in machine Learning
title_sort automating feature engineering in machine learning
publisher Nanyang Technological University
publishDate 2020
url https://hdl.handle.net/10356/141857
_version_ 1772828743714209792