Automating feature engineering in machine Learning
Feature engineering is a vital part of machine learning that transforms massive raw data into the applicable feature set, and many algorithms of feature engineering have been proposed to promote the efficiency and accuracy of this process. Moreover, automated feature engineering has also been resear...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/141857 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-141857 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1418572023-07-07T17:46:17Z Automating feature engineering in machine Learning Zhu, Haoyu Mao Kezhi School of Electrical and Electronic Engineering Agency for Science, Technology and Research (A*STAR) Yang Feng EKZMao@ntu.edu.sg Engineering::Electrical and electronic engineering Feature engineering is a vital part of machine learning that transforms massive raw data into the applicable feature set, and many algorithms of feature engineering have been proposed to promote the efficiency and accuracy of this process. Moreover, automated feature engineering has also been researched. For example, the AutoLearn feature learning algorithm automatedly generates new features by the linear relationship among features and then performs feature selection process on the new feature set [1]. This project is focusing on feature engineering, specifically feature selection, to develop a unified function that integrates different types of feature selection methods for the automation by hyperparameter optimization. The goals of this project are to utilize a unified function that can implement all types of feature selection algorithms and to automate this process as many hyperparameters could be tuned automatically. And explorations on feature selection algorithms and classification models were carried for designing the unified function. Automated feature selection was also tested on wrapper methods by utilizing the Bayesian optimization method. As the deliverable of this project, the unified function was capable of implementing any kind of feature selection algorithm and outputting feature set that potentially can improve the classification performance. Experiments using the unified function on different datasets showed the potentials to eradicate manual efforts in selecting a subset of useful features. Bachelor of Engineering (Electrical and Electronic Engineering) 2020-06-11T05:24:30Z 2020-06-11T05:24:30Z 2020 Final Year Project (FYP) https://hdl.handle.net/10356/141857 en B1083-191 application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering |
spellingShingle |
Engineering::Electrical and electronic engineering Zhu, Haoyu Automating feature engineering in machine Learning |
description |
Feature engineering is a vital part of machine learning that transforms massive raw data into the applicable feature set, and many algorithms of feature engineering have been proposed to promote the efficiency and accuracy of this process. Moreover, automated feature engineering has also been researched. For example, the AutoLearn feature learning algorithm automatedly generates new features by the linear relationship among features and then performs feature selection process on the new feature set [1]. This project is focusing on feature engineering, specifically feature selection, to develop a unified function that integrates different types of feature selection methods for the automation by hyperparameter optimization. The goals of this project are to utilize a unified function that can implement all types of feature selection algorithms and to automate this process as many hyperparameters could be tuned automatically. And explorations on feature selection algorithms and classification models were carried for designing the unified function. Automated feature selection was also tested on wrapper methods by utilizing the Bayesian optimization method. As the deliverable of this project, the unified function was capable of implementing any kind of feature selection algorithm and outputting feature set that potentially can improve the classification performance. Experiments using the unified function on different datasets showed the potentials to eradicate manual efforts in selecting a subset of useful features. |
author2 |
Mao Kezhi |
author_facet |
Mao Kezhi Zhu, Haoyu |
format |
Final Year Project |
author |
Zhu, Haoyu |
author_sort |
Zhu, Haoyu |
title |
Automating feature engineering in machine Learning |
title_short |
Automating feature engineering in machine Learning |
title_full |
Automating feature engineering in machine Learning |
title_fullStr |
Automating feature engineering in machine Learning |
title_full_unstemmed |
Automating feature engineering in machine Learning |
title_sort |
automating feature engineering in machine learning |
publisher |
Nanyang Technological University |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/141857 |
_version_ |
1772828743714209792 |