Probabilistic Value Selection for Space Efficient Model
An alternative to current mainstream preprocessing methods is proposed: Value Selection (VS). Unlike the existing methods such as feature selection that removes features and instance selection that eliminates instances, value selection eliminates the values (with respect to each feature) in the data...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2020
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/5264 https://ink.library.smu.edu.sg/context/sis_research/article/6267/viewcontent/6._Probabilistic_Value_Selection_for_Space_Efficient__IEEE_MDM2020_.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-6267 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-62672020-10-29T09:59:51Z Probabilistic Value Selection for Space Efficient Model NJOO, Gunarto Sindoro ZHENG, Baihua HSU, Kuo-Wei PENG, Wen-Chih An alternative to current mainstream preprocessing methods is proposed: Value Selection (VS). Unlike the existing methods such as feature selection that removes features and instance selection that eliminates instances, value selection eliminates the values (with respect to each feature) in the dataset with two purposes: reducing the model size and preserving its accuracy. Two probabilistic methods based on information theory's metric are proposed: PVS and P + VS. Extensive experiments on the benchmark datasets with various sizes are elaborated. Those results are compared with the existing preprocessing methods such as feature selection, feature transformation, and instance selection methods. Experiment results show that value selection can achieve the balance between accuracy and model size reduction. 2020-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5264 info:doi/10.1109/MDM48529.2020.00037 https://ink.library.smu.edu.sg/context/sis_research/article/6267/viewcontent/6._Probabilistic_Value_Selection_for_Space_Efficient__IEEE_MDM2020_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University preprocessing data mining value selection model size reduction entropy information theory Databases and Information Systems Theory and Algorithms |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
preprocessing data mining value selection model size reduction entropy information theory Databases and Information Systems Theory and Algorithms |
spellingShingle |
preprocessing data mining value selection model size reduction entropy information theory Databases and Information Systems Theory and Algorithms NJOO, Gunarto Sindoro ZHENG, Baihua HSU, Kuo-Wei PENG, Wen-Chih Probabilistic Value Selection for Space Efficient Model |
description |
An alternative to current mainstream preprocessing methods is proposed: Value Selection (VS). Unlike the existing methods such as feature selection that removes features and instance selection that eliminates instances, value selection eliminates the values (with respect to each feature) in the dataset with two purposes: reducing the model size and preserving its accuracy. Two probabilistic methods based on information theory's metric are proposed: PVS and P + VS. Extensive experiments on the benchmark datasets with various sizes are elaborated. Those results are compared with the existing preprocessing methods such as feature selection, feature transformation, and instance selection methods. Experiment results show that value selection can achieve the balance between accuracy and model size reduction. |
format |
text |
author |
NJOO, Gunarto Sindoro ZHENG, Baihua HSU, Kuo-Wei PENG, Wen-Chih |
author_facet |
NJOO, Gunarto Sindoro ZHENG, Baihua HSU, Kuo-Wei PENG, Wen-Chih |
author_sort |
NJOO, Gunarto Sindoro |
title |
Probabilistic Value Selection for Space Efficient Model |
title_short |
Probabilistic Value Selection for Space Efficient Model |
title_full |
Probabilistic Value Selection for Space Efficient Model |
title_fullStr |
Probabilistic Value Selection for Space Efficient Model |
title_full_unstemmed |
Probabilistic Value Selection for Space Efficient Model |
title_sort |
probabilistic value selection for space efficient model |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2020 |
url |
https://ink.library.smu.edu.sg/sis_research/5264 https://ink.library.smu.edu.sg/context/sis_research/article/6267/viewcontent/6._Probabilistic_Value_Selection_for_Space_Efficient__IEEE_MDM2020_.pdf |
_version_ |
1770575364701028352 |