Probabilistic Value Selection for Space Efficient Model

An alternative to current mainstream preprocessing methods is proposed: Value Selection (VS). Unlike the existing methods such as feature selection that removes features and instance selection that eliminates instances, value selection eliminates the values (with respect to each feature) in the data...

Full description

Saved in:
Bibliographic Details
Main Authors: NJOO, Gunarto Sindoro, ZHENG, Baihua, HSU, Kuo-Wei, PENG, Wen-Chih
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2020
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/5264
https://ink.library.smu.edu.sg/context/sis_research/article/6267/viewcontent/6._Probabilistic_Value_Selection_for_Space_Efficient__IEEE_MDM2020_.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-6267
record_format dspace
spelling sg-smu-ink.sis_research-62672020-10-29T09:59:51Z Probabilistic Value Selection for Space Efficient Model NJOO, Gunarto Sindoro ZHENG, Baihua HSU, Kuo-Wei PENG, Wen-Chih An alternative to current mainstream preprocessing methods is proposed: Value Selection (VS). Unlike the existing methods such as feature selection that removes features and instance selection that eliminates instances, value selection eliminates the values (with respect to each feature) in the dataset with two purposes: reducing the model size and preserving its accuracy. Two probabilistic methods based on information theory's metric are proposed: PVS and P + VS. Extensive experiments on the benchmark datasets with various sizes are elaborated. Those results are compared with the existing preprocessing methods such as feature selection, feature transformation, and instance selection methods. Experiment results show that value selection can achieve the balance between accuracy and model size reduction. 2020-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5264 info:doi/10.1109/MDM48529.2020.00037 https://ink.library.smu.edu.sg/context/sis_research/article/6267/viewcontent/6._Probabilistic_Value_Selection_for_Space_Efficient__IEEE_MDM2020_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University preprocessing data mining value selection model size reduction entropy information theory Databases and Information Systems Theory and Algorithms
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic preprocessing
data mining
value selection
model size reduction
entropy
information theory
Databases and Information Systems
Theory and Algorithms
spellingShingle preprocessing
data mining
value selection
model size reduction
entropy
information theory
Databases and Information Systems
Theory and Algorithms
NJOO, Gunarto Sindoro
ZHENG, Baihua
HSU, Kuo-Wei
PENG, Wen-Chih
Probabilistic Value Selection for Space Efficient Model
description An alternative to current mainstream preprocessing methods is proposed: Value Selection (VS). Unlike the existing methods such as feature selection that removes features and instance selection that eliminates instances, value selection eliminates the values (with respect to each feature) in the dataset with two purposes: reducing the model size and preserving its accuracy. Two probabilistic methods based on information theory's metric are proposed: PVS and P + VS. Extensive experiments on the benchmark datasets with various sizes are elaborated. Those results are compared with the existing preprocessing methods such as feature selection, feature transformation, and instance selection methods. Experiment results show that value selection can achieve the balance between accuracy and model size reduction.
format text
author NJOO, Gunarto Sindoro
ZHENG, Baihua
HSU, Kuo-Wei
PENG, Wen-Chih
author_facet NJOO, Gunarto Sindoro
ZHENG, Baihua
HSU, Kuo-Wei
PENG, Wen-Chih
author_sort NJOO, Gunarto Sindoro
title Probabilistic Value Selection for Space Efficient Model
title_short Probabilistic Value Selection for Space Efficient Model
title_full Probabilistic Value Selection for Space Efficient Model
title_fullStr Probabilistic Value Selection for Space Efficient Model
title_full_unstemmed Probabilistic Value Selection for Space Efficient Model
title_sort probabilistic value selection for space efficient model
publisher Institutional Knowledge at Singapore Management University
publishDate 2020
url https://ink.library.smu.edu.sg/sis_research/5264
https://ink.library.smu.edu.sg/context/sis_research/article/6267/viewcontent/6._Probabilistic_Value_Selection_for_Space_Efficient__IEEE_MDM2020_.pdf
_version_ 1770575364701028352