Data quality in privacy preservation for associative classification

Privacy preserving has become an essential process for any data mining task. In general, data transformation is needed to ensure privacy preservation. Once the privacy is preserved, data quality issue must be addressed, i.e. the impact on data quality should be minimized. In this paper, k-Anonymizat...

Full description

Saved in:

Bibliographic Details
Main Authors:	Harnsamut N., Natwichai J., Sun X., Li X.
Format:	Conference or Workshop Item
Language:	English
Published:	2014
Online Access:	http://www.scopus.com/inward/record.url?eid=2-s2.0-68749105788&partnerID=40&md5=d7ed1e9bef0f79792f8b3a5c5b108993 http://cmuir.cmu.ac.th/handle/6653943832/1370
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Chiang Mai University
Language:	English

id	th-cmuir.6653943832-1370
record_format	dspace
spelling	th-cmuir.6653943832-13702014-08-29T09:29:13Z Data quality in privacy preservation for associative classification Harnsamut N. Natwichai J. Sun X. Li X. Privacy preserving has become an essential process for any data mining task. In general, data transformation is needed to ensure privacy preservation. Once the privacy is preserved, data quality issue must be addressed, i.e. the impact on data quality should be minimized. In this paper, k-Anonymization is considered as the transformation approach for preserving data privacy. In such a context, we discuss the metrics of the data quality in terms of classification, which is one of the most important tasks in data mining. Since different type of classification may use different approach to deliver knowledge, data quality metric for the classification task should be tailored to a certain type of classification. Specifically, we propose a frequency-based data quality metric to represent the data quality of the transformed dataset in the situation that associative classification is to be processed. Subsequently, we validate our proposed metric with experiments. The experiment results have shown that our proposed metric can effectively reflect the data quality for the associative classification problem. © 2008 Springer-Verlag Berlin Heidelberg. 2014-08-29T09:29:13Z 2014-08-29T09:29:13Z 2008 Conference Paper 3540881913; 9783540881919 03029743 10.1007/978-3-540-88192-6-12 77018 http://www.scopus.com/inward/record.url?eid=2-s2.0-68749105788&partnerID=40&md5=d7ed1e9bef0f79792f8b3a5c5b108993 http://cmuir.cmu.ac.th/handle/6653943832/1370 English
institution	Chiang Mai University
building	Chiang Mai University Library
country	Thailand
collection	CMU Intellectual Repository
language	English
description	Privacy preserving has become an essential process for any data mining task. In general, data transformation is needed to ensure privacy preservation. Once the privacy is preserved, data quality issue must be addressed, i.e. the impact on data quality should be minimized. In this paper, k-Anonymization is considered as the transformation approach for preserving data privacy. In such a context, we discuss the metrics of the data quality in terms of classification, which is one of the most important tasks in data mining. Since different type of classification may use different approach to deliver knowledge, data quality metric for the classification task should be tailored to a certain type of classification. Specifically, we propose a frequency-based data quality metric to represent the data quality of the transformed dataset in the situation that associative classification is to be processed. Subsequently, we validate our proposed metric with experiments. The experiment results have shown that our proposed metric can effectively reflect the data quality for the associative classification problem. © 2008 Springer-Verlag Berlin Heidelberg.
format	Conference or Workshop Item
author	Harnsamut N. Natwichai J. Sun X. Li X.
spellingShingle	Harnsamut N. Natwichai J. Sun X. Li X. Data quality in privacy preservation for associative classification
author_facet	Harnsamut N. Natwichai J. Sun X. Li X.
author_sort	Harnsamut N.
title	Data quality in privacy preservation for associative classification
title_short	Data quality in privacy preservation for associative classification
title_full	Data quality in privacy preservation for associative classification
title_fullStr	Data quality in privacy preservation for associative classification
title_full_unstemmed	Data quality in privacy preservation for associative classification
title_sort	data quality in privacy preservation for associative classification
publishDate	2014
url	http://www.scopus.com/inward/record.url?eid=2-s2.0-68749105788&partnerID=40&md5=d7ed1e9bef0f79792f8b3a5c5b108993 http://cmuir.cmu.ac.th/handle/6653943832/1370
_version_	1681419657938993152

Data quality in privacy preservation for associative classification

Similar Items