Data quality in privacy preservation for associative classification

Privacy preserving has become an essential process for any data mining task. In general, data transformation is needed to ensure privacy preservation. Once the privacy is preserved, data quality issue must be addressed, i.e. the impact on data quality should be minimized. In this paper, k-Anonymizat...

Full description

Saved in:
Bibliographic Details
Main Authors: Nattapon Harnsamut, Juggapong Natwichai, Xingzhi Sun, Xue Li
Format: Book Series
Published: 2018
Subjects:
Online Access:https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=68749105788&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/60280
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Chiang Mai University
id th-cmuir.6653943832-60280
record_format dspace
spelling th-cmuir.6653943832-602802018-09-10T03:44:55Z Data quality in privacy preservation for associative classification Nattapon Harnsamut Juggapong Natwichai Xingzhi Sun Xue Li Computer Science Mathematics Privacy preserving has become an essential process for any data mining task. In general, data transformation is needed to ensure privacy preservation. Once the privacy is preserved, data quality issue must be addressed, i.e. the impact on data quality should be minimized. In this paper, k-Anonymization is considered as the transformation approach for preserving data privacy. In such a context, we discuss the metrics of the data quality in terms of classification, which is one of the most important tasks in data mining. Since different type of classification may use different approach to deliver knowledge, data quality metric for the classification task should be tailored to a certain type of classification. Specifically, we propose a frequency-based data quality metric to represent the data quality of the transformed dataset in the situation that associative classification is to be processed. Subsequently, we validate our proposed metric with experiments. The experiment results have shown that our proposed metric can effectively reflect the data quality for the associative classification problem. © 2008 Springer-Verlag Berlin Heidelberg. 2018-09-10T03:40:32Z 2018-09-10T03:40:32Z 2008-12-01 Book Series 16113349 03029743 2-s2.0-68749105788 10.1007/978-3-540-88192-6-12 https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=68749105788&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/60280
institution Chiang Mai University
building Chiang Mai University Library
country Thailand
collection CMU Intellectual Repository
topic Computer Science
Mathematics
spellingShingle Computer Science
Mathematics
Nattapon Harnsamut
Juggapong Natwichai
Xingzhi Sun
Xue Li
Data quality in privacy preservation for associative classification
description Privacy preserving has become an essential process for any data mining task. In general, data transformation is needed to ensure privacy preservation. Once the privacy is preserved, data quality issue must be addressed, i.e. the impact on data quality should be minimized. In this paper, k-Anonymization is considered as the transformation approach for preserving data privacy. In such a context, we discuss the metrics of the data quality in terms of classification, which is one of the most important tasks in data mining. Since different type of classification may use different approach to deliver knowledge, data quality metric for the classification task should be tailored to a certain type of classification. Specifically, we propose a frequency-based data quality metric to represent the data quality of the transformed dataset in the situation that associative classification is to be processed. Subsequently, we validate our proposed metric with experiments. The experiment results have shown that our proposed metric can effectively reflect the data quality for the associative classification problem. © 2008 Springer-Verlag Berlin Heidelberg.
format Book Series
author Nattapon Harnsamut
Juggapong Natwichai
Xingzhi Sun
Xue Li
author_facet Nattapon Harnsamut
Juggapong Natwichai
Xingzhi Sun
Xue Li
author_sort Nattapon Harnsamut
title Data quality in privacy preservation for associative classification
title_short Data quality in privacy preservation for associative classification
title_full Data quality in privacy preservation for associative classification
title_fullStr Data quality in privacy preservation for associative classification
title_full_unstemmed Data quality in privacy preservation for associative classification
title_sort data quality in privacy preservation for associative classification
publishDate 2018
url https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=68749105788&origin=inward
http://cmuir.cmu.ac.th/jspui/handle/6653943832/60280
_version_ 1681425406709727232