Privacy preservation for associative classification

© 2014 Wiley Periodicals, Inc. Privacy preservation is becoming a critical issue to data-mining processes. In practice, a data transformation process is often needed to preserve privacy. However, data transformation would introduce a data quality issue. In this case, the impact on data quality due t...

Full description

Saved in:
Bibliographic Details
Main Authors: Harnsamut,N., Natwichai,J., Sun,X., Li,X.
Format: Article
Published: Wiley-Blackwell 2015
Subjects:
Online Access:http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=84911003397&origin=inward
http://cmuir.cmu.ac.th/handle/6653943832/39069
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Chiang Mai University
id th-cmuir.6653943832-39069
record_format dspace
spelling th-cmuir.6653943832-390692015-06-16T08:01:27Z Privacy preservation for associative classification Harnsamut,N. Natwichai,J. Sun,X. Li,X. Artificial Intelligence Computational Mathematics © 2014 Wiley Periodicals, Inc. Privacy preservation is becoming a critical issue to data-mining processes. In practice, a data transformation process is often needed to preserve privacy. However, data transformation would introduce a data quality issue. In this case, the impact on data quality due to the data transformation should be estimated and made clear to the user of the data transformation process. In this article, we consider the problem of k-anonymization transformation in associative classification. The privacy preservation and data quality issues are considered in twofold. First, we propose a frequency-based data quality metric to represent the data quality for associative classification. Second, a novel heuristic algorithm, namely minimum classification correction rate transformation, is proposed. The algorithm is guided by the classification correction rate of the given datasets. We validate our proposed metric and algorithm with University of California-Irvine repository datasets. The experiment results have shown that our proposed metric can effectively demonstrate the data quality for associative classification. The results also show that the proposed algorithm is not only efficient but also highly effective. 2015-06-16T08:01:27Z 2015-06-16T08:01:27Z 2014-01-01 Article 08247935 2-s2.0-84911003397 10.1111/coin.12028 http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=84911003397&origin=inward http://cmuir.cmu.ac.th/handle/6653943832/39069 Wiley-Blackwell
institution Chiang Mai University
building Chiang Mai University Library
country Thailand
collection CMU Intellectual Repository
topic Artificial Intelligence
Computational Mathematics
spellingShingle Artificial Intelligence
Computational Mathematics
Harnsamut,N.
Natwichai,J.
Sun,X.
Li,X.
Privacy preservation for associative classification
description © 2014 Wiley Periodicals, Inc. Privacy preservation is becoming a critical issue to data-mining processes. In practice, a data transformation process is often needed to preserve privacy. However, data transformation would introduce a data quality issue. In this case, the impact on data quality due to the data transformation should be estimated and made clear to the user of the data transformation process. In this article, we consider the problem of k-anonymization transformation in associative classification. The privacy preservation and data quality issues are considered in twofold. First, we propose a frequency-based data quality metric to represent the data quality for associative classification. Second, a novel heuristic algorithm, namely minimum classification correction rate transformation, is proposed. The algorithm is guided by the classification correction rate of the given datasets. We validate our proposed metric and algorithm with University of California-Irvine repository datasets. The experiment results have shown that our proposed metric can effectively demonstrate the data quality for associative classification. The results also show that the proposed algorithm is not only efficient but also highly effective.
format Article
author Harnsamut,N.
Natwichai,J.
Sun,X.
Li,X.
author_facet Harnsamut,N.
Natwichai,J.
Sun,X.
Li,X.
author_sort Harnsamut,N.
title Privacy preservation for associative classification
title_short Privacy preservation for associative classification
title_full Privacy preservation for associative classification
title_fullStr Privacy preservation for associative classification
title_full_unstemmed Privacy preservation for associative classification
title_sort privacy preservation for associative classification
publisher Wiley-Blackwell
publishDate 2015
url http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=84911003397&origin=inward
http://cmuir.cmu.ac.th/handle/6653943832/39069
_version_ 1681421587744555008