A heuristic data reduction approach for associative classification rule hiding

When data are to be shared between business partners, there could be some sensitive patterns which should not be disclosed to the other parties. On the other hand, the "quality" of the data must also be preserved. This creates an interesting question: how can we maintain the shared data th...

Full description

Saved in:

Bibliographic Details
Main Authors:	Natwichai J., Sun X., Li X.
Format:	Conference or Workshop Item
Language:	English
Published:	2014
Online Access:	http://www.scopus.com/inward/record.url?eid=2-s2.0-58349098012&partnerID=40&md5=05f62c50be0cdb79ab196f85bf2f52ea http://cmuir.cmu.ac.th/handle/6653943832/1367
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Chiang Mai University
Language:	English

id	th-cmuir.6653943832-1367
record_format	dspace
spelling	th-cmuir.6653943832-13672014-08-29T09:29:13Z A heuristic data reduction approach for associative classification rule hiding Natwichai J. Sun X. Li X. When data are to be shared between business partners, there could be some sensitive patterns which should not be disclosed to the other parties. On the other hand, the "quality" of the data must also be preserved. This creates an interesting question: how can we maintain the shared data that are guaranteed to have the quality, and the certain types of sensitive patterns be removed or "hidden"? In this paper, we address such the problem of sensitive classification rule hiding by using data reduction approach, i.e. removing the whole selected tuples in the given dataset. We focus on a specific type of classification rules, i.e. associative classification rules. In our context, a sensitive rule is hidden when its support falls below a minimal support threshold. Meanwhile, the impact on the data quality of the dataset is represented in term of a number of false-dropped rules, and a number of ghost rules. We present a few observations on the data quality with regard to the data reduction processes. From the observations, we can represent the impact by each reduction precisely without any re-applying the classification algorithm. Subsequently, we propose a heuristic algorithm to hide the sensitive rules based on the observations. Experimental results are presented to show the effectiveness and the efficiency of the proposed algorithm. © 2008 Springer Berlin Heidelberg. 2014-08-29T09:29:13Z 2014-08-29T09:29:13Z 2008 Conference Paper 354089196X; 9783540891963 03029743 10.1007/978-3-540-89197-0_16 75109 http://www.scopus.com/inward/record.url?eid=2-s2.0-58349098012&partnerID=40&md5=05f62c50be0cdb79ab196f85bf2f52ea http://cmuir.cmu.ac.th/handle/6653943832/1367 English
institution	Chiang Mai University
building	Chiang Mai University Library
country	Thailand
collection	CMU Intellectual Repository
language	English
description	When data are to be shared between business partners, there could be some sensitive patterns which should not be disclosed to the other parties. On the other hand, the "quality" of the data must also be preserved. This creates an interesting question: how can we maintain the shared data that are guaranteed to have the quality, and the certain types of sensitive patterns be removed or "hidden"? In this paper, we address such the problem of sensitive classification rule hiding by using data reduction approach, i.e. removing the whole selected tuples in the given dataset. We focus on a specific type of classification rules, i.e. associative classification rules. In our context, a sensitive rule is hidden when its support falls below a minimal support threshold. Meanwhile, the impact on the data quality of the dataset is represented in term of a number of false-dropped rules, and a number of ghost rules. We present a few observations on the data quality with regard to the data reduction processes. From the observations, we can represent the impact by each reduction precisely without any re-applying the classification algorithm. Subsequently, we propose a heuristic algorithm to hide the sensitive rules based on the observations. Experimental results are presented to show the effectiveness and the efficiency of the proposed algorithm. © 2008 Springer Berlin Heidelberg.
format	Conference or Workshop Item
author	Natwichai J. Sun X. Li X.
spellingShingle	Natwichai J. Sun X. Li X. A heuristic data reduction approach for associative classification rule hiding
author_facet	Natwichai J. Sun X. Li X.
author_sort	Natwichai J.
title	A heuristic data reduction approach for associative classification rule hiding
title_short	A heuristic data reduction approach for associative classification rule hiding
title_full	A heuristic data reduction approach for associative classification rule hiding
title_fullStr	A heuristic data reduction approach for associative classification rule hiding
title_full_unstemmed	A heuristic data reduction approach for associative classification rule hiding
title_sort	heuristic data reduction approach for associative classification rule hiding
publishDate	2014
url	http://www.scopus.com/inward/record.url?eid=2-s2.0-58349098012&partnerID=40&md5=05f62c50be0cdb79ab196f85bf2f52ea http://cmuir.cmu.ac.th/handle/6653943832/1367
_version_	1681419657400025088

A heuristic data reduction approach for associative classification rule hiding

Similar Items