An incremental privacy-preservation algorithm for the (k, e)-Anonymous model

© 2014 Elsevier Ltd. All rights reserved. An important issue to be addressed when data are to be published is data privacy. In this paper, the problem of data privacy based on a prominent privacy model, (k; e)-Anonymous, is addressed. Our scenario is that when a new dataset is to be released, there...

Full description

Saved in:
Bibliographic Details
Main Authors: Srisungsittisunti,B., Natwichai,J.,
Format: Article
Published: Elsevier Limited 2015
Subjects:
Online Access:http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=84927761841&origin=inward
http://cmuir.cmu.ac.th/handle/6653943832/39117
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Chiang Mai University
id th-cmuir.6653943832-39117
record_format dspace
spelling th-cmuir.6653943832-391172015-06-16T08:01:39Z An incremental privacy-preservation algorithm for the (k, e)-Anonymous model Srisungsittisunti,B. Natwichai,J. , Electrical and Electronic Engineering Control and Systems Engineering Computer Science (all) © 2014 Elsevier Ltd. All rights reserved. An important issue to be addressed when data are to be published is data privacy. In this paper, the problem of data privacy based on a prominent privacy model, (k; e)-Anonymous, is addressed. Our scenario is that when a new dataset is to be released, there may be, at the same time, datasets that were released elsewhere. A problem arises because some attackers might obtain multiple versions of the same dataset and compare them with the newly released dataset. Although the privacy of all of the datasets has been well-preserved individually, such a comparison can lead to a privacy breach, which is a so-called ''incremental privacy breach''. To address this problem effectively, we first study the characteristics of the effects of multiple dataset releases with a theoretical approach. It has been found that a privacy breach that is subjected to an increment occurs when there is overlap between any parts of the new dataset with any parts of an existing dataset. Based on our proposed studies, a polynomial-time algorithm is proposed. This algorithm needs to consider only one previous version of the dataset, and it can also skip computing the overlapping partitions. Thus, the computational complexity of the proposed algorithm is reduced from O(nm) to only O(pn3) where p is the number of partitions, n is the number of tuples, and m is the number of released datasets. At the same time, the privacy of all of the released datasets as well as the optimal solution can be always guaranteed. In addition, experiment results that illustrate the efficiency of our algorithm on real-world datasets are presented. 2015-06-16T08:01:38Z 2015-06-16T08:01:38Z 2015-01-01 Article 00457906 2-s2.0-84927761841 10.1016/j.compeleceng.2014.10.007 http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=84927761841&origin=inward http://cmuir.cmu.ac.th/handle/6653943832/39117 Elsevier Limited
institution Chiang Mai University
building Chiang Mai University Library
country Thailand
collection CMU Intellectual Repository
topic Electrical and Electronic Engineering
Control and Systems Engineering
Computer Science (all)
spellingShingle Electrical and Electronic Engineering
Control and Systems Engineering
Computer Science (all)
Srisungsittisunti,B.
Natwichai,J.
,
An incremental privacy-preservation algorithm for the (k, e)-Anonymous model
description © 2014 Elsevier Ltd. All rights reserved. An important issue to be addressed when data are to be published is data privacy. In this paper, the problem of data privacy based on a prominent privacy model, (k; e)-Anonymous, is addressed. Our scenario is that when a new dataset is to be released, there may be, at the same time, datasets that were released elsewhere. A problem arises because some attackers might obtain multiple versions of the same dataset and compare them with the newly released dataset. Although the privacy of all of the datasets has been well-preserved individually, such a comparison can lead to a privacy breach, which is a so-called ''incremental privacy breach''. To address this problem effectively, we first study the characteristics of the effects of multiple dataset releases with a theoretical approach. It has been found that a privacy breach that is subjected to an increment occurs when there is overlap between any parts of the new dataset with any parts of an existing dataset. Based on our proposed studies, a polynomial-time algorithm is proposed. This algorithm needs to consider only one previous version of the dataset, and it can also skip computing the overlapping partitions. Thus, the computational complexity of the proposed algorithm is reduced from O(nm) to only O(pn3) where p is the number of partitions, n is the number of tuples, and m is the number of released datasets. At the same time, the privacy of all of the released datasets as well as the optimal solution can be always guaranteed. In addition, experiment results that illustrate the efficiency of our algorithm on real-world datasets are presented.
format Article
author Srisungsittisunti,B.
Natwichai,J.
,
author_facet Srisungsittisunti,B.
Natwichai,J.
,
author_sort Srisungsittisunti,B.
title An incremental privacy-preservation algorithm for the (k, e)-Anonymous model
title_short An incremental privacy-preservation algorithm for the (k, e)-Anonymous model
title_full An incremental privacy-preservation algorithm for the (k, e)-Anonymous model
title_fullStr An incremental privacy-preservation algorithm for the (k, e)-Anonymous model
title_full_unstemmed An incremental privacy-preservation algorithm for the (k, e)-Anonymous model
title_sort incremental privacy-preservation algorithm for the (k, e)-anonymous model
publisher Elsevier Limited
publishDate 2015
url http://www.scopus.com/inward/record.url?partnerID=HzOxMe3b&scp=84927761841&origin=inward
http://cmuir.cmu.ac.th/handle/6653943832/39117
_version_ 1681421596384821248