Data quality in privacy preservation for associative classification

Privacy preserving has become an essential process for any data mining task. In general, data transformation is needed to ensure privacy preservation. Once the privacy is preserved, data quality issue must be addressed, i.e. the impact on data quality should be minimized. In this paper, k-Anonymizat...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلفون الرئيسيون:	Nattapon Harnsamut, Juggapong Natwichai, Xingzhi Sun, Xue Li
التنسيق:	Book Series
منشور في:	2018
الموضوعات:	Computer Science Mathematics
الوصول للمادة أونلاين:	https://www.scopus.com/inward/record.uri?partnerID=HzOxMe3b&scp=68749105788&origin=inward http://cmuir.cmu.ac.th/jspui/handle/6653943832/60280
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
المؤسسة:	Chiang Mai University

الوصف
الملخص:	Privacy preserving has become an essential process for any data mining task. In general, data transformation is needed to ensure privacy preservation. Once the privacy is preserved, data quality issue must be addressed, i.e. the impact on data quality should be minimized. In this paper, k-Anonymization is considered as the transformation approach for preserving data privacy. In such a context, we discuss the metrics of the data quality in terms of classification, which is one of the most important tasks in data mining. Since different type of classification may use different approach to deliver knowledge, data quality metric for the classification task should be tailored to a certain type of classification. Specifically, we propose a frequency-based data quality metric to represent the data quality of the transformed dataset in the situation that associative classification is to be processed. Subsequently, we validate our proposed metric with experiments. The experiment results have shown that our proposed metric can effectively reflect the data quality for the associative classification problem. © 2008 Springer-Verlag Berlin Heidelberg.

Data quality in privacy preservation for associative classification

مواد مشابهة