Differentially private distributed frequency estimation
In order to remain competitive, Internet companies collect and analyse user data for the purpose of the improvement of user experiences. Frequency estimation is a widely used statistical tool, which could potentially conflict with the relevant privacy regulations. Privacy preserving analytic methods...
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/168026 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-168026 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1680262023-05-19T15:36:24Z Differentially private distributed frequency estimation Yang, Mengmeng Tjuawinata, Ivan Lam, Kwok-Yan Zhu, Tianqing Zhao, Jun School of Computer Science and Engineering Strategic Centre for Research in Privacy-Preserving Technologies & Systems Engineering::Computer science and engineering Differential Privacy Frequency Estimation In order to remain competitive, Internet companies collect and analyse user data for the purpose of the improvement of user experiences. Frequency estimation is a widely used statistical tool, which could potentially conflict with the relevant privacy regulations. Privacy preserving analytic methods based on differential privacy have been proposed, which require either a large user base or a trusted server. Although the requirements for such solutions may not be a problem for larger companies, they may be unattainable for smaller organizations. To address this issue, we propose a distributed privacy-preserving sampling-based frequency estimation method which has high accuracy even in the scenario with a small number of users while not requiring any trusted server. This is achieved by combining multi-party computation and sampling techniques. We also provide a relation between its privacy guarantee, output accuracy, and the number of participants. Distinct from most existing methods, our methods achieve <italic>centralized</italic> differential privacy guarantee without the need of any trusted server. We established that, even for a small number of participants, our mechanisms can produce estimates with high accuracy and hence they provide smaller companies with more opportunity for growth through privacy-preserving statistical analysis. We further propose an architectural model to support weighted aggregation in order to achieve a higher accuracy estimate to cater for users with varying privacy requirements. Compared to the unweighted aggregation, our method provides a more accurate estimate. Extensive experiments are conducted to show the effectiveness of the proposed methods. Submitted/Accepted version 2023-05-19T03:05:40Z 2023-05-19T03:05:40Z 2022 Journal Article Yang, M., Tjuawinata, I., Lam, K., Zhu, T. & Zhao, J. (2022). Differentially private distributed frequency estimation. IEEE Transactions On Dependable and Secure Computing. https://dx.doi.org/10.1109/TDSC.2022.3227654 1545-5971 https://hdl.handle.net/10356/168026 10.1109/TDSC.2022.3227654 2-s2.0-85144804720 en IEEE Transactions on Dependable and Secure Computing © 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/TDSC.2022.3227654. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Differential Privacy Frequency Estimation |
spellingShingle |
Engineering::Computer science and engineering Differential Privacy Frequency Estimation Yang, Mengmeng Tjuawinata, Ivan Lam, Kwok-Yan Zhu, Tianqing Zhao, Jun Differentially private distributed frequency estimation |
description |
In order to remain competitive, Internet companies collect and analyse user data for the purpose of the improvement of user experiences. Frequency estimation is a widely used statistical tool, which could potentially conflict with the relevant privacy regulations. Privacy preserving analytic methods based on differential privacy have been proposed, which require either a large user base or a trusted server. Although the requirements for such solutions may not be a problem for larger companies, they may be unattainable for smaller organizations. To address this issue, we propose a distributed privacy-preserving sampling-based frequency estimation method which has high accuracy even in the scenario with a small number of users while not requiring any trusted server. This is achieved by combining multi-party computation and sampling techniques. We also provide a relation between its privacy guarantee, output accuracy, and the number of participants. Distinct from most existing methods, our methods achieve <italic>centralized</italic> differential privacy guarantee without the need of any trusted server. We established that, even for a small number of participants, our mechanisms can produce estimates with high accuracy and hence they provide smaller companies with more opportunity for growth through privacy-preserving statistical analysis. We further propose an architectural model to support weighted aggregation in order to achieve a higher accuracy estimate to cater for users with varying privacy requirements. Compared to the unweighted aggregation, our method provides a more accurate estimate. Extensive experiments are conducted to show the effectiveness of the proposed methods. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Yang, Mengmeng Tjuawinata, Ivan Lam, Kwok-Yan Zhu, Tianqing Zhao, Jun |
format |
Article |
author |
Yang, Mengmeng Tjuawinata, Ivan Lam, Kwok-Yan Zhu, Tianqing Zhao, Jun |
author_sort |
Yang, Mengmeng |
title |
Differentially private distributed frequency estimation |
title_short |
Differentially private distributed frequency estimation |
title_full |
Differentially private distributed frequency estimation |
title_fullStr |
Differentially private distributed frequency estimation |
title_full_unstemmed |
Differentially private distributed frequency estimation |
title_sort |
differentially private distributed frequency estimation |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/168026 |
_version_ |
1772825924737171456 |