Differentially private distributed frequency estimation

In order to remain competitive, Internet companies collect and analyse user data for the purpose of the improvement of user experiences. Frequency estimation is a widely used statistical tool, which could potentially conflict with the relevant privacy regulations. Privacy preserving analytic methods...

Full description

Saved in:
Bibliographic Details
Main Authors: Yang, Mengmeng, Tjuawinata, Ivan, Lam, Kwok-Yan, Zhu, Tianqing, Zhao, Jun
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/168026
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-168026
record_format dspace
spelling sg-ntu-dr.10356-1680262023-05-19T15:36:24Z Differentially private distributed frequency estimation Yang, Mengmeng Tjuawinata, Ivan Lam, Kwok-Yan Zhu, Tianqing Zhao, Jun School of Computer Science and Engineering Strategic Centre for Research in Privacy-Preserving Technologies & Systems Engineering::Computer science and engineering Differential Privacy Frequency Estimation In order to remain competitive, Internet companies collect and analyse user data for the purpose of the improvement of user experiences. Frequency estimation is a widely used statistical tool, which could potentially conflict with the relevant privacy regulations. Privacy preserving analytic methods based on differential privacy have been proposed, which require either a large user base or a trusted server. Although the requirements for such solutions may not be a problem for larger companies, they may be unattainable for smaller organizations. To address this issue, we propose a distributed privacy-preserving sampling-based frequency estimation method which has high accuracy even in the scenario with a small number of users while not requiring any trusted server. This is achieved by combining multi-party computation and sampling techniques. We also provide a relation between its privacy guarantee, output accuracy, and the number of participants. Distinct from most existing methods, our methods achieve <italic>centralized</italic> differential privacy guarantee without the need of any trusted server. We established that, even for a small number of participants, our mechanisms can produce estimates with high accuracy and hence they provide smaller companies with more opportunity for growth through privacy-preserving statistical analysis. We further propose an architectural model to support weighted aggregation in order to achieve a higher accuracy estimate to cater for users with varying privacy requirements. Compared to the unweighted aggregation, our method provides a more accurate estimate. Extensive experiments are conducted to show the effectiveness of the proposed methods. Submitted/Accepted version 2023-05-19T03:05:40Z 2023-05-19T03:05:40Z 2022 Journal Article Yang, M., Tjuawinata, I., Lam, K., Zhu, T. & Zhao, J. (2022). Differentially private distributed frequency estimation. IEEE Transactions On Dependable and Secure Computing. https://dx.doi.org/10.1109/TDSC.2022.3227654 1545-5971 https://hdl.handle.net/10356/168026 10.1109/TDSC.2022.3227654 2-s2.0-85144804720 en IEEE Transactions on Dependable and Secure Computing © 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/TDSC.2022.3227654. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Differential Privacy
Frequency Estimation
spellingShingle Engineering::Computer science and engineering
Differential Privacy
Frequency Estimation
Yang, Mengmeng
Tjuawinata, Ivan
Lam, Kwok-Yan
Zhu, Tianqing
Zhao, Jun
Differentially private distributed frequency estimation
description In order to remain competitive, Internet companies collect and analyse user data for the purpose of the improvement of user experiences. Frequency estimation is a widely used statistical tool, which could potentially conflict with the relevant privacy regulations. Privacy preserving analytic methods based on differential privacy have been proposed, which require either a large user base or a trusted server. Although the requirements for such solutions may not be a problem for larger companies, they may be unattainable for smaller organizations. To address this issue, we propose a distributed privacy-preserving sampling-based frequency estimation method which has high accuracy even in the scenario with a small number of users while not requiring any trusted server. This is achieved by combining multi-party computation and sampling techniques. We also provide a relation between its privacy guarantee, output accuracy, and the number of participants. Distinct from most existing methods, our methods achieve <italic>centralized</italic> differential privacy guarantee without the need of any trusted server. We established that, even for a small number of participants, our mechanisms can produce estimates with high accuracy and hence they provide smaller companies with more opportunity for growth through privacy-preserving statistical analysis. We further propose an architectural model to support weighted aggregation in order to achieve a higher accuracy estimate to cater for users with varying privacy requirements. Compared to the unweighted aggregation, our method provides a more accurate estimate. Extensive experiments are conducted to show the effectiveness of the proposed methods.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Yang, Mengmeng
Tjuawinata, Ivan
Lam, Kwok-Yan
Zhu, Tianqing
Zhao, Jun
format Article
author Yang, Mengmeng
Tjuawinata, Ivan
Lam, Kwok-Yan
Zhu, Tianqing
Zhao, Jun
author_sort Yang, Mengmeng
title Differentially private distributed frequency estimation
title_short Differentially private distributed frequency estimation
title_full Differentially private distributed frequency estimation
title_fullStr Differentially private distributed frequency estimation
title_full_unstemmed Differentially private distributed frequency estimation
title_sort differentially private distributed frequency estimation
publishDate 2023
url https://hdl.handle.net/10356/168026
_version_ 1772825924737171456