Embellishing Text Search Queries to Protect User Privacy

Users of text search engines are increasingly wary that their activities may disclose confidential information about their business or personal profiles. It would be desirable for a search engine to perform document retrieval for users while protecting their intent. In this paper, we identify the pr...

Full description

Saved in:
Bibliographic Details
Main Authors: PANG, Hwee Hwa, DING, Xuhua, XIAO, Xiaokui
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2010
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/614
https://ink.library.smu.edu.sg/context/sis_research/article/1613/viewcontent/vldb10.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-1613
record_format dspace
spelling sg-smu-ink.sis_research-16132016-09-16T01:53:13Z Embellishing Text Search Queries to Protect User Privacy PANG, Hwee Hwa DING, Xuhua XIAO, Xiaokui Users of text search engines are increasingly wary that their activities may disclose confidential information about their business or personal profiles. It would be desirable for a search engine to perform document retrieval for users while protecting their intent. In this paper, we identify the privacy risks arising from semantically related search terms within a query, and from recurring highspecificity query terms in a search session. To counter the risks, we propose a solution for a similarity text retrieval system to offer anonymity and plausible deniability for the query terms, and hence the user intent, without degrading the system’s precision-recall performance. The solution comprises a mechanism that embellishes each user query with decoy terms that exhibit similar specificity spread as the genuine terms, but point to plausible alternative topics. We also provide an accompanying retrieval scheme that enables the search engine to compute the encrypted document relevance scores from only the genuine search terms, yet remain oblivious to their distinction from the decoys. Empirical evaluation results are presented to substantiate the effectiveness of our solution. 2010-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/614 info:doi/10.14778/1920841.1920918 https://ink.library.smu.edu.sg/context/sis_research/article/1613/viewcontent/vldb10.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Search engines privacy confidential information similarity text retrieval system Databases and Information Systems Information Security
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Search engines
privacy
confidential information
similarity text retrieval system
Databases and Information Systems
Information Security
spellingShingle Search engines
privacy
confidential information
similarity text retrieval system
Databases and Information Systems
Information Security
PANG, Hwee Hwa
DING, Xuhua
XIAO, Xiaokui
Embellishing Text Search Queries to Protect User Privacy
description Users of text search engines are increasingly wary that their activities may disclose confidential information about their business or personal profiles. It would be desirable for a search engine to perform document retrieval for users while protecting their intent. In this paper, we identify the privacy risks arising from semantically related search terms within a query, and from recurring highspecificity query terms in a search session. To counter the risks, we propose a solution for a similarity text retrieval system to offer anonymity and plausible deniability for the query terms, and hence the user intent, without degrading the system’s precision-recall performance. The solution comprises a mechanism that embellishes each user query with decoy terms that exhibit similar specificity spread as the genuine terms, but point to plausible alternative topics. We also provide an accompanying retrieval scheme that enables the search engine to compute the encrypted document relevance scores from only the genuine search terms, yet remain oblivious to their distinction from the decoys. Empirical evaluation results are presented to substantiate the effectiveness of our solution.
format text
author PANG, Hwee Hwa
DING, Xuhua
XIAO, Xiaokui
author_facet PANG, Hwee Hwa
DING, Xuhua
XIAO, Xiaokui
author_sort PANG, Hwee Hwa
title Embellishing Text Search Queries to Protect User Privacy
title_short Embellishing Text Search Queries to Protect User Privacy
title_full Embellishing Text Search Queries to Protect User Privacy
title_fullStr Embellishing Text Search Queries to Protect User Privacy
title_full_unstemmed Embellishing Text Search Queries to Protect User Privacy
title_sort embellishing text search queries to protect user privacy
publisher Institutional Knowledge at Singapore Management University
publishDate 2010
url https://ink.library.smu.edu.sg/sis_research/614
https://ink.library.smu.edu.sg/context/sis_research/article/1613/viewcontent/vldb10.pdf
_version_ 1770570619765653504