STDA: Secure Time Series Data Analytics with practical efficiency in wide-area network

Time series data analytics technology significantly benefits modern scientific research, especially in fields such as medical health, financial investment, and transportation. Unfortunately, privacy issues hinder people from handing over the data to a third party for various analytical tasks; becaus...

Full description

Saved in:
Bibliographic Details
Main Authors: LI, Xiaoguo, HUANG Zixi, ZHAO, Bowen, YANG, Guomin, XIANG, Tao, DENG, Robert H.
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8499
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
Description
Summary:Time series data analytics technology significantly benefits modern scientific research, especially in fields such as medical health, financial investment, and transportation. Unfortunately, privacy issues hinder people from handing over the data to a third party for various analytical tasks; because the data may reveal much more individual sensitive information, e.g., disease information from medical data, investment tendency from financial data, or the daily trajectory from transportation data. To break down this barrier, secure computation approaches have shown their importance in processing sensitive data, and have attracted much attention from the industry and research communities. However, when considering the case of secure time-series data analytics (e.g., DTW similarity), we are still far from achieving high efficiency due to high round complexity in communication or expensive computational complexity. We observe that DTW involves a lot of comparison operations and existing approaches in dealing with the comparison require higher communication costs. To this end, this paper studies secure DTW-based analytics with practical efficiency over time series data. Specifically, we propose the framework of secure time series data analytics (STDA) and formulate the problem of top- query for outsourced time series data. Based on threshold Paillier encryption, we present a top- query protocol utilizing the DTW distance as a metric and its security analysis, optimizations, and performance evaluation. The experimental results demonstrate that in a wide-area network with a 10 ms latency, our top- approach outperforms the state-of-the-art by 3x times, while DTW calculation outperforms by 9x times. Correspondingly, the optimized DTW achieves 17x times better, and optimized top- achieves 4-10x times better.