DisMASTD: An efficient distributed multi-aspect streaming tensor decomposition

Tensor decomposition is a fundamental multidimensional data analysis tool for many data-driven applications, such as social computing, computer vision, and bioinformatics, to name but a few. However, the rapidly increasing streaming data nowadays introduces new challenges to traditional static tenso...

Full description

Saved in:
Bibliographic Details
Main Authors: YANG, Keyu, GAO, Yunjun, SHEN, Yifeng, ZHENG, Baihua, CHEN, Lu
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2021
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/6124
https://ink.library.smu.edu.sg/context/sis_research/article/7127/viewcontent/DisMASTD_ICDE21_CR.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-7127
record_format dspace
spelling sg-smu-ink.sis_research-71272023-10-31T02:06:45Z DisMASTD: An efficient distributed multi-aspect streaming tensor decomposition YANG, Keyu GAO, Yunjun SHEN, Yifeng ZHENG, Baihua CHEN, Lu Tensor decomposition is a fundamental multidimensional data analysis tool for many data-driven applications, such as social computing, computer vision, and bioinformatics, to name but a few. However, the rapidly increasing streaming data nowadays introduces new challenges to traditional static tensor decomposition. It requires an efficient distributed dynamic tensor decomposition without re-computing the whole tensor from scratch. In this paper, we propose DisMASTD, an efficient distributed multi-aspect streaming tensor decomposition. First, we prove the optimal tensor partitioning problem is NP-hard. Second, we present two heuristic tensor partitioning approaches to ensure the load balancing. Third, we develop a distributed multi-aspect streaming tensor decomposition computation method, which avoids repetitive computation and reduces network communication by maintaining and reusing the intermediate results. Last but not least, we perform extensive experiments with both real and synthetic datasets to demonstrate the efficiency and scalability of DisMASTD. 2021-04-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6124 info:doi/10.1109/ICDE51399.2021.00098 https://ink.library.smu.edu.sg/context/sis_research/article/7127/viewcontent/DisMASTD_ICDE21_CR.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Data analysis tensors social computing Databases and Information Systems Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Data analysis
tensors
social computing
Databases and Information Systems
Numerical Analysis and Scientific Computing
spellingShingle Data analysis
tensors
social computing
Databases and Information Systems
Numerical Analysis and Scientific Computing
YANG, Keyu
GAO, Yunjun
SHEN, Yifeng
ZHENG, Baihua
CHEN, Lu
DisMASTD: An efficient distributed multi-aspect streaming tensor decomposition
description Tensor decomposition is a fundamental multidimensional data analysis tool for many data-driven applications, such as social computing, computer vision, and bioinformatics, to name but a few. However, the rapidly increasing streaming data nowadays introduces new challenges to traditional static tensor decomposition. It requires an efficient distributed dynamic tensor decomposition without re-computing the whole tensor from scratch. In this paper, we propose DisMASTD, an efficient distributed multi-aspect streaming tensor decomposition. First, we prove the optimal tensor partitioning problem is NP-hard. Second, we present two heuristic tensor partitioning approaches to ensure the load balancing. Third, we develop a distributed multi-aspect streaming tensor decomposition computation method, which avoids repetitive computation and reduces network communication by maintaining and reusing the intermediate results. Last but not least, we perform extensive experiments with both real and synthetic datasets to demonstrate the efficiency and scalability of DisMASTD.
format text
author YANG, Keyu
GAO, Yunjun
SHEN, Yifeng
ZHENG, Baihua
CHEN, Lu
author_facet YANG, Keyu
GAO, Yunjun
SHEN, Yifeng
ZHENG, Baihua
CHEN, Lu
author_sort YANG, Keyu
title DisMASTD: An efficient distributed multi-aspect streaming tensor decomposition
title_short DisMASTD: An efficient distributed multi-aspect streaming tensor decomposition
title_full DisMASTD: An efficient distributed multi-aspect streaming tensor decomposition
title_fullStr DisMASTD: An efficient distributed multi-aspect streaming tensor decomposition
title_full_unstemmed DisMASTD: An efficient distributed multi-aspect streaming tensor decomposition
title_sort dismastd: an efficient distributed multi-aspect streaming tensor decomposition
publisher Institutional Knowledge at Singapore Management University
publishDate 2021
url https://ink.library.smu.edu.sg/sis_research/6124
https://ink.library.smu.edu.sg/context/sis_research/article/7127/viewcontent/DisMASTD_ICDE21_CR.pdf
_version_ 1781794001008984064