DisMASTD: An efficient distributed multi-aspect streaming tensor decomposition
Tensor decomposition is a fundamental multidimensional data analysis tool for many data-driven applications, such as social computing, computer vision, and bioinformatics, to name but a few. However, the rapidly increasing streaming data nowadays introduces new challenges to traditional static tenso...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2021
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/6124 https://ink.library.smu.edu.sg/context/sis_research/article/7127/viewcontent/DisMASTD_ICDE21_CR.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-7127 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-71272023-10-31T02:06:45Z DisMASTD: An efficient distributed multi-aspect streaming tensor decomposition YANG, Keyu GAO, Yunjun SHEN, Yifeng ZHENG, Baihua CHEN, Lu Tensor decomposition is a fundamental multidimensional data analysis tool for many data-driven applications, such as social computing, computer vision, and bioinformatics, to name but a few. However, the rapidly increasing streaming data nowadays introduces new challenges to traditional static tensor decomposition. It requires an efficient distributed dynamic tensor decomposition without re-computing the whole tensor from scratch. In this paper, we propose DisMASTD, an efficient distributed multi-aspect streaming tensor decomposition. First, we prove the optimal tensor partitioning problem is NP-hard. Second, we present two heuristic tensor partitioning approaches to ensure the load balancing. Third, we develop a distributed multi-aspect streaming tensor decomposition computation method, which avoids repetitive computation and reduces network communication by maintaining and reusing the intermediate results. Last but not least, we perform extensive experiments with both real and synthetic datasets to demonstrate the efficiency and scalability of DisMASTD. 2021-04-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/6124 info:doi/10.1109/ICDE51399.2021.00098 https://ink.library.smu.edu.sg/context/sis_research/article/7127/viewcontent/DisMASTD_ICDE21_CR.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Data analysis tensors social computing Databases and Information Systems Numerical Analysis and Scientific Computing |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Data analysis tensors social computing Databases and Information Systems Numerical Analysis and Scientific Computing |
spellingShingle |
Data analysis tensors social computing Databases and Information Systems Numerical Analysis and Scientific Computing YANG, Keyu GAO, Yunjun SHEN, Yifeng ZHENG, Baihua CHEN, Lu DisMASTD: An efficient distributed multi-aspect streaming tensor decomposition |
description |
Tensor decomposition is a fundamental multidimensional data analysis tool for many data-driven applications, such as social computing, computer vision, and bioinformatics, to name but a few. However, the rapidly increasing streaming data nowadays introduces new challenges to traditional static tensor decomposition. It requires an efficient distributed dynamic tensor decomposition without re-computing the whole tensor from scratch. In this paper, we propose DisMASTD, an efficient distributed multi-aspect streaming tensor decomposition. First, we prove the optimal tensor partitioning problem is NP-hard. Second, we present two heuristic tensor partitioning approaches to ensure the load balancing. Third, we develop a distributed multi-aspect streaming tensor decomposition computation method, which avoids repetitive computation and reduces network communication by maintaining and reusing the intermediate results. Last but not least, we perform extensive experiments with both real and synthetic datasets to demonstrate the efficiency and scalability of DisMASTD. |
format |
text |
author |
YANG, Keyu GAO, Yunjun SHEN, Yifeng ZHENG, Baihua CHEN, Lu |
author_facet |
YANG, Keyu GAO, Yunjun SHEN, Yifeng ZHENG, Baihua CHEN, Lu |
author_sort |
YANG, Keyu |
title |
DisMASTD: An efficient distributed multi-aspect streaming tensor decomposition |
title_short |
DisMASTD: An efficient distributed multi-aspect streaming tensor decomposition |
title_full |
DisMASTD: An efficient distributed multi-aspect streaming tensor decomposition |
title_fullStr |
DisMASTD: An efficient distributed multi-aspect streaming tensor decomposition |
title_full_unstemmed |
DisMASTD: An efficient distributed multi-aspect streaming tensor decomposition |
title_sort |
dismastd: an efficient distributed multi-aspect streaming tensor decomposition |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2021 |
url |
https://ink.library.smu.edu.sg/sis_research/6124 https://ink.library.smu.edu.sg/context/sis_research/article/7127/viewcontent/DisMASTD_ICDE21_CR.pdf |
_version_ |
1781794001008984064 |