Traffic-optimized data placement for social media

Social media users are generating data on an unprecedented scale. Distributed storage systems are often used to cope with explosive data growth. Data partitioning and replication are two interrelated data placement issues affecting the interserver traffic caused by user-initiated read and write oper...

Full description

Saved in:
Bibliographic Details
Main Authors: Tang, Jing, Tang, Xueyan, Yuan, Junsong
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2020
Subjects:
Online Access:https://hdl.handle.net/10356/140032
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-140032
record_format dspace
spelling sg-ntu-dr.10356-1400322020-05-26T05:15:14Z Traffic-optimized data placement for social media Tang, Jing Tang, Xueyan Yuan, Junsong School of Computer Science and Engineering School of Electrical and Electronic Engineering Interdisciplinary Graduate School (IGS) Engineering::Computer science and engineering Social Media Distributed Storage Social media users are generating data on an unprecedented scale. Distributed storage systems are often used to cope with explosive data growth. Data partitioning and replication are two interrelated data placement issues affecting the interserver traffic caused by user-initiated read and write operations in distributed storage systems. This paper investigates how to minimize the interserver traffic among a cluster of social media servers through joint data partitioning and replication optimization. We formally define the problem and study its hardness. We then propose a traffic-optimized partitioning and replication (TOPR) method to continuously adapt data placement according to various dynamics. Evaluations with real Twitter and LiveJournal social graphs show that TOPR not only reduces the interserver traffic significantly but also saves much storage cost of replication compared to state-of-the-art methods. We also benchmark TOPR against the offline optimum by a binary linear program. NRF (Natl Research Foundation, S’pore) MOE (Min. of Education, S’pore) 2020-05-26T05:15:14Z 2020-05-26T05:15:14Z 2017 Journal Article Tang, J., Tang, X., & Yuan, J. (2018). Traffic-optimized data placement for social media. IEEE Transactions on Multimedia, 20(4), 1008-1023. doi:10.1109/TMM.2017.2760627 1520-9210 https://hdl.handle.net/10356/140032 10.1109/TMM.2017.2760627 2-s2.0-85031773704 4 20 1008 1023 en IEEE Transactions on Multimedia © 2017 IEEE. All rights reserved.
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Social Media
Distributed Storage
spellingShingle Engineering::Computer science and engineering
Social Media
Distributed Storage
Tang, Jing
Tang, Xueyan
Yuan, Junsong
Traffic-optimized data placement for social media
description Social media users are generating data on an unprecedented scale. Distributed storage systems are often used to cope with explosive data growth. Data partitioning and replication are two interrelated data placement issues affecting the interserver traffic caused by user-initiated read and write operations in distributed storage systems. This paper investigates how to minimize the interserver traffic among a cluster of social media servers through joint data partitioning and replication optimization. We formally define the problem and study its hardness. We then propose a traffic-optimized partitioning and replication (TOPR) method to continuously adapt data placement according to various dynamics. Evaluations with real Twitter and LiveJournal social graphs show that TOPR not only reduces the interserver traffic significantly but also saves much storage cost of replication compared to state-of-the-art methods. We also benchmark TOPR against the offline optimum by a binary linear program.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Tang, Jing
Tang, Xueyan
Yuan, Junsong
format Article
author Tang, Jing
Tang, Xueyan
Yuan, Junsong
author_sort Tang, Jing
title Traffic-optimized data placement for social media
title_short Traffic-optimized data placement for social media
title_full Traffic-optimized data placement for social media
title_fullStr Traffic-optimized data placement for social media
title_full_unstemmed Traffic-optimized data placement for social media
title_sort traffic-optimized data placement for social media
publishDate 2020
url https://hdl.handle.net/10356/140032
_version_ 1681059671621763072