Traffic-optimized data placement for social media
Social media users are generating data on an unprecedented scale. Distributed storage systems are often used to cope with explosive data growth. Data partitioning and replication are two interrelated data placement issues affecting the interserver traffic caused by user-initiated read and write oper...
Saved in:
Main Authors: | , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/140032 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-140032 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1400322020-05-26T05:15:14Z Traffic-optimized data placement for social media Tang, Jing Tang, Xueyan Yuan, Junsong School of Computer Science and Engineering School of Electrical and Electronic Engineering Interdisciplinary Graduate School (IGS) Engineering::Computer science and engineering Social Media Distributed Storage Social media users are generating data on an unprecedented scale. Distributed storage systems are often used to cope with explosive data growth. Data partitioning and replication are two interrelated data placement issues affecting the interserver traffic caused by user-initiated read and write operations in distributed storage systems. This paper investigates how to minimize the interserver traffic among a cluster of social media servers through joint data partitioning and replication optimization. We formally define the problem and study its hardness. We then propose a traffic-optimized partitioning and replication (TOPR) method to continuously adapt data placement according to various dynamics. Evaluations with real Twitter and LiveJournal social graphs show that TOPR not only reduces the interserver traffic significantly but also saves much storage cost of replication compared to state-of-the-art methods. We also benchmark TOPR against the offline optimum by a binary linear program. NRF (Natl Research Foundation, S’pore) MOE (Min. of Education, S’pore) 2020-05-26T05:15:14Z 2020-05-26T05:15:14Z 2017 Journal Article Tang, J., Tang, X., & Yuan, J. (2018). Traffic-optimized data placement for social media. IEEE Transactions on Multimedia, 20(4), 1008-1023. doi:10.1109/TMM.2017.2760627 1520-9210 https://hdl.handle.net/10356/140032 10.1109/TMM.2017.2760627 2-s2.0-85031773704 4 20 1008 1023 en IEEE Transactions on Multimedia © 2017 IEEE. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Social Media Distributed Storage |
spellingShingle |
Engineering::Computer science and engineering Social Media Distributed Storage Tang, Jing Tang, Xueyan Yuan, Junsong Traffic-optimized data placement for social media |
description |
Social media users are generating data on an unprecedented scale. Distributed storage systems are often used to cope with explosive data growth. Data partitioning and replication are two interrelated data placement issues affecting the interserver traffic caused by user-initiated read and write operations in distributed storage systems. This paper investigates how to minimize the interserver traffic among a cluster of social media servers through joint data partitioning and replication optimization. We formally define the problem and study its hardness. We then propose a traffic-optimized partitioning and replication (TOPR) method to continuously adapt data placement according to various dynamics. Evaluations with real Twitter and LiveJournal social graphs show that TOPR not only reduces the interserver traffic significantly but also saves much storage cost of replication compared to state-of-the-art methods. We also benchmark TOPR against the offline optimum by a binary linear program. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Tang, Jing Tang, Xueyan Yuan, Junsong |
format |
Article |
author |
Tang, Jing Tang, Xueyan Yuan, Junsong |
author_sort |
Tang, Jing |
title |
Traffic-optimized data placement for social media |
title_short |
Traffic-optimized data placement for social media |
title_full |
Traffic-optimized data placement for social media |
title_fullStr |
Traffic-optimized data placement for social media |
title_full_unstemmed |
Traffic-optimized data placement for social media |
title_sort |
traffic-optimized data placement for social media |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/140032 |
_version_ |
1681059671621763072 |