Dynamic data replication for distributed cloud storage

Cloud storage services have become increasingly popular in recent years. This thesis investigates a cost optimization problem for data replication in distributed cloud storage. We consider two different settings: offine and online. In the offine setting where the complete information of data access...

Full description

Saved in:
Bibliographic Details
Main Author: Cai, Jianfei
Other Authors: Tang Xueyan
Format: Thesis-Master by Research
Language:English
Published: Nanyang Technological University 2021
Subjects:
Online Access:https://hdl.handle.net/10356/149825
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:Cloud storage services have become increasingly popular in recent years. This thesis investigates a cost optimization problem for data replication in distributed cloud storage. We consider two different settings: offine and online. In the offine setting where the complete information of data access requests is available, we develop an O(max(m^2n, n^2)) optimal algorithm, where m and n are the numbers of storage sites and requests respectively. In the online setting where the requests arriving in the future are not known, we propose two distributed algorithms: a 3-competitive basic algorithm and a dynamic algorithm. Both algorithms are lightweight and easy to implement. Experiments using the pricing data of Google Cloud Storage show that our online algorithms can perform close to the optimal.