Metaflow: a scalable metadata lookup service for distributed file systems in data centers

In large-scale distributed file systems, efficient metadata operations are critical since most file operations have to interact with metadata servers first. In existing distributed hash table (DHT) based metadata management systems, the lookup service could be a performance bottleneck due to its sig...

Full description

Saved in:
Bibliographic Details
Main Authors: SUN, Peng, WEN, Yonggang, TA, Nguyen Binh Duong, XIE, Haiyong
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2016
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4767
https://ink.library.smu.edu.sg/context/sis_research/article/5770/viewcontent/1611.01594.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5770
record_format dspace
spelling sg-smu-ink.sis_research-57702020-01-16T10:26:45Z Metaflow: a scalable metadata lookup service for distributed file systems in data centers SUN, Peng WEN, Yonggang TA, Nguyen Binh Duong XIE, Haiyong In large-scale distributed file systems, efficient metadata operations are critical since most file operations have to interact with metadata servers first. In existing distributed hash table (DHT) based metadata management systems, the lookup service could be a performance bottleneck due to its significant CPU overhead. Our investigations showed that the lookup service could reduce system throughput by up to 70%, and increase system latency by a factor of up to 8 compared to ideal scenarios. In this paper, we present MetaFlow, a scalable metadata lookup service utilizing software-defined networking (SDN) techniques to distribute lookup workload over network components. MetaFlow tackles the lookup bottleneck problem by leveraging B-tree, which is constructed over the physical topology, to manage flow tables for SDN-enabled switches. Therefore, metadata requests can be forwarded to appropriate servers using only switches. Extensive performance evaluations in both simulations and testbed showed that MetaFlow increases system throughput by a factor of up to 3.2, and reduce system latency by a factor of up to 5 compared to DHT-based systems. We also deployed MetaFlow in a distributed file system, and demonstrated significant performance improvement. 2016-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4767 info:doi/10.1109/TBDATA.2016.2612241 https://ink.library.smu.edu.sg/context/sis_research/article/5770/viewcontent/1611.01594.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Metadata Management Software-Defined Networking B-tree Big Data Databases and Information Systems Data Storage Systems Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Metadata Management
Software-Defined Networking
B-tree
Big Data
Databases and Information Systems
Data Storage Systems
Software Engineering
spellingShingle Metadata Management
Software-Defined Networking
B-tree
Big Data
Databases and Information Systems
Data Storage Systems
Software Engineering
SUN, Peng
WEN, Yonggang
TA, Nguyen Binh Duong
XIE, Haiyong
Metaflow: a scalable metadata lookup service for distributed file systems in data centers
description In large-scale distributed file systems, efficient metadata operations are critical since most file operations have to interact with metadata servers first. In existing distributed hash table (DHT) based metadata management systems, the lookup service could be a performance bottleneck due to its significant CPU overhead. Our investigations showed that the lookup service could reduce system throughput by up to 70%, and increase system latency by a factor of up to 8 compared to ideal scenarios. In this paper, we present MetaFlow, a scalable metadata lookup service utilizing software-defined networking (SDN) techniques to distribute lookup workload over network components. MetaFlow tackles the lookup bottleneck problem by leveraging B-tree, which is constructed over the physical topology, to manage flow tables for SDN-enabled switches. Therefore, metadata requests can be forwarded to appropriate servers using only switches. Extensive performance evaluations in both simulations and testbed showed that MetaFlow increases system throughput by a factor of up to 3.2, and reduce system latency by a factor of up to 5 compared to DHT-based systems. We also deployed MetaFlow in a distributed file system, and demonstrated significant performance improvement.
format text
author SUN, Peng
WEN, Yonggang
TA, Nguyen Binh Duong
XIE, Haiyong
author_facet SUN, Peng
WEN, Yonggang
TA, Nguyen Binh Duong
XIE, Haiyong
author_sort SUN, Peng
title Metaflow: a scalable metadata lookup service for distributed file systems in data centers
title_short Metaflow: a scalable metadata lookup service for distributed file systems in data centers
title_full Metaflow: a scalable metadata lookup service for distributed file systems in data centers
title_fullStr Metaflow: a scalable metadata lookup service for distributed file systems in data centers
title_full_unstemmed Metaflow: a scalable metadata lookup service for distributed file systems in data centers
title_sort metaflow: a scalable metadata lookup service for distributed file systems in data centers
publisher Institutional Knowledge at Singapore Management University
publishDate 2016
url https://ink.library.smu.edu.sg/sis_research/4767
https://ink.library.smu.edu.sg/context/sis_research/article/5770/viewcontent/1611.01594.pdf
_version_ 1770575025520246784