Communication-efficient Classification in P2P Networks

Distributed classification aims to learn with accuracy comparable to that of centralized approaches but at far lesser communication and computation costs. By nature, P2P networks provide an excellent environment for performing a distributed classification task due to the high availability of shared...

Full description

Saved in:
Bibliographic Details
Main Authors: ANG, Hock Hee, Gopalkrishnan, Vivekanand, NG, Wee Keong, HOI, Steven C. H.
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2009
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/2374
https://ink.library.smu.edu.sg/context/sis_research/article/3374/viewcontent/chp_3A10.1007_2F978_3_642_04180_8_23.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-3374
record_format dspace
spelling sg-smu-ink.sis_research-33742016-01-13T13:07:35Z Communication-efficient Classification in P2P Networks ANG, Hock Hee Gopalkrishnan, Vivekanand NG, Wee Keong HOI, Steven C. H. Distributed classification aims to learn with accuracy comparable to that of centralized approaches but at far lesser communication and computation costs. By nature, P2P networks provide an excellent environment for performing a distributed classification task due to the high availability of shared resources, such as bandwidth, storage space, and rich computational power. However, learning in P2P networks is faced with many challenging issues; viz., scalability, peer dynamism, asynchronism and fault-tolerance. In this paper, we address these challenges by presenting CEMPaR—a communication-efficient framework based on cascading SVMs that exploits the characteristics of DHT-based lookup protocols. CEMPaR is designed to be robust to parameters such as the number of peers in the network, imbalanced data sizes and class distribution while incurring extremely low communication cost yet maintaining accuracy comparable to the best-in-the-class approaches. Feasibility and effectiveness of our approach are demonstrated with extensive experimental studies on real and synthetic datasets. 2009-09-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2374 info:doi/10.1007/978-3-642-04180-8_23 https://ink.library.smu.edu.sg/context/sis_research/article/3374/viewcontent/chp_3A10.1007_2F978_3_642_04180_8_23.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Computer Sciences Databases and Information Systems OS and Networks
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Computer Sciences
Databases and Information Systems
OS and Networks
spellingShingle Computer Sciences
Databases and Information Systems
OS and Networks
ANG, Hock Hee
Gopalkrishnan, Vivekanand
NG, Wee Keong
HOI, Steven C. H.
Communication-efficient Classification in P2P Networks
description Distributed classification aims to learn with accuracy comparable to that of centralized approaches but at far lesser communication and computation costs. By nature, P2P networks provide an excellent environment for performing a distributed classification task due to the high availability of shared resources, such as bandwidth, storage space, and rich computational power. However, learning in P2P networks is faced with many challenging issues; viz., scalability, peer dynamism, asynchronism and fault-tolerance. In this paper, we address these challenges by presenting CEMPaR—a communication-efficient framework based on cascading SVMs that exploits the characteristics of DHT-based lookup protocols. CEMPaR is designed to be robust to parameters such as the number of peers in the network, imbalanced data sizes and class distribution while incurring extremely low communication cost yet maintaining accuracy comparable to the best-in-the-class approaches. Feasibility and effectiveness of our approach are demonstrated with extensive experimental studies on real and synthetic datasets.
format text
author ANG, Hock Hee
Gopalkrishnan, Vivekanand
NG, Wee Keong
HOI, Steven C. H.
author_facet ANG, Hock Hee
Gopalkrishnan, Vivekanand
NG, Wee Keong
HOI, Steven C. H.
author_sort ANG, Hock Hee
title Communication-efficient Classification in P2P Networks
title_short Communication-efficient Classification in P2P Networks
title_full Communication-efficient Classification in P2P Networks
title_fullStr Communication-efficient Classification in P2P Networks
title_full_unstemmed Communication-efficient Classification in P2P Networks
title_sort communication-efficient classification in p2p networks
publisher Institutional Knowledge at Singapore Management University
publishDate 2009
url https://ink.library.smu.edu.sg/sis_research/2374
https://ink.library.smu.edu.sg/context/sis_research/article/3374/viewcontent/chp_3A10.1007_2F978_3_642_04180_8_23.pdf
_version_ 1770572128263864320