SLADE: A smart large-scale task decomposer in crowdsourcing

Crowdsourcing has been shown to be effective in a wide range of applications, and is seeing increasing use. A large-scale crowdsourcing task often consists of thousands or millions of atomic tasks, each of which is usually a simple task such as binary choice or simple voting. To distribute a large-s...

Full description

Saved in:
Bibliographic Details
Main Authors: TONG, Yongxin, CHEN, Lei, ZHOU, Zimu, JAGADISH, H. V., SHOU, Lidan
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2018
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/4534
https://ink.library.smu.edu.sg/context/sis_research/article/5537/viewcontent/tkde18_tong.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-5537
record_format dspace
spelling sg-smu-ink.sis_research-55372019-12-26T09:13:13Z SLADE: A smart large-scale task decomposer in crowdsourcing TONG, Yongxin CHEN, Lei ZHOU, Zimu JAGADISH, H. V. SHOU, Lidan Crowdsourcing has been shown to be effective in a wide range of applications, and is seeing increasing use. A large-scale crowdsourcing task often consists of thousands or millions of atomic tasks, each of which is usually a simple task such as binary choice or simple voting. To distribute a large-scale crowdsourcing task to limited crowd workers, a common practice is to pack a set of atomic tasks into a task bin and send to a crowd worker in a batch. It is challenging to decompose a large-scale crowdsourcing task and execute batches of atomic tasks, which ensures reliable answers at a minimal total cost. Large batches lead to unreliable answers of atomic tasks, while small batches incur unnecessary cost. In this paper, we investigate a general crowdsourcing task decomposition problem, called the Smart Large-scAle task DEcomposer (SLADE) problem, which aims to decompose a large-scale crowdsourcing task to achieve the desired reliability at a minimal cost. We prove the NP-hardness of the SLADE problem and propose solutions in both homogeneous and heterogeneous scenarios. For the homogeneous SLADE problem, where all the atomic tasks share the same reliability requirement, we propose a greedy heuristic algorithm and an efficient and effective approximation framework using an optimal priority queue (OPQ) structure with provable approximation ratio. For the heterogeneous SLADE problem, where the atomic tasks can have different reliability requirements, we extend the OPQ-based framework leveraging a partition strategy, and also prove its approximation guarantee. Finally, we verify the effectiveness and efficiency of the proposed solutions through extensive experiments on representative crowdsourcing platforms. 2018-01-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/4534 info:doi/10.1109/TKDE.2018.2797962 https://ink.library.smu.edu.sg/context/sis_research/article/5537/viewcontent/tkde18_tong.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Crowdsourcing Task Decomposition Programming Languages and Compilers Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Crowdsourcing
Task Decomposition
Programming Languages and Compilers
Software Engineering
spellingShingle Crowdsourcing
Task Decomposition
Programming Languages and Compilers
Software Engineering
TONG, Yongxin
CHEN, Lei
ZHOU, Zimu
JAGADISH, H. V.
SHOU, Lidan
SLADE: A smart large-scale task decomposer in crowdsourcing
description Crowdsourcing has been shown to be effective in a wide range of applications, and is seeing increasing use. A large-scale crowdsourcing task often consists of thousands or millions of atomic tasks, each of which is usually a simple task such as binary choice or simple voting. To distribute a large-scale crowdsourcing task to limited crowd workers, a common practice is to pack a set of atomic tasks into a task bin and send to a crowd worker in a batch. It is challenging to decompose a large-scale crowdsourcing task and execute batches of atomic tasks, which ensures reliable answers at a minimal total cost. Large batches lead to unreliable answers of atomic tasks, while small batches incur unnecessary cost. In this paper, we investigate a general crowdsourcing task decomposition problem, called the Smart Large-scAle task DEcomposer (SLADE) problem, which aims to decompose a large-scale crowdsourcing task to achieve the desired reliability at a minimal cost. We prove the NP-hardness of the SLADE problem and propose solutions in both homogeneous and heterogeneous scenarios. For the homogeneous SLADE problem, where all the atomic tasks share the same reliability requirement, we propose a greedy heuristic algorithm and an efficient and effective approximation framework using an optimal priority queue (OPQ) structure with provable approximation ratio. For the heterogeneous SLADE problem, where the atomic tasks can have different reliability requirements, we extend the OPQ-based framework leveraging a partition strategy, and also prove its approximation guarantee. Finally, we verify the effectiveness and efficiency of the proposed solutions through extensive experiments on representative crowdsourcing platforms.
format text
author TONG, Yongxin
CHEN, Lei
ZHOU, Zimu
JAGADISH, H. V.
SHOU, Lidan
author_facet TONG, Yongxin
CHEN, Lei
ZHOU, Zimu
JAGADISH, H. V.
SHOU, Lidan
author_sort TONG, Yongxin
title SLADE: A smart large-scale task decomposer in crowdsourcing
title_short SLADE: A smart large-scale task decomposer in crowdsourcing
title_full SLADE: A smart large-scale task decomposer in crowdsourcing
title_fullStr SLADE: A smart large-scale task decomposer in crowdsourcing
title_full_unstemmed SLADE: A smart large-scale task decomposer in crowdsourcing
title_sort slade: a smart large-scale task decomposer in crowdsourcing
publisher Institutional Knowledge at Singapore Management University
publishDate 2018
url https://ink.library.smu.edu.sg/sis_research/4534
https://ink.library.smu.edu.sg/context/sis_research/article/5537/viewcontent/tkde18_tong.pdf
_version_ 1770574905946931200