Sequential crowdsourcing and recommendation strategies

Crowdsourcing has emerged as an online, distributed problem-solving and production model where many participating workers are involved in performing tasks collaboratively in domains such as gathering language-related data, labeling the content of an image, or voting. In modern online platforms, incl...

Full description

Saved in:

Bibliographic Details
Main Author:	Kang, Qiyu
Other Authors:	Tay Wee Peng
Format:	Thesis-Doctor of Philosophy
Language:	English
Published:	Nanyang Technological University 2020
Subjects:	Engineering::Electrical and electronic engineering Engineering::Computer science and engineering::Computer applications::Social and behavioral sciences
Online Access:	https://hdl.handle.net/10356/137199
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-137199
record_format	dspace
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Electrical and electronic engineering Engineering::Computer science and engineering::Computer applications::Social and behavioral sciences
spellingShingle	Engineering::Electrical and electronic engineering Engineering::Computer science and engineering::Computer applications::Social and behavioral sciences Kang, Qiyu Sequential crowdsourcing and recommendation strategies
description	Crowdsourcing has emerged as an online, distributed problem-solving and production model where many participating workers are involved in performing tasks collaboratively in domains such as gathering language-related data, labeling the content of an image, or voting. In modern online platforms, including crowdsourcing, recommendation systems play an essential role and have become indispensable. To provide better service in crowdsourcing and recommendation, in this thesis, we study sequential crowdsourcing and recommendation strategies where historical data will be explored when making decisions at the current time step. Firstly, we consider a multi-class labeling problem in crowdsourcing. As workers may be unreliable, we propose to perform sequential questioning in which the questions posed to the workers are designed based on previous questions and answers. We propose a partially-observable Markov decision process (POMDP) framework to determine the best questioning strategy, subject to the crowdsourcer's budget constraint. As this POMDP formulation is in general intractable, we develop a suboptimal approach based on a $q$-ary Ulam-R\'enyi game. We also propose a sampling heuristic, which can be used in tandem with standard POMDP solvers, using our Ulam-R\'enyi strategy. We demonstrate through simulations that our approaches outperform a non-sequential strategy based on error correction coding and which does not utilize workers' previous responses. Secondly, we consider a sequential task recommendation problem in a crowdsourcing platform where assigning tasks more likely to be accepted by a worker who is more likely to complete it reliably results in better performance for the crowdsourcer. Without prior information about a worker, his preferences and reliabilities need to be learned over time. We propose a multi-armed bandit (MAB) framework to sequentially learn a worker's preferences and reliabilities for different categories of tasks. However, unlike the classical MAB problem, the reward from the worker's completion of a task is unobservable. We therefore include the use of gold tasks (i.e., tasks whose solutions are known \emph{a priori} and which do not produce any rewards) in our task recommendation procedure. Our model could be viewed as a new variant of MAB, in which the random rewards can only be observed at those time steps where gold tasks are used, and the accuracy of estimating the expected reward of recommending a task to a worker depends on the number of gold tasks used. We show that the optimal regret is $O(\sqrt{n})$, where $n$ is the number of time steps. We develop three task recommendation strategies to determine the number of gold tasks for different task categories, and show that they are order optimal. Simulations verify the efficiency of our approaches. Finally, we propose a linear stochastic bandit formulation to model sequential non-discriminatory recommendations. Recommendation systems targeting at finding the best matching between items and users without considering ``fairness'' have been reported as generating discriminatory recommendations from users’ perspectives. In principle, to avoid discrimination, protected attributes such as race or gender should not be used in recommendation algorithms. However, directly discarding the protected attributes will potentially leave lingering discrimination in the large bias error of underfitted modelling without enough parameters. We propose to exclude the individual’s biases and maximize the (expected) cumulative reward like users’ clicks or ratings over a subspace of item attributes, based on the reward observed for the full space. We call this orthogonal projection in linear bandits. We propose sequential non-discriminatory recommendation strategies and prove that they achieve sublinear cumulative projection regret.
author2	Tay Wee Peng
author_facet	Tay Wee Peng Kang, Qiyu
format	Thesis-Doctor of Philosophy
author	Kang, Qiyu
author_sort	Kang, Qiyu
title	Sequential crowdsourcing and recommendation strategies
title_short	Sequential crowdsourcing and recommendation strategies
title_full	Sequential crowdsourcing and recommendation strategies
title_fullStr	Sequential crowdsourcing and recommendation strategies
title_full_unstemmed	Sequential crowdsourcing and recommendation strategies
title_sort	sequential crowdsourcing and recommendation strategies
publisher	Nanyang Technological University
publishDate	2020
url	https://hdl.handle.net/10356/137199
_version_	1772826569867264000
spelling	sg-ntu-dr.10356-1371992023-07-04T17:17:58Z Sequential crowdsourcing and recommendation strategies Kang, Qiyu Tay Wee Peng School of Electrical and Electronic Engineering wptay@ntu.edu.sg Engineering::Electrical and electronic engineering Engineering::Computer science and engineering::Computer applications::Social and behavioral sciences Crowdsourcing has emerged as an online, distributed problem-solving and production model where many participating workers are involved in performing tasks collaboratively in domains such as gathering language-related data, labeling the content of an image, or voting. In modern online platforms, including crowdsourcing, recommendation systems play an essential role and have become indispensable. To provide better service in crowdsourcing and recommendation, in this thesis, we study sequential crowdsourcing and recommendation strategies where historical data will be explored when making decisions at the current time step. Firstly, we consider a multi-class labeling problem in crowdsourcing. As workers may be unreliable, we propose to perform sequential questioning in which the questions posed to the workers are designed based on previous questions and answers. We propose a partially-observable Markov decision process (POMDP) framework to determine the best questioning strategy, subject to the crowdsourcer's budget constraint. As this POMDP formulation is in general intractable, we develop a suboptimal approach based on a $q$-ary Ulam-R\'enyi game. We also propose a sampling heuristic, which can be used in tandem with standard POMDP solvers, using our Ulam-R\'enyi strategy. We demonstrate through simulations that our approaches outperform a non-sequential strategy based on error correction coding and which does not utilize workers' previous responses. Secondly, we consider a sequential task recommendation problem in a crowdsourcing platform where assigning tasks more likely to be accepted by a worker who is more likely to complete it reliably results in better performance for the crowdsourcer. Without prior information about a worker, his preferences and reliabilities need to be learned over time. We propose a multi-armed bandit (MAB) framework to sequentially learn a worker's preferences and reliabilities for different categories of tasks. However, unlike the classical MAB problem, the reward from the worker's completion of a task is unobservable. We therefore include the use of gold tasks (i.e., tasks whose solutions are known \emph{a priori} and which do not produce any rewards) in our task recommendation procedure. Our model could be viewed as a new variant of MAB, in which the random rewards can only be observed at those time steps where gold tasks are used, and the accuracy of estimating the expected reward of recommending a task to a worker depends on the number of gold tasks used. We show that the optimal regret is $O(\sqrt{n})$, where $n$ is the number of time steps. We develop three task recommendation strategies to determine the number of gold tasks for different task categories, and show that they are order optimal. Simulations verify the efficiency of our approaches. Finally, we propose a linear stochastic bandit formulation to model sequential non-discriminatory recommendations. Recommendation systems targeting at finding the best matching between items and users without considering ``fairness'' have been reported as generating discriminatory recommendations from users’ perspectives. In principle, to avoid discrimination, protected attributes such as race or gender should not be used in recommendation algorithms. However, directly discarding the protected attributes will potentially leave lingering discrimination in the large bias error of underfitted modelling without enough parameters. We propose to exclude the individual’s biases and maximize the (expected) cumulative reward like users’ clicks or ratings over a subspace of item attributes, based on the reward observed for the full space. We call this orthogonal projection in linear bandits. We propose sequential non-discriminatory recommendation strategies and prove that they achieve sublinear cumulative projection regret. Doctor of Philosophy 2020-03-06T02:43:55Z 2020-03-06T02:43:55Z 2020 Thesis-Doctor of Philosophy Kang, Q. (2020). Sequential crowdsourcing and recommendation strategies. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/137199 10.32657/10356/137199 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University

Sequential crowdsourcing and recommendation strategies

Similar Items