On truth finding in multi-agent networks
In this dissertation, an event is a phenomenon of interest and our goal is to infer the states of events (e.g., articles, movies, and signals) through biased observations of multiple agents (e.g., sensors and individuals) in a network. The local bias of each agent is unknown and affects the agent...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Doctor of Philosophy |
Language: | English |
Published: |
Nanyang Technological University
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/137376 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Summary: | In this dissertation, an event is a phenomenon of interest and our goal is to infer the states of events (e.g., articles, movies, and signals) through biased observations of multiple agents (e.g., sensors and individuals) in a network. The local bias of each agent is unknown and affects the agent's observation of the event true states. We call this "truth finding" and discuss the following three cases of the problem: each sequence of events is temporally correlated, agents are nodes of a social network, and the observation of each event is a graph signal.
In the case where each sequence of events is temporally correlated, we consider the application of blind calibration of a sensor (agent) network where the sensor gains, offsets and the ground truth values of signals (sequences of events) are estimated from noisy observations. This is in general a non-identifiable problem, unless restrictive assumptions on the signal subspace or sensor observations are imposed. We show that if each signal observed by the sensors follows a known dynamic model with additive noise, then the sensor gains and offsets are identifiable. We propose a dynamic Bayesian network model to infer the sensors' gains and offsets. Our model allows different sensor clusters to observe different unknown signals, without knowing the sensor clusters a priori. We develop an offline algorithm using block Gibbs sampling and a linearized forward filtering backward sampling method that estimates the sensor clusters, gains and offsets jointly. Furthermore, for practical implementation, we also propose an online inference algorithm based on particle filtering and local Markov chain Monte Carlo. Simulations using a synthetic dataset, and experiments on two real datasets suggest that our proposed methods perform better than several other blind calibration methods, including a sparse Bayesian learning approach, and methods that first cluster the sensor observations and then estimate the gains and offsets.
In the case where agents are nodes of a social network, we investigate the application of truth discovery based on opinions from multiple agents who may be unreliable or biased. We consider the case where agents' reliabilities or biases are correlated if they belong to the same community, which defines a group of agents with similar opinions regarding a particular event. We incorporate knowledge of the agents' social network in our truth discovery framework.
When the observation model (i.e., the relationship between agent reliabilities, event truths, and agent opinions) is known, we develop Laplace variational inference methods (VISIT) to estimate agents' reliabilities, communities, and the event states. We also develop a stochastic variational inference method to scale our model to large social networks. Simulations and experiments on real data suggest that when observations are sparse, our proposed methods perform better than several other inference methods, including majority voting, TruthFinder, AccuSim, the Confidence-Aware Truth Discovery method, the Bayesian Classifier Combination (BCC) method, and the Community BCC method.
When the observation model is unknown, we use an autoencoder to learn the observation model. A Bayesian network model is proposed to guide the learning of the autoencoder by modeling the dependence of agent reliabilities corresponding to different data samples. At the same time, it also models the social relationships between agents in the network. The proposed approach is unsupervised and is applicable when ground truth labels of events are unavailable. A variational inference method is used to jointly estimate the hidden variables in the Bayesian network and the parameters in the autoencoder. Simulations and experiments on real data suggest that the proposed method performs better than several other inference methods, including majority voting, the Bayesian Classifier Combination (BCC) method, the Community BCC method, and the VISIT method.
In the case where the observation of each event is a graph signal, we develop a graph convolution network (GCN) method. GCN has been extensively studied in the literature and one major direction is based on spectral graph theory and graph signal processing. In this thesis, we study the problem from a completely different perspective, by introducing parallel flow decomposition of graphs. The essential idea is to decompose a graph into families of non-intersecting one dimensional (1D) paths, after which, we apply a 1D CNN along each family of paths. Our method is tested on a news article classification dataset in which each article is a graph with keywords as nodes and similarities between keywords as edges. We demonstrate our method, which we call GFCN, achieves better performance than baseline methods. |
---|