GPU-based commonsense reasoning for real-time query answering and multimodal analysis

A commonsense knowledge base is a set of facts containing the information possessed by an ordinary person. A commonsense knowledge base is also called a fundamental ontology, as it consists of very general concepts across all domains. In order to represent such a database in practice, different appr...

Full description

Saved in:
Bibliographic Details
Main Author: Tran, Ha Nguyen
Other Authors: Erik Cambria
Format: Theses and Dissertations
Language:English
Published: 2017
Subjects:
Online Access:http://hdl.handle.net/10356/72092
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-72092
record_format dspace
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Hardware::Performance and reliability
DRNTU::Engineering::Computer science and engineering::Information systems::Information systems applications
spellingShingle DRNTU::Engineering::Computer science and engineering::Hardware::Performance and reliability
DRNTU::Engineering::Computer science and engineering::Information systems::Information systems applications
Tran, Ha Nguyen
GPU-based commonsense reasoning for real-time query answering and multimodal analysis
description A commonsense knowledge base is a set of facts containing the information possessed by an ordinary person. A commonsense knowledge base is also called a fundamental ontology, as it consists of very general concepts across all domains. In order to represent such a database in practice, different approaches have been proposed in recent years. Most of them fall into either graph-based or rule-based knowledge representations. Reasoning and querying information on such kind of representations present two major implementation issues: performance and scalability, due to the fact that many new concepts (mined from the Web or learned through crowd-sourcing) are continuously integrated into the knowledge base. Some distributed computing based methods have recently been introduced to deal with those very large networks by utilizing parallelism, yet there remains the open problem of high communication costs between the participating machines. In recent years, Graphics Processing Units (GPUs) have become popular computing devices owing to their massive parallel execution power. A typical GPU device consists of hundreds of cores running simultaneously. Modern General Purpose GPUs have been successfully adopted to accelerate heavy workload tasks such as relational database joining operations, fundamental large-scale graph algorithms, and big data analytics. Encouraged by those promising results, the dissertation investigates whether and how GPUs can be leveraged to accelerate the performance of commonsense reasoning and query answering systems on large-scale networks. Firstly, to address the problem of reasoning and querying on large-scale graph-based commonsense knowledge bases, the thesis presents a GPU-friendly method, called GpSense, to solve the subgraph matching problem which is the core function of commonsense reasoning and query answering systems. Our approach is based on a novel filtering-and-joining strategy which is suitable to be implemented on massively parallel architectures. In order to optimize the performance in depth, we utilize a series of optimization techniques which contribute towards increasing GPU occupancy, reducing workload imbalances and in particular speeding up subgraph matching on commonsense graphs. To address the issue of large graphs which cannot fit into the GPU memory, we propose a multiple-level graph compression technique to reduce graph sizes while preserving all subgraph matching results. The graph compression method converts the data graph to a weighted graph which is small enough to be maintained in GPU memory. To highlight the efficiency of our solution, we perform an extensive evaluation of GpSense against state-of-the-art subgraph matching algorithms. Extensive experimental evaluations on both real and synthetic data show that our implementation scales in a linear way and outperforms current optimized CPU-based competitors. Secondly, in order to reason and retrieve information on rule-based knowledge bases, the thesis introduces gSparql, a fast and scalable inference and querying method on mass-storage RDF data with rule-based entailment regimes. Our approach accepts different rulesets and executes the reasoning process at query time when the inferred triples are determined by the set of triple patterns defined in the query. To answer SPARQL queries in parallel, we first present a query rewriting algorithm to extend the queries and also eliminate redundant triple patterns based on the rulesets. Then, we convert the execution plan into a series of primitives such as sort, merge, prefix scan, and compaction which can be efficiently done on GPU devices. To overcome the problem of triple duplication, we utilize a combination of Bloom Filter and sort-merge algorithms on the GPU. Experiment results on the LUBM dataset show that our solution outperforms the state-of-the-art Jena method on the large datasets. Finally, we utilize commonsense knowledge bases to address the problem of real-time multimodal analysis. In particular, we focus on the problem of multimodal sentiment analysis, which consists in the simultaneous analysis of different modalities, e.g., speech and video, for emotion and polarity detection. Our approach takes advantage of the massively parallel processing power of modern GPUs to enhance the performance of feature extraction from different modalities. In addition, in order to extract important textual features from multimodal sources, we generate domain-specific graphs based on commonsense knowledge and apply GPU-based graph traversal for fast feature detection. Then, powerful ELM classifiers are applied to build the sentiment analysis model based on the extracted features. We conduct our experiments on the YouTube dataset and achieve an accuracy of 78% which outperforms all previous systems. In term of processing speed, our method shows improvements of several orders of magnitude for feature extraction compared to CPU-based counterparts.
author2 Erik Cambria
author_facet Erik Cambria
Tran, Ha Nguyen
format Theses and Dissertations
author Tran, Ha Nguyen
author_sort Tran, Ha Nguyen
title GPU-based commonsense reasoning for real-time query answering and multimodal analysis
title_short GPU-based commonsense reasoning for real-time query answering and multimodal analysis
title_full GPU-based commonsense reasoning for real-time query answering and multimodal analysis
title_fullStr GPU-based commonsense reasoning for real-time query answering and multimodal analysis
title_full_unstemmed GPU-based commonsense reasoning for real-time query answering and multimodal analysis
title_sort gpu-based commonsense reasoning for real-time query answering and multimodal analysis
publishDate 2017
url http://hdl.handle.net/10356/72092
_version_ 1759858291460014080
spelling sg-ntu-dr.10356-720922023-03-04T00:51:46Z GPU-based commonsense reasoning for real-time query answering and multimodal analysis Tran, Ha Nguyen Erik Cambria School of Computer Science and Engineering DRNTU::Engineering::Computer science and engineering::Hardware::Performance and reliability DRNTU::Engineering::Computer science and engineering::Information systems::Information systems applications A commonsense knowledge base is a set of facts containing the information possessed by an ordinary person. A commonsense knowledge base is also called a fundamental ontology, as it consists of very general concepts across all domains. In order to represent such a database in practice, different approaches have been proposed in recent years. Most of them fall into either graph-based or rule-based knowledge representations. Reasoning and querying information on such kind of representations present two major implementation issues: performance and scalability, due to the fact that many new concepts (mined from the Web or learned through crowd-sourcing) are continuously integrated into the knowledge base. Some distributed computing based methods have recently been introduced to deal with those very large networks by utilizing parallelism, yet there remains the open problem of high communication costs between the participating machines. In recent years, Graphics Processing Units (GPUs) have become popular computing devices owing to their massive parallel execution power. A typical GPU device consists of hundreds of cores running simultaneously. Modern General Purpose GPUs have been successfully adopted to accelerate heavy workload tasks such as relational database joining operations, fundamental large-scale graph algorithms, and big data analytics. Encouraged by those promising results, the dissertation investigates whether and how GPUs can be leveraged to accelerate the performance of commonsense reasoning and query answering systems on large-scale networks. Firstly, to address the problem of reasoning and querying on large-scale graph-based commonsense knowledge bases, the thesis presents a GPU-friendly method, called GpSense, to solve the subgraph matching problem which is the core function of commonsense reasoning and query answering systems. Our approach is based on a novel filtering-and-joining strategy which is suitable to be implemented on massively parallel architectures. In order to optimize the performance in depth, we utilize a series of optimization techniques which contribute towards increasing GPU occupancy, reducing workload imbalances and in particular speeding up subgraph matching on commonsense graphs. To address the issue of large graphs which cannot fit into the GPU memory, we propose a multiple-level graph compression technique to reduce graph sizes while preserving all subgraph matching results. The graph compression method converts the data graph to a weighted graph which is small enough to be maintained in GPU memory. To highlight the efficiency of our solution, we perform an extensive evaluation of GpSense against state-of-the-art subgraph matching algorithms. Extensive experimental evaluations on both real and synthetic data show that our implementation scales in a linear way and outperforms current optimized CPU-based competitors. Secondly, in order to reason and retrieve information on rule-based knowledge bases, the thesis introduces gSparql, a fast and scalable inference and querying method on mass-storage RDF data with rule-based entailment regimes. Our approach accepts different rulesets and executes the reasoning process at query time when the inferred triples are determined by the set of triple patterns defined in the query. To answer SPARQL queries in parallel, we first present a query rewriting algorithm to extend the queries and also eliminate redundant triple patterns based on the rulesets. Then, we convert the execution plan into a series of primitives such as sort, merge, prefix scan, and compaction which can be efficiently done on GPU devices. To overcome the problem of triple duplication, we utilize a combination of Bloom Filter and sort-merge algorithms on the GPU. Experiment results on the LUBM dataset show that our solution outperforms the state-of-the-art Jena method on the large datasets. Finally, we utilize commonsense knowledge bases to address the problem of real-time multimodal analysis. In particular, we focus on the problem of multimodal sentiment analysis, which consists in the simultaneous analysis of different modalities, e.g., speech and video, for emotion and polarity detection. Our approach takes advantage of the massively parallel processing power of modern GPUs to enhance the performance of feature extraction from different modalities. In addition, in order to extract important textual features from multimodal sources, we generate domain-specific graphs based on commonsense knowledge and apply GPU-based graph traversal for fast feature detection. Then, powerful ELM classifiers are applied to build the sentiment analysis model based on the extracted features. We conduct our experiments on the YouTube dataset and achieve an accuracy of 78% which outperforms all previous systems. In term of processing speed, our method shows improvements of several orders of magnitude for feature extraction compared to CPU-based counterparts. Doctor of Philosophy (SCE) 2017-05-25T03:50:19Z 2017-05-25T03:50:19Z 2017 Thesis Tran, H. N. (2017). GPU-based commonsense reasoning for real-time query answering and multimodal analysis. Doctoral thesis, Nanyang Technological University, Singapore. http://hdl.handle.net/10356/72092 10.32657/10356/72092 en 135 p. application/pdf