Text-attributed graph representation learning : Methods, applications, and challenges

Text documents are usually connected in a graph structure, resulting in an important class of data named text-attributed graph, e.g., paper citation graph and Web page hyperlink graph. On the one hand, Graph Neural Networks (GNNs) consider text in each document as general vertex attribute and do not...

Full description

Saved in:
Bibliographic Details
Main Authors: ZHANG, Ce, YANG, Menglin, YING, Rex, LAUW, Hady Wirawan
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2024
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/9841
https://ink.library.smu.edu.sg/context/sis_research/article/10841/viewcontent/webconf24tut.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-10841
record_format dspace
spelling sg-smu-ink.sis_research-108412024-12-24T03:27:51Z Text-attributed graph representation learning : Methods, applications, and challenges ZHANG, Ce YANG, Menglin YING, Rex LAUW, Hady Wirawan Text documents are usually connected in a graph structure, resulting in an important class of data named text-attributed graph, e.g., paper citation graph and Web page hyperlink graph. On the one hand, Graph Neural Networks (GNNs) consider text in each document as general vertex attribute and do not specifically deal with text data. On the other hand, Pre-trained Language Models (PLMs) and Topic Models (TMs) learn effective document embeddings. However, most models focus on text content in each single document only, ignoring link adjacency across documents. The above two challenges motivate the development of text-attributed graph representation learning, combining GNNs with PLMs and TMs into a unified model and learning document embeddings preserving both modalities, which fulfill applications, e.g., text classification, citation recommendation, question answering, etc. In this lecture-style tutorial, we will provide a systematic review of text-attributed graph, including its formal definition, recent methods, diverse applications, and challenges. Specifically, i) we will formally define text-attributed graph and briefly review GNNs, PLMs, and TMs, which are the fundamentals of some existing methods. ii) We will then revisit the technical details of text-attributed graph models, which are generally split into two categories, PLMbased and TM-based. iii) Besides, we will show diverse applications built on text-attributed graph. iv) Finally, we will discuss some challenges of existing models and propose solutions for future research. 2024-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9841 info:doi/10.1145/3589335.3641255 https://ink.library.smu.edu.sg/context/sis_research/article/10841/viewcontent/webconf24tut.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Document representation learning Graph algorithms Neural networks Pre-trained Language Models Topic models Text mining Artificial Intelligence and Robotics Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Document representation learning
Graph algorithms
Neural networks
Pre-trained Language Models
Topic models
Text mining
Artificial Intelligence and Robotics
Numerical Analysis and Scientific Computing
spellingShingle Document representation learning
Graph algorithms
Neural networks
Pre-trained Language Models
Topic models
Text mining
Artificial Intelligence and Robotics
Numerical Analysis and Scientific Computing
ZHANG, Ce
YANG, Menglin
YING, Rex
LAUW, Hady Wirawan
Text-attributed graph representation learning : Methods, applications, and challenges
description Text documents are usually connected in a graph structure, resulting in an important class of data named text-attributed graph, e.g., paper citation graph and Web page hyperlink graph. On the one hand, Graph Neural Networks (GNNs) consider text in each document as general vertex attribute and do not specifically deal with text data. On the other hand, Pre-trained Language Models (PLMs) and Topic Models (TMs) learn effective document embeddings. However, most models focus on text content in each single document only, ignoring link adjacency across documents. The above two challenges motivate the development of text-attributed graph representation learning, combining GNNs with PLMs and TMs into a unified model and learning document embeddings preserving both modalities, which fulfill applications, e.g., text classification, citation recommendation, question answering, etc. In this lecture-style tutorial, we will provide a systematic review of text-attributed graph, including its formal definition, recent methods, diverse applications, and challenges. Specifically, i) we will formally define text-attributed graph and briefly review GNNs, PLMs, and TMs, which are the fundamentals of some existing methods. ii) We will then revisit the technical details of text-attributed graph models, which are generally split into two categories, PLMbased and TM-based. iii) Besides, we will show diverse applications built on text-attributed graph. iv) Finally, we will discuss some challenges of existing models and propose solutions for future research.
format text
author ZHANG, Ce
YANG, Menglin
YING, Rex
LAUW, Hady Wirawan
author_facet ZHANG, Ce
YANG, Menglin
YING, Rex
LAUW, Hady Wirawan
author_sort ZHANG, Ce
title Text-attributed graph representation learning : Methods, applications, and challenges
title_short Text-attributed graph representation learning : Methods, applications, and challenges
title_full Text-attributed graph representation learning : Methods, applications, and challenges
title_fullStr Text-attributed graph representation learning : Methods, applications, and challenges
title_full_unstemmed Text-attributed graph representation learning : Methods, applications, and challenges
title_sort text-attributed graph representation learning : methods, applications, and challenges
publisher Institutional Knowledge at Singapore Management University
publishDate 2024
url https://ink.library.smu.edu.sg/sis_research/9841
https://ink.library.smu.edu.sg/context/sis_research/article/10841/viewcontent/webconf24tut.pdf
_version_ 1820027796738539520