Text-attributed graph representation learning : Methods, applications, and challenges
Text documents are usually connected in a graph structure, resulting in an important class of data named text-attributed graph, e.g., paper citation graph and Web page hyperlink graph. On the one hand, Graph Neural Networks (GNNs) consider text in each document as general vertex attribute and do not...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2024
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/9841 https://ink.library.smu.edu.sg/context/sis_research/article/10841/viewcontent/webconf24tut.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-10841 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-108412024-12-24T03:27:51Z Text-attributed graph representation learning : Methods, applications, and challenges ZHANG, Ce YANG, Menglin YING, Rex LAUW, Hady Wirawan Text documents are usually connected in a graph structure, resulting in an important class of data named text-attributed graph, e.g., paper citation graph and Web page hyperlink graph. On the one hand, Graph Neural Networks (GNNs) consider text in each document as general vertex attribute and do not specifically deal with text data. On the other hand, Pre-trained Language Models (PLMs) and Topic Models (TMs) learn effective document embeddings. However, most models focus on text content in each single document only, ignoring link adjacency across documents. The above two challenges motivate the development of text-attributed graph representation learning, combining GNNs with PLMs and TMs into a unified model and learning document embeddings preserving both modalities, which fulfill applications, e.g., text classification, citation recommendation, question answering, etc. In this lecture-style tutorial, we will provide a systematic review of text-attributed graph, including its formal definition, recent methods, diverse applications, and challenges. Specifically, i) we will formally define text-attributed graph and briefly review GNNs, PLMs, and TMs, which are the fundamentals of some existing methods. ii) We will then revisit the technical details of text-attributed graph models, which are generally split into two categories, PLMbased and TM-based. iii) Besides, we will show diverse applications built on text-attributed graph. iv) Finally, we will discuss some challenges of existing models and propose solutions for future research. 2024-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9841 info:doi/10.1145/3589335.3641255 https://ink.library.smu.edu.sg/context/sis_research/article/10841/viewcontent/webconf24tut.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Document representation learning Graph algorithms Neural networks Pre-trained Language Models Topic models Text mining Artificial Intelligence and Robotics Numerical Analysis and Scientific Computing |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Document representation learning Graph algorithms Neural networks Pre-trained Language Models Topic models Text mining Artificial Intelligence and Robotics Numerical Analysis and Scientific Computing |
spellingShingle |
Document representation learning Graph algorithms Neural networks Pre-trained Language Models Topic models Text mining Artificial Intelligence and Robotics Numerical Analysis and Scientific Computing ZHANG, Ce YANG, Menglin YING, Rex LAUW, Hady Wirawan Text-attributed graph representation learning : Methods, applications, and challenges |
description |
Text documents are usually connected in a graph structure, resulting in an important class of data named text-attributed graph, e.g., paper citation graph and Web page hyperlink graph. On the one hand, Graph Neural Networks (GNNs) consider text in each document as general vertex attribute and do not specifically deal with text data. On the other hand, Pre-trained Language Models (PLMs) and Topic Models (TMs) learn effective document embeddings. However, most models focus on text content in each single document only, ignoring link adjacency across documents. The above two challenges motivate the development of text-attributed graph representation learning, combining GNNs with PLMs and TMs into a unified model and learning document embeddings preserving both modalities, which fulfill applications, e.g., text classification, citation recommendation, question answering, etc. In this lecture-style tutorial, we will provide a systematic review of text-attributed graph, including its formal definition, recent methods, diverse applications, and challenges. Specifically, i) we will formally define text-attributed graph and briefly review GNNs, PLMs, and TMs, which are the fundamentals of some existing methods. ii) We will then revisit the technical details of text-attributed graph models, which are generally split into two categories, PLMbased and TM-based. iii) Besides, we will show diverse applications built on text-attributed graph. iv) Finally, we will discuss some challenges of existing models and propose solutions for future research. |
format |
text |
author |
ZHANG, Ce YANG, Menglin YING, Rex LAUW, Hady Wirawan |
author_facet |
ZHANG, Ce YANG, Menglin YING, Rex LAUW, Hady Wirawan |
author_sort |
ZHANG, Ce |
title |
Text-attributed graph representation learning : Methods, applications, and challenges |
title_short |
Text-attributed graph representation learning : Methods, applications, and challenges |
title_full |
Text-attributed graph representation learning : Methods, applications, and challenges |
title_fullStr |
Text-attributed graph representation learning : Methods, applications, and challenges |
title_full_unstemmed |
Text-attributed graph representation learning : Methods, applications, and challenges |
title_sort |
text-attributed graph representation learning : methods, applications, and challenges |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2024 |
url |
https://ink.library.smu.edu.sg/sis_research/9841 https://ink.library.smu.edu.sg/context/sis_research/article/10841/viewcontent/webconf24tut.pdf |
_version_ |
1820027796738539520 |