Improving multimodal named entity recognition via entity span detection with unified multimodal transformer

In this paper, we study Multimodal Named Entity Recognition (MNER) for social media posts. Existing approaches for MNER mainly suffer from two drawbacks: (1) despite generating word-aware visual representations, their word representations are insensitive to the visual context; (2) most of them ignor...

Full description

Saved in:
Bibliographic Details
Main Authors: YU, Jianfei, Jing JIANG, YANG, Li, XIA, Rui
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2020
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/5272
https://ink.library.smu.edu.sg/context/sis_research/article/6275/viewcontent/8._Improving_Multimodal_Named_Entity_Recognition_via_Entity_Span_Detection_with_United_Multimodal_Transformer__ACL2020_.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-6275
record_format dspace
spelling sg-smu-ink.sis_research-62752020-08-14T04:00:53Z Improving multimodal named entity recognition via entity span detection with unified multimodal transformer YU, Jianfei Jing JIANG, YANG, Li XIA, Rui In this paper, we study Multimodal Named Entity Recognition (MNER) for social media posts. Existing approaches for MNER mainly suffer from two drawbacks: (1) despite generating word-aware visual representations, their word representations are insensitive to the visual context; (2) most of them ignore the bias brought by the visual context. To tackle the first issue, we propose a multimodal interaction module to obtain both image-aware word representations and word-aware visual representations. To alleviate the visual bias, we further propose to leverage purely text-based entity span detection as an auxiliary module, and design a Unified Multimodal Transformer to guide the final predictions with the entity span predictions. Experiments show that our unified approach achieves the new state-of-the-art performance on two benchmark datasets. 2020-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5272 info:doi/10.18653/v1/2020.acl-main.306 https://ink.library.smu.edu.sg/context/sis_research/article/6275/viewcontent/8._Improving_Multimodal_Named_Entity_Recognition_via_Entity_Span_Detection_with_United_Multimodal_Transformer__ACL2020_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Theory and Algorithms
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Databases and Information Systems
Theory and Algorithms
spellingShingle Databases and Information Systems
Theory and Algorithms
YU, Jianfei
Jing JIANG,
YANG, Li
XIA, Rui
Improving multimodal named entity recognition via entity span detection with unified multimodal transformer
description In this paper, we study Multimodal Named Entity Recognition (MNER) for social media posts. Existing approaches for MNER mainly suffer from two drawbacks: (1) despite generating word-aware visual representations, their word representations are insensitive to the visual context; (2) most of them ignore the bias brought by the visual context. To tackle the first issue, we propose a multimodal interaction module to obtain both image-aware word representations and word-aware visual representations. To alleviate the visual bias, we further propose to leverage purely text-based entity span detection as an auxiliary module, and design a Unified Multimodal Transformer to guide the final predictions with the entity span predictions. Experiments show that our unified approach achieves the new state-of-the-art performance on two benchmark datasets.
format text
author YU, Jianfei
Jing JIANG,
YANG, Li
XIA, Rui
author_facet YU, Jianfei
Jing JIANG,
YANG, Li
XIA, Rui
author_sort YU, Jianfei
title Improving multimodal named entity recognition via entity span detection with unified multimodal transformer
title_short Improving multimodal named entity recognition via entity span detection with unified multimodal transformer
title_full Improving multimodal named entity recognition via entity span detection with unified multimodal transformer
title_fullStr Improving multimodal named entity recognition via entity span detection with unified multimodal transformer
title_full_unstemmed Improving multimodal named entity recognition via entity span detection with unified multimodal transformer
title_sort improving multimodal named entity recognition via entity span detection with unified multimodal transformer
publisher Institutional Knowledge at Singapore Management University
publishDate 2020
url https://ink.library.smu.edu.sg/sis_research/5272
https://ink.library.smu.edu.sg/context/sis_research/article/6275/viewcontent/8._Improving_Multimodal_Named_Entity_Recognition_via_Entity_Span_Detection_with_United_Multimodal_Transformer__ACL2020_.pdf
_version_ 1770575366441664512