Improving multimodal named entity recognition via entity span detection with unified multimodal transformer
In this paper, we study Multimodal Named Entity Recognition (MNER) for social media posts. Existing approaches for MNER mainly suffer from two drawbacks: (1) despite generating word-aware visual representations, their word representations are insensitive to the visual context; (2) most of them ignor...
Saved in:
Main Authors: | , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2020
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/5272 https://ink.library.smu.edu.sg/context/sis_research/article/6275/viewcontent/8._Improving_Multimodal_Named_Entity_Recognition_via_Entity_Span_Detection_with_United_Multimodal_Transformer__ACL2020_.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-6275 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-62752020-08-14T04:00:53Z Improving multimodal named entity recognition via entity span detection with unified multimodal transformer YU, Jianfei Jing JIANG, YANG, Li XIA, Rui In this paper, we study Multimodal Named Entity Recognition (MNER) for social media posts. Existing approaches for MNER mainly suffer from two drawbacks: (1) despite generating word-aware visual representations, their word representations are insensitive to the visual context; (2) most of them ignore the bias brought by the visual context. To tackle the first issue, we propose a multimodal interaction module to obtain both image-aware word representations and word-aware visual representations. To alleviate the visual bias, we further propose to leverage purely text-based entity span detection as an auxiliary module, and design a Unified Multimodal Transformer to guide the final predictions with the entity span predictions. Experiments show that our unified approach achieves the new state-of-the-art performance on two benchmark datasets. 2020-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5272 info:doi/10.18653/v1/2020.acl-main.306 https://ink.library.smu.edu.sg/context/sis_research/article/6275/viewcontent/8._Improving_Multimodal_Named_Entity_Recognition_via_Entity_Span_Detection_with_United_Multimodal_Transformer__ACL2020_.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Databases and Information Systems Theory and Algorithms |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Databases and Information Systems Theory and Algorithms |
spellingShingle |
Databases and Information Systems Theory and Algorithms YU, Jianfei Jing JIANG, YANG, Li XIA, Rui Improving multimodal named entity recognition via entity span detection with unified multimodal transformer |
description |
In this paper, we study Multimodal Named Entity Recognition (MNER) for social media posts. Existing approaches for MNER mainly suffer from two drawbacks: (1) despite generating word-aware visual representations, their word representations are insensitive to the visual context; (2) most of them ignore the bias brought by the visual context. To tackle the first issue, we propose a multimodal interaction module to obtain both image-aware word representations and word-aware visual representations. To alleviate the visual bias, we further propose to leverage purely text-based entity span detection as an auxiliary module, and design a Unified Multimodal Transformer to guide the final predictions with the entity span predictions. Experiments show that our unified approach achieves the new state-of-the-art performance on two benchmark datasets. |
format |
text |
author |
YU, Jianfei Jing JIANG, YANG, Li XIA, Rui |
author_facet |
YU, Jianfei Jing JIANG, YANG, Li XIA, Rui |
author_sort |
YU, Jianfei |
title |
Improving multimodal named entity recognition via entity span detection with unified multimodal transformer |
title_short |
Improving multimodal named entity recognition via entity span detection with unified multimodal transformer |
title_full |
Improving multimodal named entity recognition via entity span detection with unified multimodal transformer |
title_fullStr |
Improving multimodal named entity recognition via entity span detection with unified multimodal transformer |
title_full_unstemmed |
Improving multimodal named entity recognition via entity span detection with unified multimodal transformer |
title_sort |
improving multimodal named entity recognition via entity span detection with unified multimodal transformer |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2020 |
url |
https://ink.library.smu.edu.sg/sis_research/5272 https://ink.library.smu.edu.sg/context/sis_research/article/6275/viewcontent/8._Improving_Multimodal_Named_Entity_Recognition_via_Entity_Span_Detection_with_United_Multimodal_Transformer__ACL2020_.pdf |
_version_ |
1770575366441664512 |