Vietnamese Semantic Role Labelling

In this paper, we study semantic role labelling (SRL), a subtask of semantic parsing of natural language sentences and its application for the Vietnamese language. We present our effort in building Vietnamese PropBank, the first Vietnamese SRL corpus and a software system for labelling semantic rol...

Full description

Saved in:
Bibliographic Details
Main Authors: Le, Hong Phuong, Pham, Thai Hoang, Pham, Xuan Khoai, Nguyen, Thi Minh Huyen, Nguyen, Thi Luong, Nguyen, Minh Hiep
Format: Article
Language:English
Published: H. : ĐHQGHN 2018
Subjects:
Online Access:http://repository.vnu.edu.vn/handle/VNU_123/62961
https://doi.org/10.25073/2588-1086/vnucsce.166
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Vietnam National University, Hanoi
Language: English
id oai:112.137.131.14:VNU_123-62961
record_format dspace
spelling oai:112.137.131.14:VNU_123-629612018-10-12T08:59:42Z Vietnamese Semantic Role Labelling Le, Hong Phuong Pham, Thai Hoang Pham, Xuan Khoai Nguyen, Thi Minh Huyen Nguyen, Thi Luong Nguyen, Minh Hiep Distributed word representation Integer linear programming Semantic role labelling Vietnamese Vietnamese PropBank In this paper, we study semantic role labelling (SRL), a subtask of semantic parsing of natural language sentences and its application for the Vietnamese language. We present our effort in building Vietnamese PropBank, the first Vietnamese SRL corpus and a software system for labelling semantic roles of Vietnamese texts. In particular, we present a novel constituent extraction algorithm in the argument candidate identification step which is more suitable and more accurate than the common node-mapping method. In the machine learning part, our system integrates distributed word features produced by two recent unsupervised learning models in two learned statistical classifiers and makes use of integer linear programming inference procedure to improve the accuracy. The system is evaluated in a series of experiments and achieves a good result, an F1 score of 74.77%. Our system, including corpus and software, is available as an open source project for free research and we believe that it is a good baseline for the development of future Vietnamese SRL systems. 2018-10-12T08:58:24Z 2018-10-12T08:58:24Z 2018 Article Le, H.P. et al. (2018). Vietnamese Semantic Role Labelling. VNU Journal of Science: Comp. Science & Com. Eng., 33(2), 39-58. 2588-1086 http://repository.vnu.edu.vn/handle/VNU_123/62961 https://doi.org/10.25073/2588-1086/vnucsce.166 en VNU Journal of Science: Comp. Science & Com. Eng.; application/pdf H. : ĐHQGHN
institution Vietnam National University, Hanoi
building VNU Library & Information Center
country Vietnam
collection VNU Digital Repository
language English
topic Distributed word representation
Integer linear programming
Semantic role labelling
Vietnamese
Vietnamese PropBank
spellingShingle Distributed word representation
Integer linear programming
Semantic role labelling
Vietnamese
Vietnamese PropBank
Le, Hong Phuong
Pham, Thai Hoang
Pham, Xuan Khoai
Nguyen, Thi Minh Huyen
Nguyen, Thi Luong
Nguyen, Minh Hiep
Vietnamese Semantic Role Labelling
description In this paper, we study semantic role labelling (SRL), a subtask of semantic parsing of natural language sentences and its application for the Vietnamese language. We present our effort in building Vietnamese PropBank, the first Vietnamese SRL corpus and a software system for labelling semantic roles of Vietnamese texts. In particular, we present a novel constituent extraction algorithm in the argument candidate identification step which is more suitable and more accurate than the common node-mapping method. In the machine learning part, our system integrates distributed word features produced by two recent unsupervised learning models in two learned statistical classifiers and makes use of integer linear programming inference procedure to improve the accuracy. The system is evaluated in a series of experiments and achieves a good result, an F1 score of 74.77%. Our system, including corpus and software, is available as an open source project for free research and we believe that it is a good baseline for the development of future Vietnamese SRL systems.
format Article
author Le, Hong Phuong
Pham, Thai Hoang
Pham, Xuan Khoai
Nguyen, Thi Minh Huyen
Nguyen, Thi Luong
Nguyen, Minh Hiep
author_facet Le, Hong Phuong
Pham, Thai Hoang
Pham, Xuan Khoai
Nguyen, Thi Minh Huyen
Nguyen, Thi Luong
Nguyen, Minh Hiep
author_sort Le, Hong Phuong
title Vietnamese Semantic Role Labelling
title_short Vietnamese Semantic Role Labelling
title_full Vietnamese Semantic Role Labelling
title_fullStr Vietnamese Semantic Role Labelling
title_full_unstemmed Vietnamese Semantic Role Labelling
title_sort vietnamese semantic role labelling
publisher H. : ĐHQGHN
publishDate 2018
url http://repository.vnu.edu.vn/handle/VNU_123/62961
https://doi.org/10.25073/2588-1086/vnucsce.166
_version_ 1680968006030589952