SEMANTIC ROLE LABELING FOR GENERATING TEMPLATE OF INDONESIAN NEWS SENTENCES
News templates have been widely used in automatic news generation. Indrayani and Khodra (2018) has generated Indonesian news automatically with manually defined templates. However, automated generation of Indonesian news sentences template had never been done before. Semantic Role Labeling (SRL)...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/40102 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
Summary: | News templates have been widely used in automatic news generation. Indrayani and Khodra
(2018) has generated Indonesian news automatically with manually defined templates. However,
automated generation of Indonesian news sentences template had never been done before.
Semantic Role Labeling (SRL) is a process of giving label to a word or phrase in a sentence
according to its semantic role based on the sentence’s predicate. In this final project, a system for
automatic template generation using SRL will be built.
A dataset for Indonesian semantic role labeling does not exist publicly, so collecting Indonesian
sentences and manually labeling them based on PropBank (Martha and Palmer, 2005) will be done.
The SRL model is built based on the research by He et al. (2017). This final project will use
BiLSTM with highway connection, dropout, recurrent dropout, and hard constraints to get the best
configuration. The result from SRL will be saved as a template and the difference between
automatically generated template and manually generated template will be analyzed.
The best configuration, 2 layer BiLSTM with dropout, recurrent dropout, and hard constraints,
managed to get F1 score of 0.92 for token level and 0.84 for sentence level. The automatically
generated templates also possess similar quality with manually generated templates. |
---|