CAUSAL RELATIONSHIP IDENTIFICATION SYSTEM ON INDONESIAN TEXT PROCESSING USING SEMANTIC TEMPLATE APPROACH
The objective of natural language processing is to transform complex information into brief information, comprehensible and able to be used as an analysis reference for certain domain, i.e. health domain. Information that frequently utilized on health domain is an information about causality or caus...
Saved in:
Main Author: | |
---|---|
Format: | Dissertations |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/40514 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:40514 |
---|---|
spelling |
id-itb.:405142019-07-03T11:15:31ZCAUSAL RELATIONSHIP IDENTIFICATION SYSTEM ON INDONESIAN TEXT PROCESSING USING SEMANTIC TEMPLATE APPROACH Bagas Bhaskoro, Susetyo Indonesia Dissertations semantic template, single sentences, multiple sentences, medical element annotation, paragraph annotation, semantic annotation, causality. INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/40514 The objective of natural language processing is to transform complex information into brief information, comprehensible and able to be used as an analysis reference for certain domain, i.e. health domain. Information that frequently utilized on health domain is an information about causality or cause-effect which is a knowledge that is built on particular facts that have <cause> meaning and is concluded as general facts that have <effect> meaning, and between those two meanings have a relationship that explains each other. If causal relationship is being formulated, it turns into function , where is <cause> meaning, and is <effect> meaning, so it will be . The issue is not all of <cause> and <effect> meanings are explained in simple sentence ot single sentences, but there is also possibility for <cause> and <effect> meanings to be explained in complex sentence or multiple sentences that the position itself is at the beginning of sentence or at the end of sentence. Furthermore, <cause> and <effect> meanings are not always be abstracted explicitly but also implicitly. Research's aim is to generate semantic template method, which is method to identify <cause> and <effect> meanings on simple sentence and complex sentence. Current developed of semantic template method is focusing on template using for identify causal relationship that has no dependency on certain topic, but variety of textual topic. Proposal contained in semantic template is considering <feature selection>, <word position>, <word meaning>, <frequency of word occurence>, <root dynamic> and use of <template rule>. This research using dataset from collection of causal relationship sentences found in some publications and online articles as many as 1.017 sentences for training and testing. Identification system performance testing is divided into two evaluation cateogries, extrinsic and intrinsic testing. Performance evaluation results of semantic template method on single sentences are 0,874; 0,747 and 0,803, meanwhile results on multiple sentences are 0,877; 0,815 and 0,845 for each recall, precision and f-measure, meanwhile error rate is 0.280. Evaluation result for information accuracy quality is 0.719, meanwhile for precision is 0.781 and for completeness is 0.953. Generated semantic template method that has been successfully implemented on health domain is public health surveillance system. During implementation, there are several proposals have been generated, i.e. natural language transformation into medical element annotation (LPpAJSFPePnGOD) to identify causal relationship pattern, paragraph annotation pattern to classify sentence position on medical article paragraph and build a semantic relationship pattern for semantic annotation. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
The objective of natural language processing is to transform complex information into brief information, comprehensible and able to be used as an analysis reference for certain domain, i.e. health domain. Information that frequently utilized on health domain is an information about causality or cause-effect which is a knowledge that is built on particular facts that have <cause> meaning and is concluded as general facts that have <effect> meaning, and between those two meanings have a relationship that explains each other. If causal relationship is being formulated, it turns into function , where is <cause> meaning, and is <effect> meaning, so it will be . The issue is not all of <cause> and <effect> meanings are explained in simple sentence ot single sentences, but there is also possibility for <cause> and <effect> meanings to be explained in complex sentence or multiple sentences that the position itself is at the beginning of sentence or at the end of sentence. Furthermore, <cause> and <effect> meanings are not always be abstracted explicitly but also implicitly.
Research's aim is to generate semantic template method, which is method to identify <cause> and <effect> meanings on simple sentence and complex sentence. Current developed of semantic template method is focusing on template using for identify causal relationship that has no dependency on certain topic, but variety of textual topic. Proposal contained in semantic template is considering <feature selection>, <word position>, <word meaning>, <frequency of word occurence>, <root dynamic> and use of <template rule>. This research using dataset from collection of causal relationship sentences found in some publications and online articles as many as 1.017 sentences for training and testing. Identification system performance testing is divided into two evaluation cateogries, extrinsic and intrinsic testing. Performance evaluation results of semantic template method on single sentences are 0,874; 0,747 and 0,803, meanwhile results on multiple sentences are 0,877; 0,815 and 0,845 for each recall, precision and f-measure, meanwhile error rate is 0.280. Evaluation result for information accuracy quality is 0.719, meanwhile for precision is 0.781 and for completeness is 0.953.
Generated semantic template method that has been successfully implemented on health domain is public health surveillance system. During implementation, there are several proposals have been generated, i.e. natural language transformation into medical element
annotation (LPpAJSFPePnGOD) to identify causal relationship pattern, paragraph annotation pattern to classify sentence position on medical article paragraph and build a semantic relationship pattern for semantic annotation. |
format |
Dissertations |
author |
Bagas Bhaskoro, Susetyo |
spellingShingle |
Bagas Bhaskoro, Susetyo CAUSAL RELATIONSHIP IDENTIFICATION SYSTEM ON INDONESIAN TEXT PROCESSING USING SEMANTIC TEMPLATE APPROACH |
author_facet |
Bagas Bhaskoro, Susetyo |
author_sort |
Bagas Bhaskoro, Susetyo |
title |
CAUSAL RELATIONSHIP IDENTIFICATION SYSTEM ON INDONESIAN TEXT PROCESSING USING SEMANTIC TEMPLATE APPROACH |
title_short |
CAUSAL RELATIONSHIP IDENTIFICATION SYSTEM ON INDONESIAN TEXT PROCESSING USING SEMANTIC TEMPLATE APPROACH |
title_full |
CAUSAL RELATIONSHIP IDENTIFICATION SYSTEM ON INDONESIAN TEXT PROCESSING USING SEMANTIC TEMPLATE APPROACH |
title_fullStr |
CAUSAL RELATIONSHIP IDENTIFICATION SYSTEM ON INDONESIAN TEXT PROCESSING USING SEMANTIC TEMPLATE APPROACH |
title_full_unstemmed |
CAUSAL RELATIONSHIP IDENTIFICATION SYSTEM ON INDONESIAN TEXT PROCESSING USING SEMANTIC TEMPLATE APPROACH |
title_sort |
causal relationship identification system on indonesian text processing using semantic template approach |
url |
https://digilib.itb.ac.id/gdl/view/40514 |
_version_ |
1821998116817076224 |