semantic relation classification, cnn, oversampling, binary relevance

Relation classification (RC) in the music domain is used to identify the relationship between two named entities (NE) automatically. The relations are used to update the music knowledge database which is then used in various applications such as related music information, music search engines, mu...

Full description

Saved in:
Bibliographic Details
Main Author: Insan Al-Amin, Muhammad
Format: Theses
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/40466
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:40466
spelling id-itb.:404662019-07-02T16:16:18Zsemantic relation classification, cnn, oversampling, binary relevance Insan Al-Amin, Muhammad Indonesia Theses RELATION CLASSIFICATION FOR INDONESIAN MUSIC NEWS SENTENCES INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/40466 Relation classification (RC) in the music domain is used to identify the relationship between two named entities (NE) automatically. The relations are used to update the music knowledge database which is then used in various applications such as related music information, music search engines, music recommendations, automatic generation of music news, and music encyclopedias. The purpose of this thesis is to produce RC models for Indonesian language music news. This thesis also produced an SRC Indonesian language corpus for the music domain. The hypothesis of this research is, making RC models using CNN will performs better than the baseline method, Support Vector Machine (SVM) and shallow learning when applied to Indonesian music news. A Convolutional Neural Network (CNN) architecture is used to train RC models using word vector representations as input. Oversampling methods such as SMOTE and word perturbation are used for handling unbalanced data. Multilabel problems are solved using the Binary Relevance method. This research only addresses relations between two named entities in a single sentence. Two forms of RC classification modeled in this research are single-label, and multilabel classification. NER modeling is not part of this research. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description Relation classification (RC) in the music domain is used to identify the relationship between two named entities (NE) automatically. The relations are used to update the music knowledge database which is then used in various applications such as related music information, music search engines, music recommendations, automatic generation of music news, and music encyclopedias. The purpose of this thesis is to produce RC models for Indonesian language music news. This thesis also produced an SRC Indonesian language corpus for the music domain. The hypothesis of this research is, making RC models using CNN will performs better than the baseline method, Support Vector Machine (SVM) and shallow learning when applied to Indonesian music news. A Convolutional Neural Network (CNN) architecture is used to train RC models using word vector representations as input. Oversampling methods such as SMOTE and word perturbation are used for handling unbalanced data. Multilabel problems are solved using the Binary Relevance method. This research only addresses relations between two named entities in a single sentence. Two forms of RC classification modeled in this research are single-label, and multilabel classification. NER modeling is not part of this research.
format Theses
author Insan Al-Amin, Muhammad
spellingShingle Insan Al-Amin, Muhammad
semantic relation classification, cnn, oversampling, binary relevance
author_facet Insan Al-Amin, Muhammad
author_sort Insan Al-Amin, Muhammad
title semantic relation classification, cnn, oversampling, binary relevance
title_short semantic relation classification, cnn, oversampling, binary relevance
title_full semantic relation classification, cnn, oversampling, binary relevance
title_fullStr semantic relation classification, cnn, oversampling, binary relevance
title_full_unstemmed semantic relation classification, cnn, oversampling, binary relevance
title_sort semantic relation classification, cnn, oversampling, binary relevance
url https://digilib.itb.ac.id/gdl/view/40466
_version_ 1822925759313870848