Feature selection for location metonymy using augmented bag-of-words
Location metonymy resolution is a study that deals with locations being used in a non-literal way that create problems in several natural language processing tasks such as Named entity recognition and Geographical parsing. Many studies were conducted attempting to accurately classify whether the loc...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | Article |
Published: |
Institute of Electrical and Electronics Engineers
2022
|
Subjects: | |
Online Access: | http://eprints.um.edu.my/41789/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Malaya |
id |
my.um.eprints.41789 |
---|---|
record_format |
eprints |
spelling |
my.um.eprints.417892023-10-23T08:41:00Z http://eprints.um.edu.my/41789/ Feature selection for location metonymy using augmented bag-of-words Meguellati, Muhammad Elyas Mahmud, Rohana Abdul Kareem, Sameem Zeghina, Assaad Oussama Saadi, Younes QA75 Electronic computers. Computer science Location metonymy resolution is a study that deals with locations being used in a non-literal way that create problems in several natural language processing tasks such as Named entity recognition and Geographical parsing. Many studies were conducted attempting to accurately classify whether the location is used literally or metonymically, however, most of the approaches that performed well had to employ a considerable amount of resources along with complex machine learning models; those that reduced the resources experienced a decline in performance due to data sparseness. This study proposes a novel feature selection approach that uses bag-of-words and augments it with GloVe embeddings to obtain features that can be recognized based on the context of the sentence. We then implement a minimalist deep learning model making the entire classification task as light as possible. The study found that relying solely on the given datasets to identify features without depending on other external resources can achieve remarkable results despite the small size of the datasets. The results obtained from evaluating our method compared to the state-of-the-art methods show that eliminating noise based on the context notwithstanding the usage of low-cost resources has outperformed all of the previous methods with an accuracy of 99.2% on the WIMCOR dataset. Institute of Electrical and Electronics Engineers 2022 Article PeerReviewed Meguellati, Muhammad Elyas and Mahmud, Rohana and Abdul Kareem, Sameem and Zeghina, Assaad Oussama and Saadi, Younes (2022) Feature selection for location metonymy using augmented bag-of-words. IEEE Access, 10. pp. 81777-81786. ISSN 2169-3536, DOI https://doi.org/10.1109/ACCESS.2022.3195220 <https://doi.org/10.1109/ACCESS.2022.3195220>. 10.1109/ACCESS.2022.3195220 |
institution |
Universiti Malaya |
building |
UM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Malaya |
content_source |
UM Research Repository |
url_provider |
http://eprints.um.edu.my/ |
topic |
QA75 Electronic computers. Computer science |
spellingShingle |
QA75 Electronic computers. Computer science Meguellati, Muhammad Elyas Mahmud, Rohana Abdul Kareem, Sameem Zeghina, Assaad Oussama Saadi, Younes Feature selection for location metonymy using augmented bag-of-words |
description |
Location metonymy resolution is a study that deals with locations being used in a non-literal way that create problems in several natural language processing tasks such as Named entity recognition and Geographical parsing. Many studies were conducted attempting to accurately classify whether the location is used literally or metonymically, however, most of the approaches that performed well had to employ a considerable amount of resources along with complex machine learning models; those that reduced the resources experienced a decline in performance due to data sparseness. This study proposes a novel feature selection approach that uses bag-of-words and augments it with GloVe embeddings to obtain features that can be recognized based on the context of the sentence. We then implement a minimalist deep learning model making the entire classification task as light as possible. The study found that relying solely on the given datasets to identify features without depending on other external resources can achieve remarkable results despite the small size of the datasets. The results obtained from evaluating our method compared to the state-of-the-art methods show that eliminating noise based on the context notwithstanding the usage of low-cost resources has outperformed all of the previous methods with an accuracy of 99.2% on the WIMCOR dataset. |
format |
Article |
author |
Meguellati, Muhammad Elyas Mahmud, Rohana Abdul Kareem, Sameem Zeghina, Assaad Oussama Saadi, Younes |
author_facet |
Meguellati, Muhammad Elyas Mahmud, Rohana Abdul Kareem, Sameem Zeghina, Assaad Oussama Saadi, Younes |
author_sort |
Meguellati, Muhammad Elyas |
title |
Feature selection for location metonymy using augmented bag-of-words |
title_short |
Feature selection for location metonymy using augmented bag-of-words |
title_full |
Feature selection for location metonymy using augmented bag-of-words |
title_fullStr |
Feature selection for location metonymy using augmented bag-of-words |
title_full_unstemmed |
Feature selection for location metonymy using augmented bag-of-words |
title_sort |
feature selection for location metonymy using augmented bag-of-words |
publisher |
Institute of Electrical and Electronics Engineers |
publishDate |
2022 |
url |
http://eprints.um.edu.my/41789/ |
_version_ |
1781704556049072128 |