Feature selection for location metonymy using augmented bag-of-words

Location metonymy resolution is a study that deals with locations being used in a non-literal way that create problems in several natural language processing tasks such as Named entity recognition and Geographical parsing. Many studies were conducted attempting to accurately classify whether the loc...

Full description

Saved in:
Bibliographic Details
Main Authors: Meguellati, Muhammad Elyas, Mahmud, Rohana, Abdul Kareem, Sameem, Zeghina, Assaad Oussama, Saadi, Younes
Format: Article
Published: Institute of Electrical and Electronics Engineers 2022
Subjects:
Online Access:http://eprints.um.edu.my/41789/
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Universiti Malaya
id my.um.eprints.41789
record_format eprints
spelling my.um.eprints.417892023-10-23T08:41:00Z http://eprints.um.edu.my/41789/ Feature selection for location metonymy using augmented bag-of-words Meguellati, Muhammad Elyas Mahmud, Rohana Abdul Kareem, Sameem Zeghina, Assaad Oussama Saadi, Younes QA75 Electronic computers. Computer science Location metonymy resolution is a study that deals with locations being used in a non-literal way that create problems in several natural language processing tasks such as Named entity recognition and Geographical parsing. Many studies were conducted attempting to accurately classify whether the location is used literally or metonymically, however, most of the approaches that performed well had to employ a considerable amount of resources along with complex machine learning models; those that reduced the resources experienced a decline in performance due to data sparseness. This study proposes a novel feature selection approach that uses bag-of-words and augments it with GloVe embeddings to obtain features that can be recognized based on the context of the sentence. We then implement a minimalist deep learning model making the entire classification task as light as possible. The study found that relying solely on the given datasets to identify features without depending on other external resources can achieve remarkable results despite the small size of the datasets. The results obtained from evaluating our method compared to the state-of-the-art methods show that eliminating noise based on the context notwithstanding the usage of low-cost resources has outperformed all of the previous methods with an accuracy of 99.2% on the WIMCOR dataset. Institute of Electrical and Electronics Engineers 2022 Article PeerReviewed Meguellati, Muhammad Elyas and Mahmud, Rohana and Abdul Kareem, Sameem and Zeghina, Assaad Oussama and Saadi, Younes (2022) Feature selection for location metonymy using augmented bag-of-words. IEEE Access, 10. pp. 81777-81786. ISSN 2169-3536, DOI https://doi.org/10.1109/ACCESS.2022.3195220 <https://doi.org/10.1109/ACCESS.2022.3195220>. 10.1109/ACCESS.2022.3195220
institution Universiti Malaya
building UM Library
collection Institutional Repository
continent Asia
country Malaysia
content_provider Universiti Malaya
content_source UM Research Repository
url_provider http://eprints.um.edu.my/
topic QA75 Electronic computers. Computer science
spellingShingle QA75 Electronic computers. Computer science
Meguellati, Muhammad Elyas
Mahmud, Rohana
Abdul Kareem, Sameem
Zeghina, Assaad Oussama
Saadi, Younes
Feature selection for location metonymy using augmented bag-of-words
description Location metonymy resolution is a study that deals with locations being used in a non-literal way that create problems in several natural language processing tasks such as Named entity recognition and Geographical parsing. Many studies were conducted attempting to accurately classify whether the location is used literally or metonymically, however, most of the approaches that performed well had to employ a considerable amount of resources along with complex machine learning models; those that reduced the resources experienced a decline in performance due to data sparseness. This study proposes a novel feature selection approach that uses bag-of-words and augments it with GloVe embeddings to obtain features that can be recognized based on the context of the sentence. We then implement a minimalist deep learning model making the entire classification task as light as possible. The study found that relying solely on the given datasets to identify features without depending on other external resources can achieve remarkable results despite the small size of the datasets. The results obtained from evaluating our method compared to the state-of-the-art methods show that eliminating noise based on the context notwithstanding the usage of low-cost resources has outperformed all of the previous methods with an accuracy of 99.2% on the WIMCOR dataset.
format Article
author Meguellati, Muhammad Elyas
Mahmud, Rohana
Abdul Kareem, Sameem
Zeghina, Assaad Oussama
Saadi, Younes
author_facet Meguellati, Muhammad Elyas
Mahmud, Rohana
Abdul Kareem, Sameem
Zeghina, Assaad Oussama
Saadi, Younes
author_sort Meguellati, Muhammad Elyas
title Feature selection for location metonymy using augmented bag-of-words
title_short Feature selection for location metonymy using augmented bag-of-words
title_full Feature selection for location metonymy using augmented bag-of-words
title_fullStr Feature selection for location metonymy using augmented bag-of-words
title_full_unstemmed Feature selection for location metonymy using augmented bag-of-words
title_sort feature selection for location metonymy using augmented bag-of-words
publisher Institute of Electrical and Electronics Engineers
publishDate 2022
url http://eprints.um.edu.my/41789/
_version_ 1781704556049072128