Constructing a word similarity graph from vector based word representation for named entity recognition

In this paper, we discuss a method for identifying a seed word that would best represent a class of named entities in a graphical representation of words and their similarities. Word networks, or word graphs, are representations of vectorized text where nodes are the words encountered in a corpus, a...

Full description

Saved in:
Bibliographic Details
Main Authors: Feria, Miguel, Balbin, Juan Paolo Santos, Bautista, Francis Michael
Format: text
Published: Animo Repository 2018
Subjects:
Online Access:https://animorepository.dlsu.edu.ph/faculty_research/3405
https://animorepository.dlsu.edu.ph/context/faculty_research/article/4407/type/native/viewcontent/0006926201660171.html
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: De La Salle University
id oai:animorepository.dlsu.edu.ph:faculty_research-4407
record_format eprints
spelling oai:animorepository.dlsu.edu.ph:faculty_research-44072022-12-20T10:58:40Z Constructing a word similarity graph from vector based word representation for named entity recognition Feria, Miguel Balbin, Juan Paolo Santos Bautista, Francis Michael In this paper, we discuss a method for identifying a seed word that would best represent a class of named entities in a graphical representation of words and their similarities. Word networks, or word graphs, are representations of vectorized text where nodes are the words encountered in a corpus, and the weighted edges incident on the nodes represent how similar the words are to each other. Word networks are then divided into communities using the Louvain Method for community detection, then betweenness centrality of each node in each community is computed. The most central node in each community represents the most ideal candidate for a seed word of a named entity group which represents the community. Our results from our bilingual data set show that words with similar lexical content, from either language, belong to the same community. Copyright © 2018 by SCITEPRESS - Science and Technology Publications, Lda. All rights reserved 2018-01-01T08:00:00Z text text/html https://animorepository.dlsu.edu.ph/faculty_research/3405 info:doi/10.5220/0006926201660171 https://animorepository.dlsu.edu.ph/context/faculty_research/article/4407/type/native/viewcontent/0006926201660171.html Faculty Research Work Animo Repository Linguistics—Graphic methods Semantics—Mathematical models Computer Sciences Mathematics
institution De La Salle University
building De La Salle University Library
continent Asia
country Philippines
Philippines
content_provider De La Salle University Library
collection DLSU Institutional Repository
topic Linguistics—Graphic methods
Semantics—Mathematical models
Computer Sciences
Mathematics
spellingShingle Linguistics—Graphic methods
Semantics—Mathematical models
Computer Sciences
Mathematics
Feria, Miguel
Balbin, Juan Paolo Santos
Bautista, Francis Michael
Constructing a word similarity graph from vector based word representation for named entity recognition
description In this paper, we discuss a method for identifying a seed word that would best represent a class of named entities in a graphical representation of words and their similarities. Word networks, or word graphs, are representations of vectorized text where nodes are the words encountered in a corpus, and the weighted edges incident on the nodes represent how similar the words are to each other. Word networks are then divided into communities using the Louvain Method for community detection, then betweenness centrality of each node in each community is computed. The most central node in each community represents the most ideal candidate for a seed word of a named entity group which represents the community. Our results from our bilingual data set show that words with similar lexical content, from either language, belong to the same community. Copyright © 2018 by SCITEPRESS - Science and Technology Publications, Lda. All rights reserved
format text
author Feria, Miguel
Balbin, Juan Paolo Santos
Bautista, Francis Michael
author_facet Feria, Miguel
Balbin, Juan Paolo Santos
Bautista, Francis Michael
author_sort Feria, Miguel
title Constructing a word similarity graph from vector based word representation for named entity recognition
title_short Constructing a word similarity graph from vector based word representation for named entity recognition
title_full Constructing a word similarity graph from vector based word representation for named entity recognition
title_fullStr Constructing a word similarity graph from vector based word representation for named entity recognition
title_full_unstemmed Constructing a word similarity graph from vector based word representation for named entity recognition
title_sort constructing a word similarity graph from vector based word representation for named entity recognition
publisher Animo Repository
publishDate 2018
url https://animorepository.dlsu.edu.ph/faculty_research/3405
https://animorepository.dlsu.edu.ph/context/faculty_research/article/4407/type/native/viewcontent/0006926201660171.html
_version_ 1767195899395571712