Insights on the Hierarchy of Letters in Scrabble Using Cosine Similarity, Minimum Spanning Tree, and Centrality Analysis

This study aims to generate insights on the hierarchy and importance of letters in the game Scrabble by employing two operational research frameworks. Both frameworks begin by using a vector space model whose basis vectors are all the valid Scrabble words and where each letter is treated as a vector...

Bibliographic Details
Main Authors: Tolentino, Mark Anthony C; Lee, Vince Andrew L; Lorenzo, Axirazel D; Ramos, Tristan Emmanuel A
Format: text
Published: Archīum Ateneo 2024
Subjects: Applied Mathematics; Mathematics; Physical Sciences and Mathematics
Online Access: https://archium.ateneo.edu/mathematics-faculty-pubs/248
https://doi.org/10.1063/5.0192067
Institution: Ateneo De Manila University
id ph-ateneo-arc.mathematics-faculty-pubs-1249
record_format eprints
spelling ph-ateneo-arc.mathematics-faculty-pubs-1249 2024-04-15T07:44:42Z 2024-03-07T08:00:00Z text https://archium.ateneo.edu/mathematics-faculty-pubs/248 https://doi.org/10.1063/5.0192067 Mathematics Faculty Publications Archīum Ateneo
institution Ateneo De Manila University
building Ateneo De Manila University Library
continent Asia
country Philippines
content_provider Ateneo De Manila University Library
collection archium.Ateneo Institutional Repository
topic Applied Mathematics
Mathematics
Physical Sciences and Mathematics
description This study aims to generate insights on the hierarchy and importance of letters in the game Scrabble by employing two operational research frameworks. Both frameworks begin by using a vector space model whose basis vectors are all the valid Scrabble words and where each letter is treated as a vector. A network of the letters is then constructed where the edge weight between each pair of letters is determined using the corresponding vectors' cosine similarity, which is effectively a measure of the co-occurrence rate of the two letters. The first framework continues by obtaining the minimum spanning tree of the network and performing centrality analysis on the MST. Through the first framework, a hierarchy of the letters is obtained. This hierarchical arrangement shows how letters lower in the hierarchy depend on higher-level letters. On the other hand, the second framework involves performing centrality analysis on the original network of letters and results in a ranking of letters based on their co-occurrence rate with other letters. Based on the frameworks in the study, letter E emerges as the highest ranked letter while the letter Q consistently ranks at the bottom. Thus, the study demonstrates how the two frameworks can be used for a novel application and other possible applications of a similar nature.
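The pipeline summarized in the description can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' implementation: the six-word list is hypothetical toy data standing in for the full Scrabble dictionary, the 0/1 letter vectors are an assumption (the abstract does not say whether occurrences are counted), the conversion of similarity to the distance 1 - similarity before building the minimum spanning tree is an assumption, and plain degree (on the MST) and weighted degree (on the full network) stand in for whichever centrality measures the study actually uses.

```python
from itertools import combinations
from math import sqrt

# Hypothetical toy word list standing in for the full Scrabble dictionary.
WORDS = ["QUEEN", "TREE", "RATE", "STAR", "QUIT", "NEST"]


def letter_vectors(words):
    """Map each letter to a 0/1 vector with one coordinate per word (assumed encoding)."""
    letters = sorted(set("".join(words)))
    return {c: [1 if c in w else 0 for w in words] for c in letters}


def cosine(u, v):
    """Cosine similarity of two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (sqrt(sum(a * a for a in u)) * sqrt(sum(b * b for b in v)))


def similarity_network(vectors):
    """Complete network on the letters; edge weight = cosine similarity."""
    return {
        (a, b): cosine(vectors[a], vectors[b])
        for a, b in combinations(sorted(vectors), 2)
    }


def minimum_spanning_tree(nodes, sim):
    """Kruskal's algorithm on the assumed distance 1 - similarity."""
    parent = {n: n for n in nodes}

    def find(n):
        # Follow parent links to the component root, compressing the path.
        while parent[n] != n:
            parent[n] = parent[parent[n]]
            n = parent[n]
        return n

    tree = []
    for (a, b), s in sorted(sim.items(), key=lambda kv: 1 - kv[1]):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:  # edge joins two components, so keep it
            parent[root_a] = root_b
            tree.append((a, b))
    return tree


def degree(nodes, edges):
    """Number of tree edges touching each letter (first framework, simplified)."""
    deg = {n: 0 for n in nodes}
    for a, b in edges:
        deg[a] += 1
        deg[b] += 1
    return deg


def strength(nodes, sim):
    """Sum of similarity weights at each letter (second framework, simplified)."""
    total = {n: 0.0 for n in nodes}
    for (a, b), s in sim.items():
        total[a] += s
        total[b] += s
    return total


vectors = letter_vectors(WORDS)
nodes = sorted(vectors)
sim = similarity_network(vectors)
mst = minimum_spanning_tree(nodes, sim)
mst_degree = degree(nodes, mst)
net_strength = strength(nodes, sim)
```

Sorting `mst_degree` or `net_strength` by value gives a toy ranking of the letters. With so few words the ranking carries no meaning for the real game; the sketch only shows how the two frameworks fit together.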
format text
author Tolentino, Mark Anthony C
Lee, Vince Andrew L
Lorenzo, Axirazel D
Ramos, Tristan Emmanuel A
title Insights on the Hierarchy of Letters in Scrabble Using Cosine Similarity, Minimum Spanning Tree, and Centrality Analysis
publisher Archīum Ateneo
publishDate 2024
url https://archium.ateneo.edu/mathematics-faculty-pubs/248
https://doi.org/10.1063/5.0192067