Morphologically-aware vocabulary reduction of word embeddings

We propose SubText, a compression mechanism via vocabulary reduction. The crux is to judiciously select a subset of word embeddings which support the reconstruction of the remaining word embeddings based on their form alone. The proposed algorithm considers the preservation of the original embedding...

Full description

Saved in:
Bibliographic Details
Main Authors: CHIA, Chong Cher, TKACHENKO, Maksim, LAUW, Hady Wirawan
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/7608
https://ink.library.smu.edu.sg/context/sis_research/article/8611/viewcontent/wiiat22.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English