IMPROVING THE PERFORMANCE OF HDBSCAN ON SHORT TEXT CLUSTERING BY USING WORD EMBEDDINGS AND UMAP
Short text is one of the data formats usually generated by people on social media, for instance, tweets on Twitter. They are often used as data to analyze what is trending in the community. However, topic modeling or text clustering algorithms on short text have some unique problems. Namely, s...
Saved in:
Main Author: | Sidik Asyaky, Muhammad |
---|---|
Format: | Theses |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/58051 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
Similar Items
-
Semantic visualization for short texts with word embeddings
by: LE, Van Minh Tuan, et al.
Published: (2017) -
PRELIMINARY STUDY OF HDBSCAN AND ISOCHRONE CURVES TO DETERMINE THE AGE OF OPEN CLUSTER IC 1590
by: Adam Alifka, Fadhilla -
Short text classification using very few words
by: Sun, Aixin
Published: (2013) -
KEANGGOTAAN GUGUS TERBUKA GANDA DENGAN METODE PEMBELAJARAN MESIN HDBSCAN
by: Chikmah -
SOFT CLUSTERING USING SENTENCE EMBEDDING ON LEARNING OUTCOMES TEXT
by: Gianmarg H. Siahaan, Steven