SOFT CLUSTERING USING SENTENCE EMBEDDING ON LEARNING OUTCOMES TEXT
Analysis of learning outcomes is one of the things needed to carry out continuous improvement in the curriculum used. This problem is included in the multi-label classification task. However, there is no data that has ground truth yet, and the annotation process takes quite a long time. Therefore...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/82443 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:82443 |
---|---|
spelling |
id-itb.:824432024-07-08T13:23:42ZSOFT CLUSTERING USING SENTENCE EMBEDDING ON LEARNING OUTCOMES TEXT Gianmarg H. Siahaan, Steven Indonesia Final Project Learning outcome, sentence embedding, Soft Clustering, Fuzzy C- Means, Gaussian Mixture Models, weiszfeld algorithm, all-mpnet-base-v2 INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/82443 Analysis of learning outcomes is one of the things needed to carry out continuous improvement in the curriculum used. This problem is included in the multi-label classification task. However, there is no data that has ground truth yet, and the annotation process takes quite a long time. Therefore, in this final assignment, soft clustering of texts is carried out into competencies. In grouping text, sentence embedding shows very good results compared to embedding at word level granularity. This final assignment analyzes learning outcomes by producing a final score obtained from the product of the weight and competency representation value. The weights were obtained through experiments in developing alternative soft clustering models, namely fuzzy c-means, Gaussian mixture models and calculating semantic similarity scores. Meanwhile, the representation of the value of each competency is obtained by implementing the Weiszfeld algorithm. Experimental and testing results show that the fuzzy c-means model using a variation of the all-mpnet-base-v2 sentence embedding model shows the best results with macro average F-1 score 0.73, micro average f-1 score 0.63, and weighted average f-1 Score 0.69 compared to other variations. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Analysis of learning outcomes is one of the things needed to carry out continuous
improvement in the curriculum used. This problem is included in the multi-label
classification task. However, there is no data that has ground truth yet, and the
annotation process takes quite a long time. Therefore, in this final assignment, soft
clustering of texts is carried out into competencies. In grouping text, sentence
embedding shows very good results compared to embedding at word level
granularity. This final assignment analyzes learning outcomes by producing a
final score obtained from the product of the weight and competency
representation value.
The weights were obtained through experiments in developing alternative soft
clustering models, namely fuzzy c-means, Gaussian mixture models and
calculating semantic similarity scores. Meanwhile, the representation of the value
of each competency is obtained by implementing the Weiszfeld algorithm.
Experimental and testing results show that the fuzzy c-means model using a
variation of the all-mpnet-base-v2 sentence embedding model shows the best
results with macro average F-1 score 0.73, micro average f-1 score 0.63, and
weighted average f-1 Score 0.69 compared to other variations. |
format |
Final Project |
author |
Gianmarg H. Siahaan, Steven |
spellingShingle |
Gianmarg H. Siahaan, Steven SOFT CLUSTERING USING SENTENCE EMBEDDING ON LEARNING OUTCOMES TEXT |
author_facet |
Gianmarg H. Siahaan, Steven |
author_sort |
Gianmarg H. Siahaan, Steven |
title |
SOFT CLUSTERING USING SENTENCE EMBEDDING ON LEARNING OUTCOMES TEXT |
title_short |
SOFT CLUSTERING USING SENTENCE EMBEDDING ON LEARNING OUTCOMES TEXT |
title_full |
SOFT CLUSTERING USING SENTENCE EMBEDDING ON LEARNING OUTCOMES TEXT |
title_fullStr |
SOFT CLUSTERING USING SENTENCE EMBEDDING ON LEARNING OUTCOMES TEXT |
title_full_unstemmed |
SOFT CLUSTERING USING SENTENCE EMBEDDING ON LEARNING OUTCOMES TEXT |
title_sort |
soft clustering using sentence embedding on learning outcomes text |
url |
https://digilib.itb.ac.id/gdl/view/82443 |
_version_ |
1822009775863365632 |