SOFT CLUSTERING USING SENTENCE EMBEDDING ON LEARNING OUTCOMES TEXT

Analysis of learning outcomes is one of the things needed to carry out continuous improvement in the curriculum used. This problem is included in the multi-label classification task. However, there is no data that has ground truth yet, and the annotation process takes quite a long time. Therefore...

Full description

Saved in:
Bibliographic Details
Main Author: Gianmarg H. Siahaan, Steven
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/82443
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:82443
spelling id-itb.:824432024-07-08T13:23:42ZSOFT CLUSTERING USING SENTENCE EMBEDDING ON LEARNING OUTCOMES TEXT Gianmarg H. Siahaan, Steven Indonesia Final Project Learning outcome, sentence embedding, Soft Clustering, Fuzzy C- Means, Gaussian Mixture Models, weiszfeld algorithm, all-mpnet-base-v2 INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/82443 Analysis of learning outcomes is one of the things needed to carry out continuous improvement in the curriculum used. This problem is included in the multi-label classification task. However, there is no data that has ground truth yet, and the annotation process takes quite a long time. Therefore, in this final assignment, soft clustering of texts is carried out into competencies. In grouping text, sentence embedding shows very good results compared to embedding at word level granularity. This final assignment analyzes learning outcomes by producing a final score obtained from the product of the weight and competency representation value. The weights were obtained through experiments in developing alternative soft clustering models, namely fuzzy c-means, Gaussian mixture models and calculating semantic similarity scores. Meanwhile, the representation of the value of each competency is obtained by implementing the Weiszfeld algorithm. Experimental and testing results show that the fuzzy c-means model using a variation of the all-mpnet-base-v2 sentence embedding model shows the best results with macro average F-1 score 0.73, micro average f-1 score 0.63, and weighted average f-1 Score 0.69 compared to other variations. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description Analysis of learning outcomes is one of the things needed to carry out continuous improvement in the curriculum used. This problem is included in the multi-label classification task. However, there is no data that has ground truth yet, and the annotation process takes quite a long time. Therefore, in this final assignment, soft clustering of texts is carried out into competencies. In grouping text, sentence embedding shows very good results compared to embedding at word level granularity. This final assignment analyzes learning outcomes by producing a final score obtained from the product of the weight and competency representation value. The weights were obtained through experiments in developing alternative soft clustering models, namely fuzzy c-means, Gaussian mixture models and calculating semantic similarity scores. Meanwhile, the representation of the value of each competency is obtained by implementing the Weiszfeld algorithm. Experimental and testing results show that the fuzzy c-means model using a variation of the all-mpnet-base-v2 sentence embedding model shows the best results with macro average F-1 score 0.73, micro average f-1 score 0.63, and weighted average f-1 Score 0.69 compared to other variations.
format Final Project
author Gianmarg H. Siahaan, Steven
spellingShingle Gianmarg H. Siahaan, Steven
SOFT CLUSTERING USING SENTENCE EMBEDDING ON LEARNING OUTCOMES TEXT
author_facet Gianmarg H. Siahaan, Steven
author_sort Gianmarg H. Siahaan, Steven
title SOFT CLUSTERING USING SENTENCE EMBEDDING ON LEARNING OUTCOMES TEXT
title_short SOFT CLUSTERING USING SENTENCE EMBEDDING ON LEARNING OUTCOMES TEXT
title_full SOFT CLUSTERING USING SENTENCE EMBEDDING ON LEARNING OUTCOMES TEXT
title_fullStr SOFT CLUSTERING USING SENTENCE EMBEDDING ON LEARNING OUTCOMES TEXT
title_full_unstemmed SOFT CLUSTERING USING SENTENCE EMBEDDING ON LEARNING OUTCOMES TEXT
title_sort soft clustering using sentence embedding on learning outcomes text
url https://digilib.itb.ac.id/gdl/view/82443
_version_ 1822009775863365632