DEVELOPMENT OF FACEBOOK SCRAPER AND HATE SPEECH CLASSIFICATION OF SOCIAL MEDIA USERS USING PRE-TRAINED MODEL FEATURES IN INSTITUT TEKNOLOGI BANDUNG’S SOCIAL NETWORK ANALYSIS APPLICATION

Before the development of this Final Project, the Social Network Analysis Application by Institut Teknologi Bandung (SNA ITB) was only able to procure data for analyzation from the social media Twitter. This is not ideal as the number of users for other social medias like Facebook dwarfs the numb...

Full description

Saved in:
Bibliographic Details
Main Author: Junod, Girvin
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/74152
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:74152
spelling id-itb.:741522023-06-26T14:06:11ZDEVELOPMENT OF FACEBOOK SCRAPER AND HATE SPEECH CLASSIFICATION OF SOCIAL MEDIA USERS USING PRE-TRAINED MODEL FEATURES IN INSTITUT TEKNOLOGI BANDUNG’S SOCIAL NETWORK ANALYSIS APPLICATION Junod, Girvin Indonesia Final Project social network analysis, scraper, Facebook, classification, hate speech. INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/74152 Before the development of this Final Project, the Social Network Analysis Application by Institut Teknologi Bandung (SNA ITB) was only able to procure data for analyzation from the social media Twitter. This is not ideal as the number of users for other social medias like Facebook dwarfs the number of Twitter users in Indonesia. Because of that, the Facebook scraper feature was added to the SNA ITB application in this final project. Another feature was also developed for the application which was the hate speech classification feature. The significant amount of hate speech that is spreading around social media and its negative impact on society causes the feature to be highly sought and it was decided that the feature would be added to the SNA ITB application. The development of the Facebook scraper feature is done using tools that can scrape data from Facebook based on keywords or user profiles. Several APIs and web pages related to the uses of Facebook scraper in the application is created in this Final Project so that the feature can be used. The addition of Facebook scraper causes a change in the application flow for several features. Because of that, several changes are also done to existing web pages and APIs to fit the new application flow. The addition of the hate speech classification feature is done by developing a hate speech classification model using a pre-trained model and developing various APIs and web pages related to hate speech statistics. Other than that, an experiment is also attempted to try to improve the performance the hate speech classification by injecting knowledge into the pre-trained model. The development of the hate speech classification model is successful with the final model demonstrating a strong performance with an accuracy score of 73.89% and F1 score of 80.77% for multi-label classification. This final model is a pre-trained IndoBERTweet model that is fine-tuned using an Indonesian hate speech dataset. The attempt to improve the model’s performance using knowledge injection failed to produce a better performing model. The result of this Final Project is the SNA ITB Application that now has new features such as the Facebook scraper feature and the hate speech classification feature. Users of the SNA ITB application now can scrape data from facebook and analyze it using the features of the application. Hate speech analysis can also be done to the scraped data. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description Before the development of this Final Project, the Social Network Analysis Application by Institut Teknologi Bandung (SNA ITB) was only able to procure data for analyzation from the social media Twitter. This is not ideal as the number of users for other social medias like Facebook dwarfs the number of Twitter users in Indonesia. Because of that, the Facebook scraper feature was added to the SNA ITB application in this final project. Another feature was also developed for the application which was the hate speech classification feature. The significant amount of hate speech that is spreading around social media and its negative impact on society causes the feature to be highly sought and it was decided that the feature would be added to the SNA ITB application. The development of the Facebook scraper feature is done using tools that can scrape data from Facebook based on keywords or user profiles. Several APIs and web pages related to the uses of Facebook scraper in the application is created in this Final Project so that the feature can be used. The addition of Facebook scraper causes a change in the application flow for several features. Because of that, several changes are also done to existing web pages and APIs to fit the new application flow. The addition of the hate speech classification feature is done by developing a hate speech classification model using a pre-trained model and developing various APIs and web pages related to hate speech statistics. Other than that, an experiment is also attempted to try to improve the performance the hate speech classification by injecting knowledge into the pre-trained model. The development of the hate speech classification model is successful with the final model demonstrating a strong performance with an accuracy score of 73.89% and F1 score of 80.77% for multi-label classification. This final model is a pre-trained IndoBERTweet model that is fine-tuned using an Indonesian hate speech dataset. The attempt to improve the model’s performance using knowledge injection failed to produce a better performing model. The result of this Final Project is the SNA ITB Application that now has new features such as the Facebook scraper feature and the hate speech classification feature. Users of the SNA ITB application now can scrape data from facebook and analyze it using the features of the application. Hate speech analysis can also be done to the scraped data.
format Final Project
author Junod, Girvin
spellingShingle Junod, Girvin
DEVELOPMENT OF FACEBOOK SCRAPER AND HATE SPEECH CLASSIFICATION OF SOCIAL MEDIA USERS USING PRE-TRAINED MODEL FEATURES IN INSTITUT TEKNOLOGI BANDUNG’S SOCIAL NETWORK ANALYSIS APPLICATION
author_facet Junod, Girvin
author_sort Junod, Girvin
title DEVELOPMENT OF FACEBOOK SCRAPER AND HATE SPEECH CLASSIFICATION OF SOCIAL MEDIA USERS USING PRE-TRAINED MODEL FEATURES IN INSTITUT TEKNOLOGI BANDUNG’S SOCIAL NETWORK ANALYSIS APPLICATION
title_short DEVELOPMENT OF FACEBOOK SCRAPER AND HATE SPEECH CLASSIFICATION OF SOCIAL MEDIA USERS USING PRE-TRAINED MODEL FEATURES IN INSTITUT TEKNOLOGI BANDUNG’S SOCIAL NETWORK ANALYSIS APPLICATION
title_full DEVELOPMENT OF FACEBOOK SCRAPER AND HATE SPEECH CLASSIFICATION OF SOCIAL MEDIA USERS USING PRE-TRAINED MODEL FEATURES IN INSTITUT TEKNOLOGI BANDUNG’S SOCIAL NETWORK ANALYSIS APPLICATION
title_fullStr DEVELOPMENT OF FACEBOOK SCRAPER AND HATE SPEECH CLASSIFICATION OF SOCIAL MEDIA USERS USING PRE-TRAINED MODEL FEATURES IN INSTITUT TEKNOLOGI BANDUNG’S SOCIAL NETWORK ANALYSIS APPLICATION
title_full_unstemmed DEVELOPMENT OF FACEBOOK SCRAPER AND HATE SPEECH CLASSIFICATION OF SOCIAL MEDIA USERS USING PRE-TRAINED MODEL FEATURES IN INSTITUT TEKNOLOGI BANDUNG’S SOCIAL NETWORK ANALYSIS APPLICATION
title_sort development of facebook scraper and hate speech classification of social media users using pre-trained model features in institut teknologi bandung’s social network analysis application
url https://digilib.itb.ac.id/gdl/view/74152
_version_ 1822993569516879872