Automation of scraping for conflict of interest in webpages

As the world advances in technology, researchers compete to submit their papers to gain recognition and to show advances in technology. However, just as these calls are held by people, some of these submissions could be submitted by those who had recent contact or relations with the organiser of the...

Full description

Saved in:

Bibliographic Details
Main Author:	Lee, Ming Jia
Other Authors:	Sourav S Bhowmick
Format:	Final Year Project
Language:	English
Published:	Nanyang Technological University 2024
Subjects:	Computer and Information Science Web scraping
Online Access:	https://hdl.handle.net/10356/181181
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-181181
record_format	dspace
spelling	sg-ntu-dr.10356-1811812024-11-18T01:16:39Z Automation of scraping for conflict of interest in webpages Lee, Ming Jia Sourav S Bhowmick College of Computing and Data Science ASSourav@ntu.edu.sg Computer and Information Science Web scraping As the world advances in technology, researchers compete to submit their papers to gain recognition and to show advances in technology. However, just as these calls are held by people, some of these submissions could be submitted by those who had recent contact or relations with the organiser of the events. As a result, this could result in an unfair competition, leading to conflict of interest between the organisers and candidates. Hence, during the submission of a paper, the submission sites will request information about conflicts of interest of the paper's authors with program committee (PC) members. This project presents the development of an automation python-based application that can be used to extract for information relating to conflict of interest related to an event. The goal of this project is to automate the process so that it can be done on multiple webpages at the same time, hence not requiring the user to individually type down every webpage to be scraped from. Future work on the application will focus on optimising the code to prevent the code from extracting excessive information as well as improving its capabilities in scraping other information stored in the webpage. Bachelor's degree 2024-11-18T01:16:39Z 2024-11-18T01:16:39Z 2024 Final Year Project (FYP) Lee, M. J. (2024). Automation of scraping for conflict of interest in webpages. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/181181 https://hdl.handle.net/10356/181181 en application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Computer and Information Science Web scraping
spellingShingle	Computer and Information Science Web scraping Lee, Ming Jia Automation of scraping for conflict of interest in webpages
description	As the world advances in technology, researchers compete to submit their papers to gain recognition and to show advances in technology. However, just as these calls are held by people, some of these submissions could be submitted by those who had recent contact or relations with the organiser of the events. As a result, this could result in an unfair competition, leading to conflict of interest between the organisers and candidates. Hence, during the submission of a paper, the submission sites will request information about conflicts of interest of the paper's authors with program committee (PC) members. This project presents the development of an automation python-based application that can be used to extract for information relating to conflict of interest related to an event. The goal of this project is to automate the process so that it can be done on multiple webpages at the same time, hence not requiring the user to individually type down every webpage to be scraped from. Future work on the application will focus on optimising the code to prevent the code from extracting excessive information as well as improving its capabilities in scraping other information stored in the webpage.
author2	Sourav S Bhowmick
author_facet	Sourav S Bhowmick Lee, Ming Jia
format	Final Year Project
author	Lee, Ming Jia
author_sort	Lee, Ming Jia
title	Automation of scraping for conflict of interest in webpages
title_short	Automation of scraping for conflict of interest in webpages
title_full	Automation of scraping for conflict of interest in webpages
title_fullStr	Automation of scraping for conflict of interest in webpages
title_full_unstemmed	Automation of scraping for conflict of interest in webpages
title_sort	automation of scraping for conflict of interest in webpages
publisher	Nanyang Technological University
publishDate	2024
url	https://hdl.handle.net/10356/181181
_version_	1816859058304450560

Automation of scraping for conflict of interest in webpages

Similar Items