HETEROGENOUS STRUCTURE DATA INTEGRATION ON EDUCATION AND SCIENTIFIC PUBLICATION DOMAIN

<p align="justify">Educational institution have many scattered educational and scientific publication data that is available to public on the internet. The data is scattered from research groups website, department website, and majors website. Other body inside the institute may also...

Full description

Saved in:
Bibliographic Details
Main Author: ARYO TYASONO (NIM : 13513075), BIMO
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/26119
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:26119
spelling id-itb.:261192018-03-16T13:39:54ZHETEROGENOUS STRUCTURE DATA INTEGRATION ON EDUCATION AND SCIENTIFIC PUBLICATION DOMAIN ARYO TYASONO (NIM : 13513075), BIMO Indonesia Final Project INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/26119 <p align="justify">Educational institution have many scattered educational and scientific publication data that is available to public on the internet. The data is scattered from research groups website, department website, and majors website. Other body inside the institute may also contain the education and scientific publication data. (Darmawan, 2017) have finished final project about how to collect those data to the structured data but still in the heterogenous structure. The next process is to determine how to utilize the structured data. <br /> <br /> This final project aims to build a system that consists of data integration part and data visualization part. Integration process consists of data cleaning, data indexing, record pair comparison, and record pair classification. The focus of this final project <br /> <br /> is to make a whole working system that will use the existing tools and library and will find the most optimal solution to the problem. Testing for the final project including comparison of the performance from various algorithm (Jaro, Winkler, Q-Gram, and Levenshtein) to integrate the data. It is concluded that the final project successfully build the whole working system to utilize every educational and scientific publication data in the form of output from another final project (Darmawan, 2017) with removing the duplicates and display the visualisation in the web platform. It is hoped that this final project will be <br /> <br /> developed further to handle more cases outside the domain of education and scientific publication.<p align="justify"> text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description <p align="justify">Educational institution have many scattered educational and scientific publication data that is available to public on the internet. The data is scattered from research groups website, department website, and majors website. Other body inside the institute may also contain the education and scientific publication data. (Darmawan, 2017) have finished final project about how to collect those data to the structured data but still in the heterogenous structure. The next process is to determine how to utilize the structured data. <br /> <br /> This final project aims to build a system that consists of data integration part and data visualization part. Integration process consists of data cleaning, data indexing, record pair comparison, and record pair classification. The focus of this final project <br /> <br /> is to make a whole working system that will use the existing tools and library and will find the most optimal solution to the problem. Testing for the final project including comparison of the performance from various algorithm (Jaro, Winkler, Q-Gram, and Levenshtein) to integrate the data. It is concluded that the final project successfully build the whole working system to utilize every educational and scientific publication data in the form of output from another final project (Darmawan, 2017) with removing the duplicates and display the visualisation in the web platform. It is hoped that this final project will be <br /> <br /> developed further to handle more cases outside the domain of education and scientific publication.<p align="justify">
format Final Project
author ARYO TYASONO (NIM : 13513075), BIMO
spellingShingle ARYO TYASONO (NIM : 13513075), BIMO
HETEROGENOUS STRUCTURE DATA INTEGRATION ON EDUCATION AND SCIENTIFIC PUBLICATION DOMAIN
author_facet ARYO TYASONO (NIM : 13513075), BIMO
author_sort ARYO TYASONO (NIM : 13513075), BIMO
title HETEROGENOUS STRUCTURE DATA INTEGRATION ON EDUCATION AND SCIENTIFIC PUBLICATION DOMAIN
title_short HETEROGENOUS STRUCTURE DATA INTEGRATION ON EDUCATION AND SCIENTIFIC PUBLICATION DOMAIN
title_full HETEROGENOUS STRUCTURE DATA INTEGRATION ON EDUCATION AND SCIENTIFIC PUBLICATION DOMAIN
title_fullStr HETEROGENOUS STRUCTURE DATA INTEGRATION ON EDUCATION AND SCIENTIFIC PUBLICATION DOMAIN
title_full_unstemmed HETEROGENOUS STRUCTURE DATA INTEGRATION ON EDUCATION AND SCIENTIFIC PUBLICATION DOMAIN
title_sort heterogenous structure data integration on education and scientific publication domain
url https://digilib.itb.ac.id/gdl/view/26119
_version_ 1822921790710611968