On exploring and visualizing conference relationship
The Digital Bibliography & Library Project (DBLP) is a computer science bibliography which holds records of millions of publications. A XML copy of the DBLP data is available for download from its official web page. Due to the large file size of the DBLP XML, loading of the DBLP XML into the mem...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2012
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/48781 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-48781 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-487812023-03-03T20:42:16Z On exploring and visualizing conference relationship Lim, Jia Xing. Sun Aixin School of Computer Engineering Centre for Advanced Information Systems DRNTU::Engineering::Computer science and engineering::Information systems The Digital Bibliography & Library Project (DBLP) is a computer science bibliography which holds records of millions of publications. A XML copy of the DBLP data is available for download from its official web page. Due to the large file size of the DBLP XML, loading of the DBLP XML into the memory for every run of for any type of program created would be very inefficient. Furthermore, manipulation of the DBLP XML data in the memory would require the system to have a large amount of memory. In this report, we propose a design of a database through the study and understanding of the structure use in the storing of data in the DBLP XML. With the design of the database, a program will be created to migrate the data from the DBLP XML to MySQL database. With the migration of the data to a database, various types of programs could then be written to perform various kinds of data manipulation. In the context of this project, this report discusses how a program would be written to extract conference related information from the database and build a Lucene index on the extracted information. With the index created, similarities between different conferences will be computed based on papers published and the authors of the papers published. A graph based on the similarities of the conferences would then be generate and visualize. Bachelor of Engineering (Computer Science) 2012-05-09T04:49:56Z 2012-05-09T04:49:56Z 2012 2012 Final Year Project (FYP) http://hdl.handle.net/10356/48781 en Nanyang Technological University 34 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
DRNTU::Engineering::Computer science and engineering::Information systems |
spellingShingle |
DRNTU::Engineering::Computer science and engineering::Information systems Lim, Jia Xing. On exploring and visualizing conference relationship |
description |
The Digital Bibliography & Library Project (DBLP) is a computer science bibliography which holds records of millions of publications. A XML copy of the DBLP data is available for download from its official web page. Due to the large file size of the DBLP XML, loading of the DBLP XML into the memory for every run of for any type of program created would be very inefficient. Furthermore, manipulation of the DBLP XML data in the memory would require the system to have a large amount of memory.
In this report, we propose a design of a database through the study and understanding of the structure use in the storing of data in the DBLP XML. With the design of the database, a program will be created to migrate the data from the DBLP XML to MySQL database. With the migration of the data to a database, various types of programs could then be written to perform various kinds of data manipulation.
In the context of this project, this report discusses how a program would be written to extract conference related information from the database and build a Lucene index on the extracted information. With the index created, similarities between different conferences will be computed based on papers published and the authors of the papers published. A graph based on the similarities of the conferences would then be generate and visualize. |
author2 |
Sun Aixin |
author_facet |
Sun Aixin Lim, Jia Xing. |
format |
Final Year Project |
author |
Lim, Jia Xing. |
author_sort |
Lim, Jia Xing. |
title |
On exploring and visualizing conference relationship |
title_short |
On exploring and visualizing conference relationship |
title_full |
On exploring and visualizing conference relationship |
title_fullStr |
On exploring and visualizing conference relationship |
title_full_unstemmed |
On exploring and visualizing conference relationship |
title_sort |
on exploring and visualizing conference relationship |
publishDate |
2012 |
url |
http://hdl.handle.net/10356/48781 |
_version_ |
1759856776736407552 |