On exploring and visualizing conference relationship

The Digital Bibliography & Library Project (DBLP) is a computer science bibliography which holds records of millions of publications. A XML copy of the DBLP data is available for download from its official web page. Due to the large file size of the DBLP XML, loading of the DBLP XML into the mem...

Full description

Saved in:
Bibliographic Details
Main Author: Lim, Jia Xing.
Other Authors: Sun Aixin
Format: Final Year Project
Language:English
Published: 2012
Subjects:
Online Access:http://hdl.handle.net/10356/48781
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-48781
record_format dspace
spelling sg-ntu-dr.10356-487812023-03-03T20:42:16Z On exploring and visualizing conference relationship Lim, Jia Xing. Sun Aixin School of Computer Engineering Centre for Advanced Information Systems DRNTU::Engineering::Computer science and engineering::Information systems The Digital Bibliography & Library Project (DBLP) is a computer science bibliography which holds records of millions of publications. A XML copy of the DBLP data is available for download from its official web page. Due to the large file size of the DBLP XML, loading of the DBLP XML into the memory for every run of for any type of program created would be very inefficient. Furthermore, manipulation of the DBLP XML data in the memory would require the system to have a large amount of memory. In this report, we propose a design of a database through the study and understanding of the structure use in the storing of data in the DBLP XML. With the design of the database, a program will be created to migrate the data from the DBLP XML to MySQL database. With the migration of the data to a database, various types of programs could then be written to perform various kinds of data manipulation. In the context of this project, this report discusses how a program would be written to extract conference related information from the database and build a Lucene index on the extracted information. With the index created, similarities between different conferences will be computed based on papers published and the authors of the papers published. A graph based on the similarities of the conferences would then be generate and visualize. Bachelor of Engineering (Computer Science) 2012-05-09T04:49:56Z 2012-05-09T04:49:56Z 2012 2012 Final Year Project (FYP) http://hdl.handle.net/10356/48781 en Nanyang Technological University 34 p. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic DRNTU::Engineering::Computer science and engineering::Information systems
spellingShingle DRNTU::Engineering::Computer science and engineering::Information systems
Lim, Jia Xing.
On exploring and visualizing conference relationship
description The Digital Bibliography & Library Project (DBLP) is a computer science bibliography which holds records of millions of publications. A XML copy of the DBLP data is available for download from its official web page. Due to the large file size of the DBLP XML, loading of the DBLP XML into the memory for every run of for any type of program created would be very inefficient. Furthermore, manipulation of the DBLP XML data in the memory would require the system to have a large amount of memory. In this report, we propose a design of a database through the study and understanding of the structure use in the storing of data in the DBLP XML. With the design of the database, a program will be created to migrate the data from the DBLP XML to MySQL database. With the migration of the data to a database, various types of programs could then be written to perform various kinds of data manipulation. In the context of this project, this report discusses how a program would be written to extract conference related information from the database and build a Lucene index on the extracted information. With the index created, similarities between different conferences will be computed based on papers published and the authors of the papers published. A graph based on the similarities of the conferences would then be generate and visualize.
author2 Sun Aixin
author_facet Sun Aixin
Lim, Jia Xing.
format Final Year Project
author Lim, Jia Xing.
author_sort Lim, Jia Xing.
title On exploring and visualizing conference relationship
title_short On exploring and visualizing conference relationship
title_full On exploring and visualizing conference relationship
title_fullStr On exploring and visualizing conference relationship
title_full_unstemmed On exploring and visualizing conference relationship
title_sort on exploring and visualizing conference relationship
publishDate 2012
url http://hdl.handle.net/10356/48781
_version_ 1759856776736407552