Data pre-processing for event data mining using Java

Bibliographic Details
Main Author: Chang, Lai Yan.
Other Authors: Chen Lihui
Format: Final Year Project
Language: English
Published: 2010
Subjects:
Online Access: http://hdl.handle.net/10356/40709
Institution: Nanyang Technological University
Description
Summary: Data preprocessing includes cleaning, standardizing, and transforming raw data into the format required by a subsequent processing step. It is commonly used as a preliminary step in data mining. In this project, raw data with different timestamp formats and text descriptions must be transformed into a standard format. The processed data are then fed into an ant colony-based algorithm for event data mining. Because the data set is very large, manual processing would be time-consuming and error-prone. A Java-based data preprocessing tool was therefore developed. It can handle raw data from different sources in various formats. With this automated tool, large data sets can be processed within an acceptable time and with the required accuracy.
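
The timestamp standardization mentioned in the summary could be sketched roughly as follows. This is only an illustration and not the project's actual tool: the class name TimestampNormalizer, the two input patterns, and the ISO-style output pattern are all assumptions, since the record does not say which formats the raw data used or which standard format was chosen.

```java
import java.time.LocalDateTime;
import java.time.format.DateTimeFormatter;
import java.time.format.DateTimeParseException;
import java.util.List;
import java.util.Locale;
import java.util.Optional;

public class TimestampNormalizer {

    // Hypothetical input patterns; the real formats in the raw data are not
    // stated in the record.
    private static final List<DateTimeFormatter> INPUT_FORMATS = List.of(
            DateTimeFormatter.ofPattern("yyyy-MM-dd HH:mm:ss", Locale.ENGLISH),
            DateTimeFormatter.ofPattern("dd/MM/yyyy HH:mm", Locale.ENGLISH));

    // Assumed target format (ISO-like, always with seconds).
    private static final DateTimeFormatter OUTPUT_FORMAT =
            DateTimeFormatter.ofPattern("yyyy-MM-dd'T'HH:mm:ss", Locale.ENGLISH);

    /**
     * Tries each known input pattern in order and returns the timestamp in the
     * output format, or an empty Optional if no pattern matches.
     */
    public static Optional<String> normalize(String raw) {
        String trimmed = raw.trim();
        for (DateTimeFormatter format : INPUT_FORMATS) {
            try {
                LocalDateTime parsed = LocalDateTime.parse(trimmed, format);
                return Optional.of(parsed.format(OUTPUT_FORMAT));
            } catch (DateTimeParseException e) {
                // This pattern did not match; fall through to the next one.
            }
        }
        return Optional.empty();
    }

    public static void main(String[] args) {
        for (String raw : List.of("2010-03-05 14:07:09", "05/03/2010 14:07", "not a date")) {
            System.out.println(raw + " -> " + normalize(raw).orElse("UNPARSED"));
        }
    }
}
```

Returning an empty Optional for unrecognized input, rather than throwing, lets a batch run collect unparseable records for later inspection instead of stopping partway through a large data set.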