Data pre-processing for event data mining using Java
Data preprocessing include cleaning, standardizing, transforming raw data to a required formation for another processing procedure. It is commonly used as a preliminary data mining practice. In this project, raw data with different timestamp formats and text descriptions are required to be transform...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
2010
|
Subjects: | |
Online Access: | http://hdl.handle.net/10356/40709 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
Summary: | Data preprocessing include cleaning, standardizing, transforming raw data to a required formation for another processing procedure. It is commonly used as a preliminary data mining practice. In this project, raw data with different timestamp formats and text descriptions are required to be transformed into a standard format. The processed data will be further inputted into an Ant colony based algorithm for event data mining. As the size of the data is huge, manual process will be time consuming and error prone. A Java based data preprocessing tool is developed. It can handle raw data from different sources with various formats. Large dataset will be processed in an acceptable time period with required accuracy with this automated data pre-processing tool. |
---|