Real-time data warehousing : data integration framework.

A data warehouse primarily provides intelligent business facilities such as analytical processing, decision making and data mining. The design and implementation of data warehouses have been well studied and are supported by many commercial offerings. However, increasing demands for real-time systems fuel th...

Bibliographic Details
Main Author: Kyaw, Ye Lin.
Other Authors: Vivekanand Gopalkrishnan
Format: Theses and Dissertations
Language: English
Published: 2010
Subjects: DRNTU::Engineering::Computer science and engineering::Information systems::Database management
Online Access: http://hdl.handle.net/10356/42312
Institution: Nanyang Technological University
id sg-ntu-dr.10356-42312
record_format dspace
spelling sg-ntu-dr.10356-42312 | 2019-12-10T13:07:01Z | Wee Kim Wee School of Communication and Information | Centre for Advanced Information Systems | Master of Science (Information Studies) | 2010-10-29T08:27:03Z | Thesis | en | 108 p. | application/pdf
building NTU Library
country Singapore
collection DR-NTU
topic DRNTU::Engineering::Computer science and engineering::Information systems::Database management
description A data warehouse primarily provides intelligent business facilities such as analytical processing, decision making and data mining. The design and implementation of data warehouses have been well studied and are supported by many commercial offerings. However, increasing demands for real-time systems fuel the need for changes in existing data warehouse designs and frameworks. One change required to realise a real-time approach is to redefine the process of synchronization between transactional data and the data warehouse. Current loading processes are typically carried out using insert, batch or bulk loading, which can degrade real-time performance. Though bulk loading is faster, it needs additional preprocessing of the data, which is not cost-effective, particularly when transferring large amounts of data. It is thought that real-time data integration can improve the process of frequently loading small amounts of data. This dissertation discusses the performance and functionality of existing data integration tools along with their associated problems and drawbacks. Current trends in real-time data integration are also analysed, and a new real-time data integration framework is introduced. Implementations of the well-known real-time data integration method are also presented, as are possible extensions for applying the new real-time data integration implementation method to other problems. An in-depth empirical analysis demonstrates the validity of the proposed approach.