An efficient relational to column oriented database schema transformation technique
NoSQL database is introduced to overcome the high demand of managing database management systems in addition to the need for managing huge amount of data in unstructured format. Thus, data migration has become an important process in database management to migrate relational database to NoSQL dat...
Saved in:
Main Author: | |
---|---|
Format: | Thesis |
Language: | English |
Published: |
2019
|
Subjects: | |
Online Access: | http://psasir.upm.edu.my/id/eprint/90672/1/FSKTM%202020%201%20IR.pdf http://psasir.upm.edu.my/id/eprint/90672/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Universiti Putra Malaysia |
Language: | English |
id |
my.upm.eprints.90672 |
---|---|
record_format |
eprints |
spelling |
my.upm.eprints.906722021-09-03T01:06:04Z http://psasir.upm.edu.my/id/eprint/90672/ An efficient relational to column oriented database schema transformation technique Zaidi, Norwini NoSQL database is introduced to overcome the high demand of managing database management systems in addition to the need for managing huge amount of data in unstructured format. Thus, data migration has become an important process in database management to migrate relational database to NoSQL database due to the limitations in managing relational database. Schema transformation is an important process in data migration and there are various techniques that have been proposed to improve schema transformation and data migration from the relational database to the NoSQL database. The most common technique of schema transformation to NoSQL database is denormalization. However, schema transformation using denormalization suffers in terms of unnecessary data duplication in the NoSQL database that increases storage size. Furthermore, NoSQL database also has its limitations in terms of table joining and unable to perform queries on multiple tables. Schema transformation techniques using nested table merging describes only two related tables to merge. This inefficient schema transformation techniques lead to querying to be done on multiple tables and cause high query processing time. This research proposed a schema transformation technique for migrating data from relational database to column oriented database. The schema transformation technique has three main steps which are denormalization with read pattern, nested and multiple nested table merging, and rowkey design to reduce data redundancy and storage size to produce efficient query performance. In this technique, the read pattern identifies the access key of the query. The nested and multiple nested table merging techniques combined the tables that have the same access key to be in a nested form. The nested and multiple nested table merging on column oriented database leads the query to be performed on a single table to retrieve the data and thus improved query performance. Meanwhile, the rowkey design helps to determine the rowkey based on access keys that are identified in the read pattern technique. The experimental results showed that the proposed schema transformation technique managed to reduce data redundancy by eight column families thus reducing the storage size by 13.83% and improve the query performance time by 29.28% for DELL DVD dataset. While by using the Employees dataset, the proposed technique managed to reduce data redundancy by five column families thus reducing the storage size by 15.67% and improve the query performance time by 29.13%. 2019-09 Thesis NonPeerReviewed text en http://psasir.upm.edu.my/id/eprint/90672/1/FSKTM%202020%201%20IR.pdf Zaidi, Norwini (2019) An efficient relational to column oriented database schema transformation technique. Masters thesis, Universiti Putra Malaysia. Non-relational databases SQL (Computer program language) Database management |
institution |
Universiti Putra Malaysia |
building |
UPM Library |
collection |
Institutional Repository |
continent |
Asia |
country |
Malaysia |
content_provider |
Universiti Putra Malaysia |
content_source |
UPM Institutional Repository |
url_provider |
http://psasir.upm.edu.my/ |
language |
English |
topic |
Non-relational databases SQL (Computer program language) Database management |
spellingShingle |
Non-relational databases SQL (Computer program language) Database management Zaidi, Norwini An efficient relational to column oriented database schema transformation technique |
description |
NoSQL database is introduced to overcome the high demand of managing database
management systems in addition to the need for managing huge amount of data in
unstructured format. Thus, data migration has become an important process in database
management to migrate relational database to NoSQL database due to the limitations in
managing relational database. Schema transformation is an important process in data
migration and there are various techniques that have been proposed to improve schema
transformation and data migration from the relational database to the NoSQL database.
The most common technique of schema transformation to NoSQL database is
denormalization. However, schema transformation using denormalization suffers in
terms of unnecessary data duplication in the NoSQL database that increases storage size.
Furthermore, NoSQL database also has its limitations in terms of table joining and
unable to perform queries on multiple tables. Schema transformation techniques using
nested table merging describes only two related tables to merge. This inefficient schema
transformation techniques lead to querying to be done on multiple tables and cause high
query processing time.
This research proposed a schema transformation technique for migrating data from
relational database to column oriented database. The schema transformation technique
has three main steps which are denormalization with read pattern, nested and multiple
nested table merging, and rowkey design to reduce data redundancy and storage size to
produce efficient query performance. In this technique, the read pattern identifies the
access key of the query. The nested and multiple nested table merging techniques
combined the tables that have the same access key to be in a nested form. The nested and
multiple nested table merging on column oriented database leads the query to be
performed on a single table to retrieve the data and thus improved query performance.
Meanwhile, the rowkey design helps to determine the rowkey based on access keys that
are identified in the read pattern technique. The experimental results showed that the
proposed schema transformation technique managed to reduce data redundancy by eight column families thus reducing the storage size by 13.83% and improve the query
performance time by 29.28% for DELL DVD dataset. While by using the Employees
dataset, the proposed technique managed to reduce data redundancy by five column
families thus reducing the storage size by 15.67% and improve the query performance
time by 29.13%. |
format |
Thesis |
author |
Zaidi, Norwini |
author_facet |
Zaidi, Norwini |
author_sort |
Zaidi, Norwini |
title |
An efficient relational to column oriented database schema transformation technique |
title_short |
An efficient relational to column oriented database schema transformation technique |
title_full |
An efficient relational to column oriented database schema transformation technique |
title_fullStr |
An efficient relational to column oriented database schema transformation technique |
title_full_unstemmed |
An efficient relational to column oriented database schema transformation technique |
title_sort |
efficient relational to column oriented database schema transformation technique |
publishDate |
2019 |
url |
http://psasir.upm.edu.my/id/eprint/90672/1/FSKTM%202020%201%20IR.pdf http://psasir.upm.edu.my/id/eprint/90672/ |
_version_ |
1710677229311098880 |