OLAP CUBE AGGREGATION OPERATOR ON COLUMN- ORIENTED NOSQL DATABASE USING RESILIENT DISTRIBUTED DATASET APPROACH

OLAP cube is a multidimensional data structure that enables efficient and fast data analysis. OLAP cubes are formed by performing aggregation operations on each hierarchy of dimensions used. Modeling OLAP cube aggregation operators refers to the process of defining how data should be summarized o...

全面介紹

Saved in:
書目詳細資料
主要作者: Septian Adhitia, Ginanjar
格式: Theses
語言:Indonesia
在線閱讀:https://digilib.itb.ac.id/gdl/view/76853
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!
機構: Institut Teknologi Bandung
語言: Indonesia
實物特徵
總結:OLAP cube is a multidimensional data structure that enables efficient and fast data analysis. OLAP cubes are formed by performing aggregation operations on each hierarchy of dimensions used. Modeling OLAP cube aggregation operators refers to the process of defining how data should be summarized or combined at various cube levels. Previous research has modelled column-based OLAP cube aggregation operators using MapReduce. However, these aggregation operators suffer from slow computational times. This study aims to develop an OLAP cube aggregation operator model that can provide faster computational times in a column-based NoSQL environment. This research focuses on analyzing the utilization of the RDD (Resilient Distributed Dataset) approach to model column-based NoSQL OLAP cube aggregation operators for achieving efficiency and faster computational times. This analysis involves adjusting the concepts and logical architecture of OLAP cube aggregation operators to align with the characteristics and capabilities of RDD. The resulting model is implemented using the Python programming language for testing purposes. Testing is conducted by comparing the execution time of RDD aggregation operators with those using MapReduce. From the testing results covering functionality and performance, it is evident that RDD aggregation operators offer shorter computational times and can be applied across various column-based NoSQL databases.