Text Cube: Computing IR Measures for Multidimensional Text Database Analysis

Since Jim Gray introduced the concept of the "data cube" in 1997, the data cube, associated with online analytical processing (OLAP), has become a driving engine of the data warehouse industry. Because the Internet boom has given rise to an ever-increasing amount of text data associated with other multidimen...

Bibliographic Details
Main Authors: LIN, Cindy Xinde; DING, Bolin; HAN, Jiawei; ZHU, Feida; ZHAO, Bo
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 2008
Subjects: Cube; Text; OLAP; Databases and Information Systems
Online Access: https://ink.library.smu.edu.sg/sis_research/1008
https://ink.library.smu.edu.sg/context/sis_research/article/2007/viewcontent/TextCube_2008.pdf
Institution: Singapore Management University
id sg-smu-ink.sis_research-2007
record_format dspace
spelling sg-smu-ink.sis_research-2007; 2017-11-22T06:07:59Z; 2008-12-01T08:00:00Z; text; application/pdf; info:doi/10.1109/ICDM.2008.135; http://creativecommons.org/licenses/by-nc-nd/4.0/; Research Collection School Of Computing and Information Systems; eng
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Cube
Text
OLAP
Databases and Information Systems
description Since Jim Gray introduced the concept of the "data cube" in 1997, the data cube, associated with online analytical processing (OLAP), has become a driving engine of the data warehouse industry. Because the Internet boom has given rise to an ever-increasing amount of text data associated with other multidimensional information, it is natural to propose a data cube model that integrates the power of traditional OLAP with IR techniques for text. In this paper, we propose a Text-Cube model on multidimensional text databases and study effective OLAP over such data. The model distinguishes two kinds of hierarchies: the dimensional hierarchy and the term hierarchy. By incorporating these hierarchies, we conduct systematic studies of efficient text-cube implementation, OLAP execution, and query processing. Our performance study shows the high promise of our methods.
format text
author LIN, Cindy Xinde
DING, Bolin
HAN, Jiawei
ZHU, Feida
ZHAO, Bo
author_sort LIN, Cindy Xinde
title Text Cube: Computing IR Measures for Multidimensional Text Database Analysis
publisher Institutional Knowledge at Singapore Management University
publishDate 2008
url https://ink.library.smu.edu.sg/sis_research/1008
https://ink.library.smu.edu.sg/context/sis_research/article/2007/viewcontent/TextCube_2008.pdf
_version_ 1770570821571444736