Concepedia

TLDR

Data cube and OLAP, introduced by Jim Gray in 1997, have driven data warehousing, and the explosion of internet text motivates a cube model that merges OLAP with information‑retrieval techniques for multidimensional text. This paper proposes a text‑cube model for multidimensional text databases and investigates effective OLAP over such data. The model distinguishes dimensional and term hierarchies, and uses them to design efficient text‑cube implementation, OLAP execution, and query processing. Performance experiments demonstrate the high promise of the proposed methods.

Abstract

Since Jim Gray introduced the concept of rdquodata cuberdquo in 1997, data cube, associated with online analytical processing (OLAP), has become a driving engine in data warehouse industry. Because the boom of Internet has given rise to an ever increasing amount of text data associated with other multidimensional information, it is natural to propose a data cube model that integrates the power of traditional OLAP and IR techniques for text. In this paper, we propose a text-cube model on multidimensional text database and study effective OLAP over such data. Two kinds of hierarchies are distinguishable inside: dimensional hierarchy and term hierarchy. By incorporating these hierarchies, we conduct systematic studies on efficient text-cube implementation, OLAP execution and query processing. Our performance study shows the high promise of our methods.

References

YearCitations

Page 1