OLAP query processing for partitioned data warehouses

In this vision paper, the authors discuss models and techniques for integrating, processing and querying data, information and knowledge within data warehouses in a user-centric manner. The user-centric emphasis yields a number of clear advantages over classical data warehouse architectures, the most relevant of which are the following: (i) a unified and meaningful representation of multidimensional data and knowledge patterns throughout the data warehouse layers (e.g., loading, storage, and metadata); (ii) advanced query mechanisms and guidance capable of extracting targeted information and knowledge by means of innovative information retrieval and data mining techniques. Within this framework, the authors first outline the importance of knowledge representation and management in data warehouses, where knowledge is expressed by existing ontologies or by patterns discovered from data. Then, the authors propose a user-centric architecture for OLAP query processing, which is the typical application interface to data warehouse systems. Finally, the authors offer insights towards cooperative query answering that makes use of knowledge management principles and exploits the peculiarities of data warehouses (e.g., multidimensionality, multi-resolution, and so forth).

Optimizing Aggregate Query Processing in Cloud Data Warehouses

Lecture Notes in Computer Science - Data Management in Cloud, Grid and P2P Systems, 2014, pp. 1-12

DOI: 10.1007/978-3-319-10067-8_1

Cited By ~ 3

Author(s): Swathi Kurunji, Tingjian Ge, Xinwen Fu, Benyuan Liu, Amrith Kumar, ...

Keyword(s): Query Processing, Data Warehouses, Cloud Data, Aggregate Query

Range sum query processing in parallel data warehouses

Proceedings of the 8th International Scientific and Practical Conference of Students, Post-graduates and Young Scientists. Modern Technique and Technologies. MTT'2002 (Cat. No.02EX550), 2004

DOI: 10.1109/pdcat.2003.1236437

Cited By ~ 1

Author(s): Li Jianzhong, Gao Hong

Keyword(s): Query Processing, Data Warehouses, Parallel Data

Query Processing of Pre-partitioned Data Using Sandwich Operators

Lecture Notes in Business Information Processing - Enabling Real-Time Business Intelligence, 2013, pp. 76-92

DOI: 10.1007/978-3-642-39872-8_6

Cited By ~ 1

Author(s): Stephan Baumann, Peter Boncz, Kai-Uwe Sattler

Keyword(s): Query Processing, Partitioned Data

Query Processing in Data Warehouses

Encyclopedia of Database Systems, 2009, pp. 2297-2301

DOI: 10.1007/978-0-387-39940-9_298

Author(s): Wolfgang Lehner

Keyword(s): Query Processing, Data Warehouses

Efficient and Robust Node-Partitioned Data Warehouses

Data Warehouses and OLAP, 2011, pp. 203-229

DOI: 10.4018/978-1-59904-364-7.ch009

Cited By ~ 4

Author(s): Pedro Furtado

Keyword(s): Local Area Network, Low Cost, Middle Layer, Database Systems, Area Network, Data Warehouses, Periodic Load, Partitioned Data, Target Environment, Database Engine

Running large data warehouses (DW) efficiently over low-cost platforms places special requirements on the design of the system architecture. The idea is to host the DW on a set of low-cost nodes in a non-dedicated local-area network (LAN). Nodes can run any relational database engine, and the system relies on a partitioning strategy and a query-processing middle layer. These characteristics contrast with typical parallel database systems, which rely on fast dedicated interconnects and hardware, as well as a specialized parallel query optimizer for a specific database engine. This chapter describes the architecture of the Node-Partitioned Data Warehouse (NPDW), designed to run in this low-cost environment, focusing on the design for partitioning, efficient parallel joins and query transformations. Given the low reliability of the target environment, we also show how replicas are incorporated in the design of a robust NPDW strategy with availability guarantees, and how the replicas are used for always-on, always-efficient behavior in the presence of periodic load and maintenance tasks.
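As a rough illustration of the kind of query transformation a middle layer over partitioned nodes has to perform, the sketch below rewrites a grouped AVG into per-node SUM and COUNT partials that are merged afterwards. It is not taken from the chapter: the function names, the hash partitioning on a key column, and the in-memory lists standing in for database nodes are all assumptions made for the example.

```python
from collections import defaultdict


def partition_rows(rows, key, num_nodes):
    """Hash-partition rows across nodes on a key column (assumed strategy)."""
    nodes = [[] for _ in range(num_nodes)]
    for row in rows:
        nodes[hash(row[key]) % num_nodes].append(row)
    return nodes


def local_aggregate(node_rows, group_col, value_col):
    """Work done on one node: partial [sum, count] per group."""
    partial = defaultdict(lambda: [0.0, 0])
    for row in node_rows:
        acc = partial[row[group_col]]
        acc[0] += row[value_col]
        acc[1] += 1
    return partial


def merge_average(partials):
    """Work done in the middle layer: combine partials, then divide.

    AVG cannot be merged by averaging the per-node averages, so the
    query is rewritten into SUM and COUNT, which can simply be added.
    """
    totals = defaultdict(lambda: [0.0, 0])
    for partial in partials:
        for group, (s, c) in partial.items():
            totals[group][0] += s
            totals[group][1] += c
    return {group: s / c for group, (s, c) in totals.items()}


def distributed_avg(rows, partition_key, group_col, value_col, num_nodes=3):
    """Hypothetical end-to-end helper: partition, aggregate locally, merge."""
    nodes = partition_rows(rows, partition_key, num_nodes)
    return merge_average(
        local_aggregate(node, group_col, value_col) for node in nodes
    )


# Toy fact table: odd order ids belong to "EU", even ones to "US".
rows = [
    {"order_id": i, "region": "EU" if i % 2 else "US", "amount": float(i)}
    for i in range(1, 11)
]
result = distributed_avg(rows, "order_id", "region", "amount")
```

The result does not depend on how many nodes are used or on where each row lands, which is the property that lets the partial aggregation run independently on every node.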
