OLAP query processing for partitioned data warehouses

Author(s):  
L. Bellatreche ◽  
K. Karlapalem ◽  
M. Mohania
2013 ◽  
Vol 9 (2) ◽  
pp. 89-109 ◽  
Author(s):  
Marie-Aude Aufaure ◽  
Alfredo Cuzzocrea ◽  
Cécile Favre ◽  
Patrick Marcel ◽  
Rokia Missaoui

In this vision paper, the authors discuss models and techniques for integrating, processing and querying data, information and knowledge within data warehouses in a user-centric manner. The user-centric emphasis allows us to achieve a number of clear advantages with respect to classical data warehouse architectures, whose most relevant ones are the following: (i) a unified and meaningful representation of multidimensional data and knowledge patterns throughout the data warehouse layers (i.e., loading, storage, metadata, etc); (ii) advanced query mechanisms and guidance that are capable of extracting targeted information and knowledge by means of innovative information retrieval and data mining techniques. Following this main framework, the authors first outline the importance of knowledge representation and management in data warehouses, where knowledge is expressed by existing ontology or patterns discovered from data. Then, the authors propose a user-centric architecture for OLAP query processing, which is the typical applicative interface to data warehouse systems. Finally, the authors propose insights towards cooperative query answering that make use of knowledge management principles and exploit the peculiarities of data warehouses (e.g., multidimensionality, multi-resolution, and so forth).


Author(s):  
Swathi Kurunji ◽  
Tingjian Ge ◽  
Xinwen Fu ◽  
Benyuan Liu ◽  
Amrith Kumar ◽  
...  

2011 ◽  
pp. 203-229 ◽  
Author(s):  
Pedro Furtado

Running large data warehouses (DW) efficiently over low cost platforms places special requirements on the design of system architecture. The idea is to have the DW on a set of low-cost nodes in a non-dedicated local-area network (LAN). Nodes can run any relational database engine, and the system relies on a partitioning strategy and query processing middle layer. These characteristics are in contrast with typical parallel database systems, which rely on fast dedicated interconnects and hardware, as well as a specialized parallel query optimizer for a specific database engine. This chapter describes the architecture of the Node-Partitioned Data Warehouse (NPDW), designed to run on the low cost environment, focusing on the design for partitioning, efficient parallel join and query transformations. Given the low reliability of the target environment, we also show how replicas are incorporated in the design of a robust NPDW strategy with availability guarantees and how the replicas are used for always-on, always efficient behavior in the presence of periodic load and maintenance tasks.


Sign in / Sign up

Export Citation Format

Share Document