Selection of views to materialize in data warehouse: A hybrid solution

Author(s):  
A. Boukra ◽  
M. Ahmed Nacer ◽  
S. Bouroubi
2008 ◽  
pp. 3085-3115
Author(s):  
Biren Shah ◽  
Karthik Ramachandran ◽  
Vijay Raghavan

Materialized view selection is one of the crucial decisions in designing a data warehouse for optimal efficiency. Static selection of views may materialize certain views that are not beneficial as the data and usage trends change over time. On the contrary, dynamic selection of views works better only for queries demanding a high degree of aggregation. These facts point to the need for a technique that combines the improved response time of the static approach and the automated tuning capability of the dynamic approach. In this article, we propose a hybrid approach for the selection of materialized views. The idea is to partition the collection of all views into a static and dynamic set such that views selected for materialization from the static set are persistent over multiple query (and maintenance) windows, whereas views selected from the dynamic set can be queried and/or replaced on the fly. Highly aggregated views are selected on the fly based on the query access patterns of users, whereas the more detailed static set of views plays a significant role in the efficient maintenance of the dynamic set of views and in answering certain detailed view queries. We prove that our proposed strategy satisfies the monotonicity requirements, which is essential in order for the greedy heuristic to deliver competitive solutions. Experimental results show that our approach outperforms Dynamat, a well-known dynamic view management system that is known to outperform optimal static view selection.


2015 ◽  
Vol 5 (3) ◽  
pp. 1-25 ◽  
Author(s):  
Biri Arun ◽  
T.V. Vijay Kumar

Data warehouse was designed to cater to the strategic decision making needs of an organization. Most queries posed on them are on-line analytical queries, which are complex and computation intensive in nature and have high query response times when processed against a large data warehouse. This time can be substantially reduced by materializing pre-computed summarized views and storing them in a data warehouse. All possible views cannot be materialized due to storage space constraints. Also, an optimal selection of subsets of views is shown to be an NP-Complete problem. This problem of view selection has been addressed in this paper by selecting a beneficial set of views, from amongst all possible views, using the swarm intelligence technique Marriage in Honey Bees Optimization (MBO). An MBO based view selection algorithm (MBOVSA), which aims to select views that incur the minimum total cost of evaluating all the views (TVEC), is proposed. In MBOVSA, the search has been intensified by incorporating the royal jelly feeding phase into MBO. MBOVSA, when compared with the most fundamental greedy based view selection algorithm HRUA, is able to select comparatively better quality views.


Author(s):  
Arik Sofan Tohir Sofan Tohir ◽  
Kusrini Kusrini ◽  
Sudarmawan Sudarmawan

  Data warehouse is a concept and a technology to store transactional data from several sources that have been through the process of filtering and selection of data. By using the Ectract, Transform and Load (ETL) process in the data warehouse, OLTP data is processed to produce good data and ready for use for the analysis process. For the design of this warehouse data will be built by using the Nine Step Method from Kimbal, so that the resulting warhouse data can be as expected. For the development of life flow system (SDLC) with waterfall model. By using the wate fall model will be built a prototype to implement the data warehouse design results.


Author(s):  
Putu Veda Andreyana ◽  
Putu Angelina Widya ◽  
Made Suartika

The growth of patient data increasing the hospital resulted in even harder to compile data and analyze the data manually, so it takes a data warehouse that can perform this task automatically. The design of the data warehouse Sanglah hospital aims to build a data warehouse that can store data in structured and easier to analyze the data to make a decision. Design methods Fact Constellation Schema and method used is a Nine-step methodology consisting of nine stages, namely the Electoral Process, Selection of Grain, identification and adjustment of the dimensions, the Electoral Facts, Storage pre-calculation in the fact table, Ensuring the dimension tables, the Electoral duration database, Track changes of dimensions is slowly, prioritization and query model. Designing Data Warehouse Sanglah Hospital helpful enough for data processing in large enough quantities, so expect the needs and information about the patient can be met. Data warehouse Sanglah Hospital can be used to analyze patient data in order to get information on the number of patients of various dimensions


2012 ◽  
Vol 229-231 ◽  
pp. 2125-2129
Author(s):  
Robert Halenar

This paper describes how to build routines in Matlab environment for improved ETL process. New architecture real time ETL process stills automated without human – database administrator interference, in cost of reduced accuracy rendered by level of trust. This method is constructed in Matlab environment, due to simple transformation and convert routines and functions. Important is the selection of proper environment for implementation of real time ETL process. Using enhanced architecture of real time ETL process implemented in Matlab make this process stills automated without human – database administrator interference, in cost of reduced accuracy. This method provides a simple transformation and converting routines and functions. First we are describing advantages of Matlab environment followed by real time ETL process description and problem how to achieve real – time in real world. In next part we present our enhanced near real time ETL model with new architecture. And finally we show how to use Matlab routines and toolkit to achieve simplicity in ETL phases of data warehouse needed for simulation and modeling.


Sign in / Sign up

Export Citation Format

Share Document