Exact estimates of the probability of overfitting for multidimensional modeling families of algorithms

2011 ◽  
Vol 21 (1) ◽  
pp. 52-65 ◽  
Author(s):  
P. V. Botov
Author(s):  
Oscar Romero ◽  
Alberto Abelló

In the last years, data warehousing systems have gained relevance to support decision making within organizations. The core component of these systems is the data warehouse and nowadays it is widely assumed that the data warehouse design must follow the multidimensional paradigm. Thus, many methods have been presented to support the multidimensional design of the data warehouse.The first methods introduced were requirement-driven but the semantics of the data warehouse (since the data warehouse is the result of homogenizing and integrating relevant data of the organization in a single, detailed view of the organization business) require to also consider the data sources during the design process. Considering the data sources gave rise to several data-driven methods that automate the data warehouse design process, mainly, from relational data sources. Currently, research on multidimensional modeling is still a hot topic and we have two main research lines. On the one hand, new hybrid automatic methods have been introduced proposing to combine data-driven and requirement-driven approaches. These methods focus on automating the whole process and improving the feedback retrieved by each approach to produce better results. On the other hand, some new approaches focus on considering alternative scenarios than relational sources. These methods also consider (semi)-structured data sources, such as ontologies or XML, that have gained relevance in the last years. Thus, they introduce innovative solutions for overcoming the heterogeneity of the data sources. All in all, we discuss the current scenario of multidimensional modeling by carrying out a survey of multidimensional design methods. We present the most relevant methods introduced in the literature and a detailed comparison showing the main features of each approach.


2014 ◽  
Vol 24 ◽  
pp. 90-106 ◽  
Author(s):  
Kamal Boulil ◽  
Florence Le Ber ◽  
Sandro Bimonte ◽  
Corinne Grac ◽  
Flavie Cernesson

Sign in / Sign up

Export Citation Format

Share Document