data marts Latest Research Papers

Optimal data warehouse design with data marts and data cube aggregation

10.1063/5.0066804 ◽

2022 ◽

Author(s):

Zainab Alkhayat ◽

Kadhim B. S. Aljanabi

Keyword(s):

Data Warehouse ◽

Data Cube ◽

Data Marts ◽

Warehouse Design

Application of OLAP Technology in the Environment of Electronic Demography

INFORMACIONNYE TEHNOLOGII ◽

10.17587/it.27.542-549 ◽

2021 ◽

Vol 27 (10) ◽

pp. 542-549

Author(s):

G. Ch. Nabibayova ◽

Keyword(s):

Decision Makers ◽

Practical Application ◽

Demographic Research ◽

Computing Model ◽

Interdisciplinary Field ◽

Parallel Data ◽

Data Marts ◽

Data Volume ◽

Analytical Processing ◽

High Level

The article proposes an approach to the development of an electronic demographic decision support system using technologies of Data Warehouse (DW) and Interactive Analytical Processing OLAP. This makes it possible to conduct high-level demographic research and provide support to decision-makers in demographic sphere. The article notes that demography is an interdisciplinary field of research and is defined as a complex science. Each industry of demography has many indicators. A sample list of these indicators is presented. The main characteristics of the DW, which should be taken into account when developing its architecture, are stated. Among these characteristics, one can find the main defining characteristics of Big Data — volume, velocity, variety, veracity, variability, visualization, value etc. For a more rational and efficient use of a large amount of information, taking into account its constant increase, to ensure the speed of execution of requests for a given system, it is proposed to use a Bus of Interconnected Data Marts (DM) as an architecture of DW. One of the advantages of using DM is that their use assumes distributed parallel data processing. This architecture allows for much faster results generation. It is based on the MapReduce distributed computing model and the Hadoop project. In addition, to effectively use large amounts of data, it is also proposed to use OLAP operations such as roll-up and drill-down, as well as fuzzy set theory, based on the technique of computing with words. The article also shows the practical application of interconnected DM. An OLAP cube is built on the basis of these DM. OLAP operations provide the ability to view cubes in different slices and provide aggregate data.

Experimental Characteristics Study of Data Storage Formats for Data Marts Development within Data Lakes

Applied Sciences ◽

10.3390/app11188651 ◽

2021 ◽

Vol 11 (18) ◽

pp. 8651

Author(s):

Vladimir Belov ◽

Alexander N. Kosenkov ◽

Evgeny Nikulchev

Keyword(s):

Big Data ◽

Data Storage ◽

Storage System ◽

Apache Hadoop ◽

Aggregated Data ◽

Data Marts ◽

Hadoop Platform ◽

Analytical Platforms ◽

Big Data Storage

One of the most popular methods for building analytical platforms involves the use of the concept of data lakes. A data lake is a storage system in which the data are presented in their original format, making it difficult to conduct analytics or present aggregated data. To solve this issue, data marts are used, representing environments of stored data of highly specialized information, focused on the requests of employees of a certain department, the vector of an organization’s work. This article presents a study of big data storage formats in the Apache Hadoop platform when used to build data marts.

A Novel Approach Using Non-Synonymous Materialized Queries for Data Warehousing

International Journal of Data Warehousing and Mining ◽

10.4018/ijdwm.2021070102 ◽

2021 ◽

Vol 17 (3) ◽

pp. 22-43

Author(s):

Sonali Ashish Chakraborty

Keyword(s):

Data Warehouse ◽

Relational Database ◽

Execution Time ◽

Data Warehousing ◽

Multiple Sources ◽

Novel Approach ◽

Union Operation ◽

Data Marts ◽

Using Data

Data from multiple sources are loaded into the organization data warehouse for analysis. Since some OLAP queries are quite frequently fired on the warehouse data, their execution time is reduced by storing the queries and results in a relational database, referred as materialized query database (MQDB). If the tables, fields, functions, and criteria of input query and stored query are the same but the query criteria specified in WHERE or HAVING clause do not match, then they are considered non-synonymous to each other. In the present research, the results of non-synonymous queries are generated by reusing the existing stored results after applying UNION or MINUS operations on them. This will reduce the execution time of non-synonymous queries. For superset criteria values of input query, UNION operation is applied, and for subset values, MINUS operation is applied. Incremental result processing of existing stored results, if required, is performed using Data Marts.

An Approach for Retrieving Faster Query Results From Data Warehouse Using Synonymous Materialized Queries

International Journal of Data Warehousing and Mining ◽

10.4018/ijdwm.2021040105 ◽

2021 ◽

Vol 17 (2) ◽

pp. 85-105

Author(s):

Sonali Ashish Chakraborty ◽

Jyotika Doshi

Keyword(s):

Data Warehouse ◽

Strategic Decision ◽

Maintenance Cost ◽

Multidimensional Data ◽

Materialized Views ◽

Multiple Sources ◽

Query Result ◽

Novel Approach ◽

Data Marts ◽

Analytical Processing

The enterprise data warehouse stores an enormous amount of data collected from multiple sources for analytical processing and strategic decision making. The analytical processing is done using online analytical processing (OLAP) queries where the performance in terms of result retrieval time is an important factor. The major existing approaches for retrieving results from a data warehouse are multidimensional data cubes and materialized views that incur more storage, processing, and maintenance costs. The present study strives to achieve a simpler and faster query result retrieval approach from data warehouse with reduced storage space and minimal maintenance cost. The execution time of frequent queries is saved in the present approach by storing their results for reuse when the query is fired next time. The executed OLAP queries are stored along with the query results and necessary metadata information in a relational database is referred as materialized query database (MQDB). The tables, fields, functions, relational operators, and criteria used in the input query are matched with those of stored query, and if they are found to be same, then the input query and the stored query are considered as a synonymous query. Further, the stored query is checked for incremental updates, and if no incremental updates are required, then the existing stored results are fetched from MQDB. On the other hand, if the stored query requires an incremental update of results, then the processing of only incremental result is considered from data marts. The performance of MQDB model is evaluated by comparing with the developed novel approach, and it is observed that, using MQDB, a significant reduction in query processing time is achieved as compared to the major existing approaches. The developed model will be useful for the organizations keeping their historical records in the data warehouse.

Importance of Data Mart for Data Warehouse

International Journal of Advanced Research in Science, Communication and Technology ◽

10.48175/ijarsct-v2-i3-312 ◽

2021 ◽

pp. 65-68

Author(s):

Hoemra N. Halvadi Assistant Professor

Keyword(s):

Decision Support ◽

Data Warehouse ◽

Market Share ◽

Data Science ◽

Data Repository ◽

Data Mart ◽

Data Marts ◽

Conceptual Data ◽

Gathering Data ◽

Significant Data

In today’s world there are large amount of significant data to counter highly force races, extends market share and improve profitability. for that they required the information in a such way that can be a subject oriented, combined, non-volatile and time-variant. It a conceptual data for data repository of data for gathering data from various sources and merge by the whole enterprise. The data mart is newly progress area of data Science which is to be used to important deployment of decision support ability with the fire answer on investment need by the pace of extant business. Basically, data mart is coming from the data warehouse to emerged the technology and getting data in faster way which is used to merge to create Data warehouse. In the paper we will take review about design and integration of data marts and various techniques used for merged data mart.

Propuesta de solución de inteligencia de negocios para optimizar la planificación de servicios mineros en una empresa de seguridad privada

10.19083/tesis/654009 ◽

2020 ◽

Author(s):

◽

Julia Juro Barrios ◽

Fredy Enrique Salazar Timoteo

Keyword(s):

Data Mart ◽

Data Marts ◽

Toma De Decisiones

La propuesta de solución de inteligencia de negocios pretende contribuir a la organización objeto de estudio la oportunidad que mejorar en la toma de decisiones por parte de los directivos; ya que, se proporcionará la entrega de cuadro de mandos corporativos con información recopilada y centralizada que permita realizar consultas variadas que se encuentren basadas en modelos de estructuras multidimensionales o más conocidos como Cubos OLAP1. La inteligencia de negocios establece un determinado proceso para tratar la información que se encuentra dispersa, así como, mediante la técnica denominada ETL2 que permite convertir datos explotables que serán mostrador en Data marts o cubos OLAP, y, la herramienta que se utiliza para la presentación de los indicadores, reportes o cuadro de mandos con la información recopilada dentro del Data mart será el Power BI. Por ello, se procedió a realizar el análisis del negocio y del proceso de Planificación de Servicios Mineros con el apoyo del marco de referencia Zachman3, luego se detalla la situación actual del proceso y se propone la situación futura. Cuando la propuesta es aceptada, se establece el resultado del proyecto a través de las reglas de negocio, requerimientos funcionales y no funcionales, restricciones, drivers funcionales y los casos de uso del sistema. Con toda la información previa, se procederá a efectuar los drivers arquitectónicos basada en la metodología Kimball4, conceptos de diseño, estilos arquitecturales y tácticas, se adiciona, las especificaciones de los casos de uso del sistema y los prototipos de los cuadros de mandos propuestos. Para finalizar, se mostrará la gestión del proyecto con el apoyo de la Guía del PMBOK5.

Data Discovery Over Time Series From Star Schemas Based on Association, Correlation, and Causality

International Journal of Data Warehousing and Mining ◽

10.4018/ijdwm.2020100106 ◽

2020 ◽

Vol 16 (4) ◽

pp. 95-111

Author(s):

Wallace Anacleto Pinheiro ◽

Geraldo Xexéo ◽

Jano Moreira de Souza ◽

Ana Bárbara Sapienza Pinheiro

Keyword(s):

Time Series ◽

Data Discovery ◽

Neglected Disease ◽

Tegumentary Leishmaniasis ◽

Data Marts ◽

American Tegumentary Leishmaniasis ◽

Over Time

This work proposes a methodology applied to repositories modeled using star schemas, such as data marts, to discover relevant time series relations. This paper applies a set of measures related to association, correlation, and causality to create connections among data. In this context, the research proposes a new causality function based on peaks and values that relate coherently time series. To evaluate the approach, the authors use a set of experiments exploring time series about a particular neglected disease that affects several Brazilian cities called American Tegumentary Leishmaniasis and time series about the climate of some cities in Brazil. The authors populate data marts with these data, and the proposed methodology has generated a set of relations linking the notifications of this disease to the variation of temperature and pluviometry.

Electronic demography decision making system

PROBLEMS IN PROGRAMMING ◽

10.15407/pp2020.02-03.228 ◽

2020 ◽

pp. 228-236

Author(s):

G.Ch. Nabibekova ◽

Keyword(s):

Data Warehouse ◽

Decision Makers ◽

Practical Application ◽

Data Marts ◽

Decision Making System ◽

Demographic Processes ◽

Analytical Processing ◽

Using Data ◽

High Level ◽

Bus Architecture

The article suggests an approach to the development of an electronic demographic decision support system using data warehouse and interactive analytical processing OLAP. This makes it possible to conduct research on demographic processes at a high level and to support decision makers in the field of demography. Due to the presence of many types of demography and a large number of indicators, proposed in the article, a Data Mart Bus Architecture with Linked Dimensional Data Marts is proposed as a Data Warehouse architecture. The article also shows the practical application of this approach using two Data Marts as an example. Based on these Data Marts, OLAP-cubes are built. OLAP operations provide the ability to view cubes in various slices, as well as provide aggregate data.

MATERIALIZED VIEWS QUANTUM OPTIMIZED PICKING for INDEPENDENT DATA MARTS QUALITY

Iraqi Journal of Information & Communications Technology ◽

10.31987/ijict.3.1.88 ◽

2020 ◽

Vol 3 (1) ◽

pp. 26-39

Author(s):

Refed Adnan ◽

Talib M. J. Abbas

Keyword(s):

Response Time ◽

Data Warehouse ◽

Maintenance Cost ◽

Materialized Views ◽

Stochastic Algorithm ◽

Independent Data ◽

Materialized View ◽

Query Response Time ◽

Data Marts ◽

Better Than

Particular and timely unified information along with quick and effective query response times is the basic fundamental requirement for the success of any collection of independent data marts (data warehouse) which forms Fact Constellation Schema or Galaxy Schema. Because of the materialized view storage area, the materialization of all views is practically impossible thus suitable materialized views (MVs) picking is one of the intelligent decisions in designing a Fact Constellation Schema to get optimal efficiency. This study presents a framework for picking best-materialized view using Quantum Particle Swarm Optimization (QPSO) algorithm where it is one of the stochastic algorithm in order to achieve the effective combination of good query response time, low query handling cost and low view maintenance cost. The results reveals that the proposed method for picking best-materialized view using QPSO algorithm is better than other techniques via computing the ratio of query response time and compare it to the response time of the same queries on the materialized views. Ratio of implementing the query on the base table takes five times more time than the query implementation on the materialized views. Where the response time of queries through MVs access were found 0.084 seconds while by direct access queries were found 0.422 seconds. This outlines that the performance of query through materialized views access is 402.38% better than those directly access via data warehouse-logical.

data marts
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Optimal data warehouse design with data marts and data cube aggregation

Application of OLAP Technology in the Environment of Electronic Demography

Experimental Characteristics Study of Data Storage Formats for Data Marts Development within Data Lakes

A Novel Approach Using Non-Synonymous Materialized Queries for Data Warehousing

An Approach for Retrieving Faster Query Results From Data Warehouse Using Synonymous Materialized Queries

Importance of Data Mart for Data Warehouse

Propuesta de solución de inteligencia de negocios para optimizar la planificación de servicios mineros en una empresa de seguridad privada

Data Discovery Over Time Series From Star Schemas Based on Association, Correlation, and Causality

Electronic demography decision making system

MATERIALIZED VIEWS QUANTUM OPTIMIZED PICKING for INDEPENDENT DATA MARTS QUALITY

Export Citation Format

data martsRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Optimal data warehouse design with data marts and data cube aggregation

Application of OLAP Technology in the Environment of Electronic Demography

Experimental Characteristics Study of Data Storage Formats for Data Marts Development within Data Lakes

A Novel Approach Using Non-Synonymous Materialized Queries for Data Warehousing

An Approach for Retrieving Faster Query Results From Data Warehouse Using Synonymous Materialized Queries

Importance of Data Mart for Data Warehouse

Propuesta de solución de inteligencia de negocios para optimizar la planificación de servicios mineros en una empresa de seguridad privada

Data Discovery Over Time Series From Star Schemas Based on Association, Correlation, and Causality

Electronic demography decision making system

MATERIALIZED VIEWS QUANTUM OPTIMIZED PICKING for INDEPENDENT DATA MARTS QUALITY

data marts
Recently Published Documents