Towards Comparative Analysis of Resumption Techniques in ETL

2021 ◽  
Vol 3 (2) ◽  
pp. 82
Author(s):  
Mohammed Muddasir ◽  
Raghuveer K ◽  
Dayanand R

Data warehouses are loaded with data from sources such as operational databases. Failure of the loading process, or of any constituent process such as extraction or transformation, is expensive because it leaves data unavailable for analysis. With the advent of e-commerce and many real-time applications, analysis of data in real time has become the norm, so any misses while data is being loaded into the data warehouse need to be handled in an efficient and optimized way. Techniques for handling the failure of the population process are as important as the loading process itself. Alternative arrangements need to be made in case of failure so that the data warehouse is still populated on time. This paper explores the various ways in which a failed process of populating the data warehouse can be resumed. Various resumption techniques are compared, and a novel block-based technique is proposed to improve one of the existing resumption techniques.
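To make the block-based idea concrete, here is a minimal sketch (not the paper's actual implementation; all names are hypothetical): source rows are loaded in fixed-size blocks, the index of each committed block is checkpointed, and a restarted load skips blocks that were already committed before the failure.

```python
# Hypothetical sketch of block-based resumption for a failed warehouse load.

def load_in_blocks(rows, block_size, sink, checkpoint):
    """Load `rows` into `sink` in fixed-size blocks, skipping blocks whose
    index is already recorded in `checkpoint` (a set of completed blocks)."""
    blocks = [rows[i:i + block_size] for i in range(0, len(rows), block_size)]
    for idx, block in enumerate(blocks):
        if idx in checkpoint:      # block was committed before the failure
            continue
        sink.extend(block)         # "commit" the block to the warehouse
        checkpoint.add(idx)        # record progress for a later resumption

rows = list(range(10))
sink, ckpt = [], set()

# First run fails partway: only the first 6 source rows were seen.
load_in_blocks(rows[:6], 3, sink, ckpt)

# Resumed run re-reads the full source but skips the completed blocks,
# so no row is loaded twice.
load_in_blocks(rows, 3, sink, ckpt)
```

The checkpoint set stands in for durable state (e.g. a control table in the warehouse); the point is that resumption cost is proportional to the unloaded blocks, not the whole source.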

Author(s):  
Kheri Arionadi Shobirin ◽  
Adi Panca Saputra Iskandar ◽  
Ida Bagus Alit Swamardika

A data warehouse is a central repository of integrated data from one or more disparate sources, bringing operational data from On-Line Transaction Processing (OLTP) systems into use for decision-making strategy and business intelligence via On-Line Analytical Processing (OLAP) techniques. Data warehouses support OLAP applications by storing and maintaining data in multidimensional format. Multidimensional data models, an integral part of OLAP, are designed to solve complex query analysis in real time.
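A minimal sketch of the multidimensional idea (illustrative only; the dimensions and figures are invented): facts are keyed by tuples of dimension values, and a roll-up aggregates the measure over every dimension not kept.

```python
from collections import defaultdict

# Tiny multidimensional model: each fact is keyed by a
# (year, region, product) tuple; the value is the measure.
facts = {
    ("2021", "EU", "laptop"): 120,
    ("2021", "EU", "phone"):   80,
    ("2021", "US", "laptop"): 200,
}

def rollup(facts, keep):
    """Aggregate the measure over all dimensions whose index is
    not listed in `keep` (indices into the dimension tuple)."""
    out = defaultdict(int)
    for dims, measure in facts.items():
        out[tuple(dims[i] for i in keep)] += measure
    return dict(out)

# Total sales per region (dimension index 1):
by_region = rollup(facts, keep=[1])
```

Real OLAP engines precompute and index such aggregates; the sketch only shows the shape of the operation.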


2016 ◽  
Vol 6 (6) ◽  
pp. 1241-1244 ◽  
Author(s):  
M. Faridi Masouleh ◽  
M. A. Afshar Kazemi ◽  
M. Alborzi ◽  
A. Toloie Eshlaghy

Extraction, Transformation and Loading (ETL) is one of the notable subjects in the optimization, management, improvement and acceleration of processes and operations in databases and data warehouses. The creation of ETL processes is potentially one of the greatest tasks in building a data warehouse, and so their production is a time-consuming and complicated procedure. Without optimization of these processes, the implementation of data warehouse projects is costly, complicated and time-consuming. The present paper combined parallelization methods with shared cache memory in distributed systems based on a data warehouse. According to the conducted assessment, the proposed method exhibited a 7.1% speed improvement over the Kettle optimization instrument and 7.9% over the Talend instrument in terms of ETL process execution time. Therefore, parallelization can notably improve the ETL process. It ultimately allows the management and integration of big data to be implemented in a simple way and with acceptable speed.
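The combination of parallel workers with a shared cache can be sketched as follows (a simplified, hypothetical illustration, not the paper's distributed implementation): several transform workers run concurrently, and an expensive dimension-key lookup is memoized in one cache shared by all of them.

```python
from concurrent.futures import ThreadPoolExecutor
from threading import Lock

# Shared lookup cache for all transform workers, guarded by a lock.
cache, lock = {}, Lock()

def lookup_surrogate_key(natural_key):
    """Stand-in for an expensive dimension lookup, memoized so each
    distinct key is resolved only once across all parallel workers."""
    with lock:
        if natural_key not in cache:
            cache[natural_key] = len(cache) + 1  # placeholder for a DB round-trip
        return cache[natural_key]

def transform(row):
    # ETL transform step: replace the natural key with its surrogate key.
    return (lookup_surrogate_key(row[0]), row[1])

rows = [("cust-a", 10), ("cust-b", 20), ("cust-a", 30)]
with ThreadPoolExecutor(max_workers=4) as pool:
    loaded = list(pool.map(transform, rows))   # parallel transform phase
```

The second occurrence of `"cust-a"` hits the cache instead of repeating the lookup; in a distributed setting the same role is played by a shared in-memory cache rather than a thread-local dictionary.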


2008 ◽  
pp. 2749-2761
Author(s):  
Hugh J. Watson ◽  
Barbara H. Wixom ◽  
Dale L. Goodhue

Data warehouses are helping resolve a major problem that has plagued decision support applications over the years: a lack of good data. Top management at 3M realized that the company had to move from being product-centric to being customer savvy. In response, 3M built a terabyte data warehouse (global enterprise data warehouse) that provides thousands of 3M employees with real-time access to accurate, global, detailed information. The data warehouse underlies new Web-based customer services that are dynamically generated based on warehouse information. Useful lessons were learned at 3M during its years of developing the data warehouse.


2013 ◽  
Vol 9 (2) ◽  
pp. 21-38 ◽  
Author(s):  
Florian Waas ◽  
Robert Wrembel ◽  
Tobias Freudenreich ◽  
Maik Thiele ◽  
Christian Koncilia ◽  
...  

In a typical BI infrastructure, data extracted from operational data sources is transformed, cleansed, and loaded into a data warehouse by a periodic ETL process, typically executed on a nightly basis, i.e., a full day's worth of data is processed and loaded during off-hours. However, it is desirable to have fresher data for business insights in near real time. To this end, the authors propose to leverage a data warehouse's capability to directly import raw, unprocessed records and defer the transformation and data cleansing until needed by pending reports. At that time, the database's own processing mechanisms can be deployed to process the data on demand. Event-processing capabilities are seamlessly woven into the proposed architecture. Besides outlining an overall architecture, the authors also developed a roadmap for implementing a complete prototype using conventional database technology in the form of hierarchical materialized views.
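The deferred-transformation idea can be sketched in a few lines (a toy illustration under invented names, not the authors' materialized-view prototype): raw records are imported immediately with no processing, and cleansing runs lazily the first time a report asks for the data, with the cleansed result cached until new raw data invalidates it.

```python
# Sketch of lazy ETL: import raw records now, cleanse on first report access.

raw_table = []       # landing area: unprocessed records, loaded in near real time
clean_cache = None   # stand-in for a materialized view, rebuilt lazily

def ingest(record):
    """Near-real-time import: no transformation at load time."""
    global clean_cache
    raw_table.append(record)
    clean_cache = None   # invalidate the cached view; rebuild on next read

def report_view():
    """Cleansing is deferred until a pending report needs the data."""
    global clean_cache
    if clean_cache is None:
        # On-demand transform: trim whitespace, drop empties, normalize case.
        clean_cache = [r.strip().lower() for r in raw_table if r.strip()]
    return clean_cache

ingest("  Alice ")
ingest("")
ingest("BOB")
result = report_view()   # the transformation happens here, on demand
```

In the paper's setting, the cached list corresponds to a hierarchy of materialized views inside the database, and the invalidation is handled by the database's own view-maintenance machinery.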


2012 ◽  
Vol 11 (9) ◽  
pp. 1310-1315
Author(s):  
Xuedong Du ◽  
Guilin Li ◽  
Jiangtao Ji ◽  
Xiaomei Tan

Author(s):  
Munesh Chandra Trivedi ◽  
Virendra Kumar Yadav ◽  
Avadhesh Kumar Gupta

A data warehouse generally contains both types of data, i.e. historical and current data from various data sources. A data warehouse, in the world of computing, can be defined as a system created for the analysis and reporting of both of these types of data. These analysis reports are then used by an organization to make decisions that help its growth. Construction of a data warehouse appears simple: collection of data from data sources into one place (after extraction, transformation and loading). But construction involves several issues such as inconsistent data, logic conflicts, user acceptance, cost, quality, security, stakeholders' contradictions, REST alignment, etc. These issues need to be overcome, otherwise they will lead to unfortunate consequences affecting the organization's growth. The proposed model tries to solve issues such as REST alignment and stakeholders' contradictions by involving experts from various domains (technical, analytical, decision makers, management representatives, etc.) during the initialization phase, to better understand the requirements, and by mapping these requirements to data sources during the design phase of the data warehouse.


2011 ◽  
pp. 202-216 ◽  
Author(s):  
Hugh J. Watson ◽  
Barbara H. Wixom ◽  
Dale L. Goodhue

Data warehouses are helping resolve a major problem that has plagued decision support applications over the years: a lack of good data. Top management at 3M realized that the company had to move from being product-centric to being customer savvy. In response, 3M built a terabyte data warehouse (global enterprise data warehouse) that provides thousands of 3M employees with real-time access to accurate, global, detailed information. The data warehouse underlies new Web-based customer services that are dynamically generated based on warehouse information. Useful lessons were learned at 3M during its years of developing the data warehouse.

