Perancangan dan Implementasi Data Warehouse Penjualan (Studi Kasus: Northwind Sample Database)

2021 ◽  
Vol 10 (1) ◽  
pp. 175
Author(s):  
Ni Putu Novia Ardiyanti ◽  
Muhammad Firdaus Zulkarnain ◽  
I Wayan Wijaya Kusuma Sandi ◽  
I Dewa Ngurah Tri Hendrawan ◽  
Ida Bagus Made Mahendra

Analisa yang kompleks sangat dibutuhkan dalam pengambilan keputusan bisnis suatu perusahaan. Untuk mendukung proses analisa tersebut, digunakan data warehouse yang dianggap efektif membantu proses analisis, perancangan dan pengambilan keputusan bisnis. Umumnya perusahaan akan membangun data warehouse untuk menyimpan data operasional yang berguna dalam proses analisa bisnis, sehingga informasi yang diinginkan perusahaan dapat diperoleh dengan lebih mudah. Pada penelitian ini akan dilakukan proses perancangan dan implementasi data warehouse, yang menggunakan database northwind sebagai sumber datanya. Untuk merancang data warehouse, digunakan metode nine-step design methodology dan Pentaho Data Integration Software untuk proses implementasinya. Dari hasil perancangan dan implementasi tersebut akan terbentuk sebuah tabel fakta penjualan yang berisi informasi – informasi yang berguna untuk membantu analisis dan pengambilan keputusan bisnis perusahaan, yang pada penelitian ini divisualisasikan dengan menggunakan aplikasi Microsoft Power Business Intelligence.

Author(s):  
Tom Breur

Business Intelligence (BI) projects that involve substantial data integration have often proven failure-prone and difficult to plan. Data quality issues trigger rework, which makes it difficult to accurately schedule deliverables. Two things can bring improvement. Firstly, one should deliver information products in the smallest possible chunks, but without adding prohibitive overhead for breaking up the work in tiny increments. This will increase the frequency and improve timeliness of feedback on suitability of information products and hence make planning and progress more predictable. Secondly, BI teams need to provide better stewardship when they facilitate discussions between departments whose data cannot easily be integrated. Many so-called data quality errors do not stem from inaccurate source data, but rather from incorrect interpretation of data. This is mostly caused by different interpretation of essentially the same underlying source system facts across departments with misaligned performance objectives. Such problems require prudent stakeholder management and informed negotiations to resolve such differences. In this chapter, the authors suggest an innovation to data warehouse architecture to help accomplish these objectives.


Entity Resolution (ER) is the process of identifying records that refer to the same real-world entity. It plays a key role in many applications as data warehouse, data integration, and business intelligence. Comparing every record with all corresponding records is infeasible especially for a big dataset. To overcome such a problem, blocking techniques have been implemented. In this paper, we propose a novel Efficient Multi-Phase Blocking Strategy (EMPBS) for resolving duplicates in big data. As per our knowledge, some state of art blocking techniques may result in overlapping blocks (i.e. Q-grams) which cause redundant comparisons and hence increase the time complexity. Our proposed blocking strategy has disjoint blocks and less time complexity compared to Q-grams and slandered blocking techniques. In addition, EMPBS is general and requires no restrictions on the type of blocking keys. EMPBS consists of three phases. The first one generates three single efficient blocking keys. The second phase takes the output of the first phase as an input to construct a compound key. The compound key is composed of concatenation of two single blocking keys. Three compound blocking keys are the output of this phase that will be used as an input for the last phase, which is generating the Efficient Multi-Phase Blocking Key (EMPBK). EMPBK is constructed using the union of two compound blocking keys. The implementation of EMPBS presents promising results in terms of Reduction Ratio (RR) as it achieves a higher value of RR than adopting only a single blocking key, while at the same time maintains nearly the same precision and recall. EMPBS reduced about 84% of the average number of comparisons accomplished in a single blocking key. To evaluate EMPBS, we developed a Duplicate Generation tool (DupGen) that accepts a clean semi-structured file as an input and generates labeled duplicate records according to certain criteria.


Author(s):  
Indrabudhi Lokaadinugroho ◽  
Abba Suganda Girsang ◽  
Burhanudin Burhanudin

This paper discusses about how to build a data warehouse (DW) in business intelligence (BI) for a typical marketing division in a university. This study uses a descriptive method that attempts to describe the object or subject under study as it is, with the aim of systematically describing the facts and characteristics of the object under study precisely. In the elaboration of the methodology, there are four phases that include the identification and source data collection phase, the analysis phase, the design phase, and then the results phase of each detail in accordance with the nine steps of Kimball’s data warehouse and the Pentaho Data Integration (PDI). The result is a tableau as a tool of BI that does not have complete ETL tools. So, the process approach in combining PDI and DW as a data source certainly makes a tableau as a BI tool more useful in presenting data thus minimizing the time needed to obtain strategic data from 2-3 weeks to 77 minutes.


Author(s):  
Vladimir Alberto Torres Torres ◽  
Édgar Núñez Torres ◽  
Yanet Molina Hernández ◽  
Daykenis Caballero feria ◽  
Yanet Peña González ◽  
...  

La Inteligencia de Negocios es una estrategia que ha alcanzado un nivel elevado en la competitividad empresarial. Aplicar una solución de Inteligencia de Negocios parte de los sistemas de origen de datos que posee una organización, apoyándose de un conjunto de herramientas encargadas de la extracción, depuración y consolidación de los datos. Esta información será almacenada en un Data Warehouse o en los Data Mart, los cuales son unidades más pequeñas orientadas a áreas específicas o un tema en particular. Esta investigación realiza el diseño e implementación de un Data Mart como solución de Inteligencia de Negocios para los servicios de alimentación prestados por la Empresa de Servicios a la Unión del Níquel (Esuni), radicada en Moa (Cuba). Se emplearon las herramientas Pentaho Bussiness Intelligence, Pentaho Data Integration 4.2.1, Pentaho Schema Workbench, PostgreSQL 9.0 y Embarcadero ERStudio 8.0.que permitieron la construcción del Data Mart y fue seleccionada la metodología Ralph Kimball para el diseño de la arquitectura y Hefesto para el desarrollo del mercado de datos, permitiendo que la información generada por los servicios gastronómicos se encuentre en un lugar específico, depurada y consolidada sirva como soporte a la toma de decisiones en la empresa.Palabras claves: Servicios Gastronómicos, Mercado de datos, Pentaho.The Intelligence of Business is a strategy that has reached a high level when of managerial competitiveness it is. Applying a solution of Business Intelligence it begin with the systems data origin that it possesses a company, leaning on a tools group in charge of the extraction, purification and consolidation of the data. This information will be stored in a Data Warehouse or in Data Mart which are smaller units guided in to specific areas or a particular topic. In this investigation is carried out the design and implementation of a Data Mart like solution of Business Intelligence for gastronomic services for the Company of Services to the Union of Nickel which resides in the municipality of Moa. Several tools were used that allowed the construction of the Data Mart and Hefesto was the methodology selected for the development of the same. Allowing that all the information generated by the gastronomic services is in a specific place purified and consolidated serves like support to the taking of decisions in the gastronomic services of the Esuni.Keywords: Food Services, Data Mart, Pentaho


Author(s):  
Harkiran Kaur ◽  
Kawaljeet Singh ◽  
Tejinder Kaur

Background: Numerous E – Migrants databases assist the migrants to locate their peers in various countries; hence contributing largely in communication of migrants, staying overseas. Presently, these traditional E – Migrants databases face the issues of non – scalability, difficult search mechanisms and burdensome information update routines. Furthermore, analysis of migrants’ profiles in these databases has remained unhandled till date and hence do not generate any knowledge. Objective: To design and develop an efficient and multidimensional knowledge discovery framework for E - Migrants databases. Method: In the proposed technique, results of complex calculations related to most probable On-Line Analytical Processing operations required by end users, are stored in the form of Decision Trees, at the pre- processing stage of data analysis. While browsing the Cube, these pre-computed results are called; thus offering Dynamic Cubing feature to end users at runtime. This data-tuning step reduces the query processing time and increases efficiency of required data warehouse operations. Results: Experiments conducted with Data Warehouse of around 1000 migrants’ profiles confirm the knowledge discovery power of this proposal. Using the proposed methodology, authors have designed a framework efficient enough to incorporate the amendments made in the E – Migrants Data Warehouse systems on regular intervals, which was totally missing in the traditional E – Migrants databases. Conclusion: The proposed methodology facilitate migrants to generate dynamic knowledge and visualize it in the form of dynamic cubes. Applying Business Intelligence mechanisms, blending it with tuned OLAP operations, the authors have managed to transform traditional datasets into intelligent migrants Data Warehouse.


Author(s):  
Michael Yulianto ◽  
Abba Suganda Girsang ◽  
Reinert Yosua Rumagit

Electronic ticket (eticket) provider services are growing fast in Indonesia, makingthe competition between companies increasingly intense. Moreover, most of them have the sameservice or feature for serving their customers. To get back the feedback of their customers, manycompanies use social media (Facebook and Twitter) for marketing activity or communicatingdirectly with their customers. The development of current technology allows the company totake data from social media. Thus, many companies take social media data for analyses. Thisstudy proposed developing a data warehouse to analyze data in social media such as likes,comments, and sentiment. Since the sentiment is not provided directly from social media data,this study uses lexicon based classification to categorize the sentiment of users’ comments. Thisdata warehouse provides business intelligence to see the performance of the company based ontheir social media data. The data warehouse is built using three travel companies in Indonesia.As a result, this data warehouse provides the comparison of the performance based on the socialmedia data.


2021 ◽  
Author(s):  
Monkgogi Mudongo ◽  
Edwin Thuma ◽  
Nkwebi Peace Motlogelwa ◽  
Tebo Leburu-Dingalo ◽  
Pulafela Majoo

Road traffic accidents are a serious problem for the nation of Botswana. A large amount of money is used to compensate those who are affected by road accidents. Traffic accidents are one of the major causes of Deaths in Botswana. It is important for relevant organizations to have a reliable source of data for accurate evaluation of traffic accidents. Similarly, data on vehicle registration must be transformed and be readily available to assist managerial decision makers. In this article, we deploy a Business Intelligence (BI) and Data Warehouse (DW) solution in an attempt to assist the relevant departments in their road traffic accidents and vehicle registration evaluation. In Our evaluation of the traffic accidents our findings suggest that across accident severity, Damage Only accidents had the most interesting recent trend with a 11.93% decrease in the last 3 years on record. Count of Accident Severity for Damage Only accidents dropped from 13,491 to 11,881 between 2018 and 2020 whilst Minor accidents experienced the longest period of growth. Most accidents take place in rural locations and more accidents take place during the weekend. At 28,439, Sunday had the highest number of accidents and was 47.59% higher than Wednesday, which had the lowest count of accidents at 19,269. The results for vehicle registration reveal that the number of vehicle registration decreased for the last 3 years on record. The number of vehicles registered dropped from 65535 to 24457 during its steepest decline between 2019 and 2021.


Sign in / Sign up

Export Citation Format

Share Document