Web Technologies and Data Warehousing Synergies

Data Warehousing and Mining ◽

10.4018/978-1-59904-951-9.ch214 ◽

2008 ◽

pp. 3411-3415

Author(s):

John M. Artz

Keyword(s):

Data Warehouse ◽

Web Site ◽

Relational Databases ◽

Data Warehousing ◽

Emerging Technology ◽

Web Technologies ◽

The Past ◽

Large Sets ◽

The Web

Data warehousing is an emerging technology that greatly extends the capabilities of relational databases specifically in the analysis of very large sets of time-oriented data. The emergence of data warehousing has been somewhat eclipsed over the past decade by the simultaneous emergence of Web technologies. However, Web technologies and data warehousing have some natural synergies that are not immediately obvious. First, Web technologies make data warehouse data more easily available to a much wider variety of users. Second, data warehouse technologies can be used to analyze traffic to a Web site in order to gain a much better understanding of the visitors to the Web site. It is this second synergy that is the focus of this article.

Download Full-text

From Web Log to Data Warehouse

Managing Internet and Intranet Technologies in Organizations ◽

10.4018/978-1-878289-95-7.ch012 ◽

2001 ◽

pp. 203-216

Author(s):

John M. Artz

Keyword(s):

Data Warehouse ◽

Web Site ◽

Relational Databases ◽

Data Warehousing ◽

Research Data ◽

Data Set ◽

Web Technologies ◽

Web Log ◽

Large Sets ◽

The Web

Data warehousing is an emerging technology that greatly extends the capabilities of relational databases specifically in the analysis of very large sets of time-oriented data. The emergence of data warehousing has been somewhat eclipsed by the simultaneous emergence of Web technologies. However, Web technologies and data warehousing have some natural synergies that are just now being recognized. First, Web technologies make data warehouse data more easily available to a much wider variety of users both internally and externally. Since the value of data is directly related to its availability for exploitation, Internets and intranets help increase the value of the data in the warehouse. Second, data warehouse technologies can be used to analyze traffic to a Web site in a wide variety of ways in order to make the Web site more effective. This chapter will focus on the latter of these synergies and show, through an evolving example, how a simple data set from the Web log can be enhanced, in a step-wise fashion, into a full-fledged market research data warehouse.

Download Full-text

Humanitites Data Warehousing

Data Warehousing and Mining ◽

10.4018/978-1-59904-951-9.ch141 ◽

2008 ◽

pp. 2364-2370

Author(s):

Janet Delve

Keyword(s):

Data Warehouse ◽

Relational Databases ◽

Data Warehousing ◽

Numerical Data ◽

Complex Nature ◽

Data Warehouses ◽

Textual Data ◽

Numeric Data ◽

First Time ◽

And Linguistics

Data Warehousing is now a well-established part of the business and scientific worlds. However, up until recently, data warehouses were restricted to modeling essentially numerical data – examples being sales figures in the business arena (e.g. Wal-Mart’s data warehouse) and astronomical data (e.g. SKICAT) in scientific research, with textual data providing a descriptive rather than a central role. The lack of ability of data warehouses to cope with mainly non-numeric data is particularly problematic for humanities1 research utilizing material such as memoirs and trade directories. Recent innovations have opened up possibilities for non-numeric data warehouses, making them widely accessible to humanities research for the first time. Due to its irregular and complex nature, humanities research data is often difficult to model and manipulating time shifts in a relational database is problematic as is fitting such data into a normalized data model. History and linguistics are exemplars of areas where relational databases are cumbersome and which would benefit from the greater freedom afforded by data warehouse dimensional modeling.

Download Full-text

BST Algorithm for Duplicate Elimination in Data Warehouse

INTERNATIONAL JOURNAL OF MANAGEMENT & INFORMATION TECHNOLOGY ◽

10.24297/ijmit.v4i1.4636 ◽

2013 ◽

Vol 4 (1) ◽

pp. 190-197

Author(s):

Payal Pahwa ◽

Rashmi Chhabra

Keyword(s):

Data Quality ◽

Data Warehouse ◽

Data Warehousing ◽

Data Cleaning ◽

Emerging Technology ◽

Quality Data ◽

Business Organization ◽

Data Cleansing ◽

Business Decisions ◽

Related Data

Data warehousing is an emerging technology and has proved to be very important for an organization. Today every business organization needs accurate and large amount of information to make proper decisions. For taking the business decisions the data should be of good quality. To improve the data quality data cleansing is needed. Data cleansing is fundamental to warehouse data reliability, and to data warehousing success. There are various methods for datacleansing. This paper addresses issues related data cleaning. We focus on the detection of duplicate records. Also anefficient algorithm for data cleaning is proposed. A review of data cleansing methods and comparison between them is presented.

Download Full-text

Humanities Data Warehousing

Encyclopedia of Data Warehousing and Mining, Second Edition ◽

10.4018/978-1-60566-010-3.ch153 ◽

2011 ◽

pp. 987-992

Author(s):

Janet Delve

Keyword(s):

Data Warehouse ◽

Relational Databases ◽

Data Warehousing ◽

Numerical Data ◽

Complex Nature ◽

Data Warehouses ◽

Textual Data ◽

Numeric Data ◽

First Time ◽

And Linguistics

Data Warehousing is now a well-established part of the business and scientific worlds. However, up until recently, data warehouses were restricted to modeling essentially numerical data – examples being sales figures in the business arena (in say Wal-Mart’s data warehouse (Westerman, 2000)) and astronomical data (for example SKICAT) in scientific research, with textual data providing a descriptive rather than a central analytic role. The lack of ability of data warehouses to cope with mainly non-numeric data is particularly problematic for humanities1 research utilizing material such as memoirs and trade directories. Recent innovations have opened up possibilities for ‘non-numeric’ data warehouses, making them widely accessible to humanities research for the first time. Due to its irregular and complex nature, humanities research data is often difficult to model, and manipulating time shifts in a relational database is problematic as is fitting such data into a normalized data model. History and linguistics are exemplars of areas where relational databases are cumbersome and which would benefit from the greater freedom afforded by data warehouse dimensional modeling.

Download Full-text

The Development of Ordered SQL Packages to Support Data Warehousing

Data Warehousing and Web Engineering ◽

10.4018/978-1-931777-02-5.ch018 ◽

2011 ◽

pp. 285-311

Author(s):

Wilfred Ng ◽

Mark Levene

Keyword(s):

Data Warehouse ◽

Relational Databases ◽

Corporate Strategy ◽

Data Warehousing ◽

Effective Means ◽

Relational Model ◽

Minimal Extension ◽

Wide Range ◽

Partial Orderings ◽

Advanced Applications

Data warehousing is a corporate strategy that needs to integrate information from several sources of separately developed Database Management Systems (DBMSs). A future DBMS of a data warehouse should provide adequate facilities to manage a wide range of information arising from such integration. We propose that the capabilities of database languages should be enhanced to manipulate user-defined data orderings, since business queries in an enterprise usually involve order. We extend the relational model to incorporate partial orderings into data domains and describe the ordered relational model. We have already defined and implemented a minimal extension of SQL, called OSQL, which allows querying over ordered relational databases. One of the important facilities provided by OSQL is that it allows users to capture the underlying semantics of the ordering of the data for a given application. Herein we demonstrate that OSQL aided with a package discipline can be an effective means to manage the inter-related operations and the underlying data domains of a wide range of advanced applications that are vital in data warehousing, such as temporal, incomplete and fuzzy information. We present the details of the generic operations arising from these applications in the form of three OSQL packages called: OSQL_TIME, OSQL_INCOMP and OSQL_FUZZY.

Download Full-text

The Contribution of GEOS to the Study of Stellar Pulsations

International Astronomical Union Colloquium ◽

10.1017/s0252921100015906 ◽

2002 ◽

Vol 185 ◽

pp. 166-167 ◽

Cited By ~ 1

Author(s):

R. Boninsegna ◽

J. Vandenbroere ◽

J.F. Le Borgne ◽

Keyword(s):

Web Site ◽

Variable Stars ◽

Light Curves ◽

Maximum Brightness ◽

Eclipsing Binary ◽

Large Sample ◽

The Past ◽

Stellar Pulsations ◽

Be Star ◽

The Web

The GEOS, Groupe Européen d’Observation Stellaires, is composed of observers living in France, Italy, Spain, Belgium and Switzerland. The main purpose is to give amateur astronomers the opportunity of carrying out scientific analyses in specific fields. Further details can be found at the web site http://www.upv.es/geos/In the past years, GEOS has approached the analysis of visual estimates of variable stars (Ralincourt et al., 1987) in an original manner, obtaining accurate light curves on red semiregulars. Moreover, the collaboration with teams of professional researchers allowed the group to obtain interesting results on the double-mode Cepheid EW Set (Figer et al., 1991), on the Be star OT Gem (Arellano Ferro et al., 1998) and on the eclipsing binary V753 Cyg (Beltraminelli et al., 2000). More recently, several GEOS members have started collecting a large sample of times of maximum brightness of RR Lyr stars in order to build-up a database as extensive as possible. Such a database is decribed below together with the results of a campaign on RR Lyr itself.

Download Full-text

Humanitites Data Warehousing

Encyclopedia of Data Warehousing and Mining ◽

10.4018/978-1-59140-557-3.ch108 ◽

2011 ◽

pp. 570-574

Author(s):

Janet Delve

Keyword(s):

Data Warehouse ◽

Relational Databases ◽

Data Warehousing ◽

Numerical Data ◽

Complex Nature ◽

Data Warehouses ◽

Textual Data ◽

Numeric Data ◽

First Time ◽

And Linguistics

Data Warehousing is now a well-established part of the business and scientific worlds. However, up until recently, data warehouses were restricted to modeling essentially numerical data – examples being sales figures in the business arena (e.g. Wal-Mart’s data warehouse) and astronomical data (e.g. SKICAT) in scientific research, with textual data providing a descriptive rather than a central role. The lack of ability of data warehouses to cope with mainly non-numeric data is particularly problematic for humanities1 research utilizing material such as memoirs and trade directories. Recent innovations have opened up possibilities for non-numeric data warehouses, making them widely accessible to humanities research for the first time. Due to its irregular and complex nature, humanities research data is often difficult to model and manipulating time shifts in a relational database is problematic as is fitting such data into a normalized data model. History and linguistics are exemplars of areas where relational databases are cumbersome and which would benefit from the greater freedom afforded by data warehouse dimensional modeling.

Download Full-text