A Cost-Based Range Estimation for Mapping Top-k Selection Queries over Relational Databases

Finding efficient methods for supporting top-k relational queries has received significant attention in academic research. One of the approaches in the recent literature is query-mapping, in which top-k queries are mapped (translated) into equivalent range queries that relational database systems (RDBMSs) normally support. This approach combines the advantage of simplicity as well as practicality by avoiding the need for modifications to the query engine, or specialized data structures or indexing techniques to handle top-k queries separately. However, existing methods following this approach fall short of adequately modeling the problem environment and providing consistent results. In this article, the authors propose a cost-based range estimation model for the query-mapping approach. They provide a methodology for trading-off relevant query execution cost components and mapping a top-k query into a cost-optimal range query for efficient execution. Their experiments on real world and synthetic data sets show that the proposed strategy not only avoids the need to calibrate workloads on specific database contents, but also performs at least as well as prior methods.

Download Full-text

Cost Modeling and Range Estimation for Top-k Retrieval in Relational Databases

Theoretical and Practical Advances in Information Systems Development ◽

10.4018/978-1-60960-521-6.ch012 ◽

2011 ◽

pp. 295-315

Author(s):

Anteneh Ayanso ◽

Paulo B. Goes ◽

Kumar Mehta

Keyword(s):

Relational Databases ◽

Estimation Method ◽

Synthetic Data ◽

Cost Modeling ◽

Data Sets ◽

Range Estimation ◽

Mapping Techniques ◽

The Cost ◽

Query Estimation ◽

Query Mapping

Relational databases have increasingly become the basis for a wide range of applications that require efficient methods for exploratory search and retrieval. Top-k retrieval addresses this need and involves finding a limited number of records whose attribute values are the closest to those specified in a query. One of the approaches in the recent literature is query-mapping which deals with converting top-k queries into equivalent range queries that relational database management systems (RDBMSs) normally support. This approach combines the advantages of simplicity as well as practicality by avoiding the need for modifications to the query engine, or specialized data structures and indexing techniques to handle top-k queries separately. This paper reviews existing query-mapping techniques in the literature and presents a range query estimation method based on cost modeling. Experiments on real world and synthetic data sets show that the cost-based range estimation method performs at least as well as prior methods and avoids the need to calibrate workloads on specific database contents.

Download Full-text

An Evaluation of Dynamic Electronic Catalog Models in Relational Database Systems

Managing E-Commerce and Mobile Computing Technologies ◽

10.4018/978-1-93177-746-9.ch007 ◽

2011 ◽

pp. 73-90 ◽

Cited By ~ 1

Author(s):

Kiryoong Kim ◽

Dongkyu Kim ◽

Jeuk Kim ◽

Sang-uk Park ◽

Ighoon Lee ◽

...

Keyword(s):

Electronic Commerce ◽

Relational Database ◽

Relational Databases ◽

Database Systems ◽

Relational Schemas ◽

Electronic Catalogs ◽

Electronic Catalog ◽

Software Product ◽

New Models ◽

Relational Database Systems

Electronic catalogs are electronic representations about products and services in the electronic commerce environment and require diverse and flexible schemas. Although relational database systems seem to be an obvious choice for their storage, traditional designs of relational schemas do not support electronic catalogs in the most effective ways. Therefore, new models for managing diverse and flexible schemas in relational databases are required for such systems. Proposed in this paper are several models for electronic catalogs using relational tables, and an experimental evaluation of their efficiency. The results of this study can be put to practical use and are, in fact, being applied in the design of a commercial software product.

Download Full-text

Performance analysis of relational databases Oracle and MS SQL based on desktop application

Journal of Computer Sciences Institute ◽

10.35784/jcsi.693 ◽

2018 ◽

Vol 8 ◽

pp. 263-269

Author(s):

Grzegorz Dziewit ◽

Jakub Korczyński ◽

Maria Skublewska-Paszkowska

Keyword(s):

Binary Data ◽

Relational Databases ◽

Database Systems ◽

Database System ◽

Sql Server ◽

External Application ◽

Oracle Database ◽

Sql Server Database ◽

Relational Database Systems ◽

Mean Time

Comparison of efficiency is not a trivial phenomenon because of disparities between different database systems. This paper presents a methodology of comparing relational database systems in respect of mean time of execution individual DML queries containing subqueries and conjunction of tables. The presented methodology can be additionally accommodated to studies of efficiency in a range of database system itself (study of queries executed directly in database engine). The described methodology allows to receive statement telling which database system is better in comparison to another in dependency of functionalities fulfilled by external application. In the article the analysis of mean time of execution individual DML queries was performed.Two research hypotheses have been put forward: "Microsoft SQL Server database system needs less time to execute INSERT and UPDATE queries than Oracle database" and "Oracle database system needs less time to execute DML queries with binary data than SQL Server"

Download Full-text

Evaluation of Optimization Strategies for Incremental Graph Queries

Periodica Polytechnica Electrical Engineering and Computer Science ◽

10.3311/ppee.9769 ◽

2017 ◽

Vol 61 (2) ◽

pp. 175 ◽

Cited By ~ 2

Author(s):

Gábor Szárnyas ◽

János Maginecz ◽

Dániel Varró

Keyword(s):

Response Times ◽

Distributed Storage ◽

Large Data ◽

Database Systems ◽

Optimization Techniques ◽

Large Data Sets ◽

Data Sets ◽

Multiple Datasets ◽

Graph Queries ◽

Relational Database Systems

The last decade brought considerable improvements in distributed storage and query technologies, known as NoSQL systems. These systems provide quick evaluation of simple retrieval operations and are able to answer certain complex queries in a scalable way, albeit not instantly. Providing scalability and quick response times at the same time for querying large data sets is still a challenging task. Evaluating complex graph queries is particularly difficult, as it requires lots of join, antijoin and filtering operations. This paper presents optimization techniques used in relational database systems and applies them on graph queries. We evaluate various query plans on multiple datasets and discuss the effect of different optimization techniques.

Download Full-text

Fuzzy Classification on Relational Databases

Handbook of Research on Fuzzy Information Processing in Databases ◽

10.4018/978-1-59904-853-6.ch023 ◽

2011 ◽

pp. 586-614 ◽

Cited By ~ 25

Author(s):

Andreas Meier ◽

Günter Schindler ◽

Nicolas Werro

Keyword(s):

Relational Databases ◽

Information Overload ◽

Query Language ◽

Large Data ◽

Database Systems ◽

Fuzzy Classification ◽

Linguistic Variables ◽

Context Model ◽

Relational Database Systems ◽

Data Collections

In practice, information systems are based on very large data collections mostly stored in relational databases. As a result of information overload, it has become increasingly difficult to analyze huge amounts of data and to generate appropriate management decisions. Furthermore, data are often imprecise because they do not accurately represent the world or because they are themselves imperfect. For these reasons, a context model with fuzzy classes is proposed to extend relational database systems. More precisely, fuzzy classes and linguistic variables and terms, together with appropriate membership functions, are added to the database schema. The fuzzy classification query language (fCQL) allows the user to formulate unsharp queries that are then transformed into appropriate SQL statements using the fCQL toolkit so that no migration of the raw data is needed. In addition to the context model with fuzzy classes, fCQL and its implementation are presented here, illustrated by concrete examples.

Download Full-text

Graph Encoding and Recursion Computation

Encyclopedia of Information Science and Technology, First Edition ◽

10.4018/978-1-59140-553-5.ch231 ◽

2005 ◽

pp. 1309-1316

Author(s):

Yangjun Chen

Keyword(s):

Relational Database ◽

Relational Databases ◽

Database Systems ◽

Cad Cam ◽

Relational Systems ◽

Relational Database Systems ◽

Novel Applications ◽

General Opinion ◽

Office Systems ◽

Document Databases

It is a general opinion that relational database systems are inadequate for manipulating composite objects that arise in novel applications such as Web and document databases (Abiteboul, Cluet, Christophides, Milo, Moerkotte & Simon, 1997; Chen & Aberer, 1998, 1999; Mendelzon, Mihaila & Milo, 1997; Zhang, Naughton, Dewitt, Luo & Lohman, 2001), CAD/ CAM, CASE, office systems and software management. Especially, when recursive relationships are involved, it is cumbersome to handle them in relational databases, which sets current relational systems far behind the navigational ones (Kuno & Rundensteiner, 1998; Lee & Lee, 1998). To overcome this problem, a lot of interesting graph encoding methods have been developed to mitigate the difficulty to some extent. In this article, we give a brief description of some important methods, including analysis and comparison of their space and time complexities.

Download Full-text

Towards the Maturity of Object-Relational Database Technology: Promises and Reality

International Journal of Technology Diffusion ◽

10.4018/ijtd.2015100101 ◽

2015 ◽

Vol 6 (4) ◽

pp. 1-19 ◽

Cited By ~ 4

Author(s):

Negin Keivani ◽

Abdelsalam M. Maatuk ◽

Shadi Aljawarneh ◽

Muhammad Akhtar Ali

Keyword(s):

Relational Database ◽

Relational Databases ◽

Database Systems ◽

Database System ◽

Relational Database System ◽

Key Factor ◽

Object Relational ◽

Database Technology ◽

Relational Database Systems ◽

Additional Object

Object-relational technology provides a significant increase in scalability and flexibility over the traditional relational databases. The additional object-relational features are particularly satisfying for advanced database applications that relational database systems have experienced difficulties. The key factor to the success of object-relational database systems is their performance. This paper aims to review the promises of Object-Relational database systems, examine the reality, and how their promises may be fulfilled through unification with the relational technology. To investigate the performance implications of using object-relational relative to relational technology, the query-oriented BUCKY benchmark has been previously applied to an early object-relational database system, i.e., Illustra 97. This paper presents the results obtained from implementing and running the BUCKY benchmark on Oracle 10g. The results acquired from the work described in this paper are compared with the results obtained in BUCKY benchmark. This study throws light on the functionality of object-relational databases, where object-relational technology has made improvements but some limitations are identified as well. In general, the performance of relational supersedes that of object-relational database system.

Download Full-text

Closing the Gap Between XML and Relational Database Technologies

Open and Novel Issues in XML Database Applications ◽

10.4018/978-1-60566-308-1.ch001 ◽

2010 ◽

pp. 1-27

Author(s):

Mary Ann Malloy ◽

Irena Mlynkova

Keyword(s):

Relational Database ◽

Relational Databases ◽

Database Systems ◽

Data Representation ◽

Xml Data ◽

Advantages And Disadvantages ◽

Closing The Gap ◽

Relational Database Systems ◽

Xml Technologies

As XML technologies have become a standard for data representation, it is inevitable to propose and implement efficient techniques for managing XML data. A natural alternative is to exploit tools and functions offered by relational database systems. Unfortunately, this approach has many detractors, especially due to inefficiency caused by structural differences between XML data and relations. But, on the other hand, relational databases represent a mature, verified and reliable technology for managing any kind of data including XML documents. In this chapter, the authors provide an overview and classification of existing approaches to XML data management in relational databases. They view the problem from both state-of-the-practice and state-of-the-art perspectives. The authors describe the current best known solutions, their advantages and disadvantages. Finally, they discuss some open issues and their possible solutions.

Download Full-text

Supporting Imprecision in Database Systems

Encyclopedia of Data Warehousing and Mining, Second Edition ◽

10.4018/978-1-60566-010-3.ch288 ◽

2011 ◽

pp. 1884-1887 ◽

Cited By ~ 1

Author(s):

Ullas Nambiar

Keyword(s):

Data Storage ◽

Database Systems ◽

Travel Agency ◽

Specific Information ◽

George Bush ◽

Domain Specific ◽

Query Engine ◽

Management Schemes ◽

Support Mechanisms ◽

Relational Database Systems

A query against incomplete or imprecise data in a database1, or a query whose search conditions are imprecise can both result in answers that do not satisfy the query completely. Such queries can be broadly termed as imprecise queries. Today’s database systems are designed largely for precise queries against a database of precise and complete data. Range queries (e.g., Age BETWEEN 20 AND 30) and disjunctive queries (e.g., Name=“G. W. Bush” OR Name=“George Bush”) do allow for some imprecision in queries. However, these extensions to precise queries are unable to completely capture the expressiveness of an imprecise query. Supporting imprecise queries (e.g., Model like “Camry” and Price around “$15000”) over databases necessitates a system that integrates a similarity search paradigm over structured and semi-structured data. Today’s relational database systems, as they are designed to support precise queries against precise data, use such precise access support mechanisms as indexing, hashing, and sorting. Such mechanisms are used for fast selective searches of records within a table and for joining two tables based on precise matching of values in join fields in the tables. The imprecise nature of the search conditions in queries will make such access mechanisms largely useless. Thus, supporting imprecise queries over existing databases would require adding support for imprecision within the query engine and meta-data management schemes like indexes. Extending a database to support imprecise queries would involve changing the query processing and data storage models being used by the database. But, the fact that databases are generally used by other applications and therefore must retain their behaviour could become a key inhibitor to any technique that relies on modifying the database to enable support for imprecision. For example, changing an airline reservation database will necessitate changes to other connected systems including travel agency databases, partner airline databases etc. Even if the database is modifiable, we would still require a domain expert and/or end user to provide the necessary distance metrics and domain ontology. Domain ontologies do not exist for all possible domains and the ones that are available are far from being complete. Therefore, a feasible solution for answering imprecise queries should neither assume the ability to modify the properties of the database nor require users (both lay and expert) to provide much domain specific information.

Download Full-text

Set Valued Attributes

Encyclopedia of Database Technologies and Applications ◽

10.4018/978-1-59140-560-3.ch104 ◽

2005 ◽

pp. 632-637 ◽

Cited By ~ 3

Author(s):

Karthikeyan Ramasamy ◽

Prasad M. Deshpande

Keyword(s):

Relational Database ◽

Data Model ◽

Relational Databases ◽

Database Systems ◽

Sql Server ◽

Relational Model ◽

Database Model ◽

Competitive Edge ◽

The Past ◽

Relational Database Systems

About three decades ago, when Codd (1970) invented the relational database model, it took the database world by storm. The enterprises that adapted it early won a large competitive edge. The past two decades have witnessed tremendous growth of relational database systems, and today the relational model is by far the dominant data model and is the foundation for leading DBMS products, including IBM DB2, Informix, Oracle, Sybase, and Microsoft SQL server. Relational databases have become a multibillion-dollar industry.

Download Full-text