incomplete databases Latest Research Papers

The Complexity of Counting Problems Over Incomplete Databases

ACM Transactions on Computational Logic ◽

10.1145/3461642 ◽

2021 ◽

Vol 22 (4) ◽

pp. 1-52

Author(s):

Marcelo Arenas ◽

Pablo BarcelÓ ◽

Mikaël Monet

Keyword(s):

Polynomial Time ◽

Relational Databases ◽

Approximation Scheme ◽

Query Languages ◽

Complexity Classes ◽

Conjunctive Query ◽

Counting Problems ◽

Incomplete Databases ◽

Boolean Query ◽

The Impact

We study the complexity of various fundamental counting problems that arise in the context of incomplete databases, i.e., relational databases that can contain unknown values in the form of labeled nulls. Specifically, we assume that the domains of these unknown values are finite and, for a Boolean query q , we consider the following two problems: Given as input an incomplete database D , (a) return the number of completions of D that satisfy q ; or (b) return the number of valuations of the nulls of D yielding a completion that satisfies q . We obtain dichotomies between #P-hardness and polynomial-time computability for these problems when q is a self-join–free conjunctive query and study the impact on the complexity of the following two restrictions: (1) every null occurs at most once in D (what is called Codd tables ); and (2) the domain of each null is the same. Roughly speaking, we show that counting completions is much harder than counting valuations: For instance, while the latter is always in #P, we prove that the former is not in #P under some widely believed theoretical complexity assumption. Moreover, we find that both (1) and (2) can reduce the complexity of our problems. We also study the approximability of these problems and show that, while counting valuations always has a fully polynomial-time randomized approximation scheme (FPRAS), in most cases counting completions does not. Finally, we consider more expressive query languages and situate our problems with respect to known complexity classes.

Decision tree classifiers for unmanned aircraft configuration selection

Aircraft Engineering and Aerospace Technology ◽

10.1108/aeat-03-2021-0074 ◽

2021 ◽

Vol ahead-of-print (ahead-of-print) ◽

Author(s):

João Antônio Dantas de Jesus Ferreira ◽

Ney Rafael Secco

Keyword(s):

Decision Tree ◽

Aircraft Design ◽

Classification Problem ◽

Unmanned Aircraft ◽

Requirements Traceability ◽

Content Type ◽

Configuration Selection ◽

Incomplete Databases ◽

Engine Type ◽

Selection Of

Purpose This paper aims to investigate the possibility of lowering the time taken during the aircraft design for unmanned aerial vehicles by using machine learning (ML) for the configuration selection phase. In this work, a database of unmanned aircraft is compiled and is proposed that decision tree classifiers (DTC) can understand the relations between mission and operational requirements and the resulting aircraft configuration. Design/methodology/approach This paper presents a ML-based approach to configuration selection of unmanned aircraft. Multiple DTC are built to predict the overall configuration. The classifiers are trained with a database of 118 unmanned aircraft with 57 characteristics, 47 of which are inputs for the classification problem, and 10 are the desired outputs, such as wing configuration or engine type. Findings This paper shows that DTC can be used for the configuration selection of unmanned aircraft with reasonable accuracy, understanding the connections between the different mission requirements and the culminating configuration. The framework is also capable of dealing with incomplete databases, maximizing the available knowledge. Originality/value This paper increases the computational usage for the aircraft design while retaining requirements’ traceability and increasing decision awareness.

Unholy goals and flawed methods

Nordic Perspectives on Open Science ◽

10.7557/11.5659 ◽

2020 ◽

Author(s):

Charlotte Wien ◽

Bertil Fabricius Dorch

Keyword(s):

Social Sciences ◽

The Other ◽

H Index ◽

Social Sciences And Humanities ◽

Future Production ◽

The Social ◽

Job Advertisements ◽

The World ◽

Incomplete Databases

A problematic practice has evolved, which is threatening to undermine research in the social sciences and humanities. Bibliometrics is often claimed to be able to measure researchers’ efficiency. We find this quite problematic and, in this article, we illustrate this point by discussing two different bibliometric practices. One is the so-called h-index, the other the so-called BFI-points (Den bibliometriske Forskningsindikator, The Bibliometric Research Indicator). The BFI was never intended to be used for evaluating individual researchers and their productivity. Yet since its introduction in 2008 especially the social sciences and the humanities experience a pressure to deliver “BFI points” and academic job advertisements within the social sciences and the humanities increasingly mention expectations for people’s past and/or future production of BFI points. The h-index is even more problematic because no one academic database covers all the research publications in the world. The whole thing is completely disorganized, and as many as five different h-indexes exist for each researcher. What makes the h-index even more useless is that it will not let you make comparisons across disciplines. Furthermore, like other simple measurements, it is liable to be manipulated and misinterpreted. On that background, it is remarkable that numbers extracted from incomplete databases are used for describing the quality of researchers and their institutions.

Toward an Axiomatization of Strongly Possible Functional Dependencies

Vietnam Journal of Computer Science ◽

10.1142/s2196888821500056 ◽

2020 ◽

Vol 08 (01) ◽

pp. 133-151

Author(s):

Munqath Alattar ◽

Attila Sali

Keyword(s):

Missing Data ◽

Computer Science ◽

Relational Databases ◽

Possible Worlds ◽

Missing Values ◽

Preceding Paper ◽

Functional Dependency ◽

Functional Dependencies ◽

Lecture Notes ◽

Incomplete Databases

In general, there are two main approaches to handle the missing data values problem in SQL tables. One is to ignore or remove any record with some missing data values. The other approach is to fill or impute the missing data with new values [A. Farhangfar, L. A. Kurgan and W. Pedrycz, A novel framework for imputation of missing values in databases, IEEE Trans. Syst. Man Cybern. A, Syst. Hum. 37(5) (2007) 692–709]. In this paper, the second method is considered. Possible worlds, possible and certain keys, and weak and strong functional dependencies were introduced in Refs. 4 and 2 [H. Köhler, U. Leck, S. Link and X. Zhou, Possible and certain keys for SQL, VLDB J. 25(4) (2016) 571–596; M. Levene and G. Loizou, Axiomatisation of functional dependencies in incomplete relations, Theor. Comput. Sci. 206(1) (1998) 283–300]. We introduced the intermediate concept of strongly possible worlds in a preceding paper, which are obtained by filling missing data values with values already existing in the table. Using strongly possible worlds, strongly possible keys and strongly possible functional dependencies (spFDs) were introduced in Refs. 5 and 1 [M. Alattar and A. Sali, Keys in relational databases with nulls and bounded domains, in ADBIS 2019: Advances in Databases and Information Systems, Lecture Notes in Computer Science, Vol. 11695 (Springer, Cham, 2019), pp. 33–50; Functional dependencies in incomplete databases with limited domains, in FoiKS 2020: Foundations of Information and Knowledge Systems, Lecture Notes in Computer Science, Vol. 12012 (Springer, Cham, 2020), pp. 1–21]. In this paper, some axioms and rules for strongly possible functional dependencies are provided, These axioms and rules form the basis for a possible axiomatization of spFDs. For that, we analyze which weak/strong functional dependency and certain functional dependency axioms remain sound for strongly possible functional dependencies, and for the axioms that are not sound, we give appropriate modifications for soundness.

Knowledge-Preserving Certain Answers for SQL-like Queries

Proceedings of the Seventeenth International Conference on Principles of Knowledge Representation and Reasoning ◽

10.24963/kr.2020/78 ◽

2020 ◽

Author(s):

Etienne Toussaint ◽

Paolo Guagliardo ◽

Leonid Libkin

Keyword(s):

Information Content ◽

Data Model ◽

Incomplete Data ◽

Relational Databases ◽

Missing Values ◽

A Priori ◽

Real Life ◽

Query Languages ◽

Incomplete Databases ◽

Certain Answers

Answering queries over incomplete data is based on finding answers that are certainly true, independently of how missing values are interpreted. This informal description has given rise to several different mathematical definitions of certainty. To unify them, a framework based on "explanations", or extra information about incomplete data, was recently proposed. It partly succeeded in justifying query answering methods for relational databases under set semantics, but had two major limitations. First, it was firmly tied to the set data model, and a fixed way of comparing incomplete databases with respect to their information content. These assumptions fail for real-life database queries in languages such as SQL that use bag semantics instead. Second, it was restricted to queries that only manipulate data, while in practice most analytical SQL queries invent new values, typically via arithmetic operations and aggregation. To leverage our understanding of the notion of certainty for queries in SQL-like languages, we consider incomplete databases whose information content may be enriched by additional knowledge. The knowledge order among them is derived from their semantics, rather than being fixed a priori. The resulting framework allows us to capture and justify existing notions of certainty, and extend these concepts to other data models and query languages. As natural applications, we provide for the first time a well-founded definition of certain answers for the relational bag data model and for value-inventing queries on incomplete databases, addressing the key shortcomings of previous approaches.

Queries with Arithmetic on Incomplete Databases

Proceedings of the 39th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems ◽

10.1145/3375395.3387666 ◽

2020 ◽

Cited By ~ 1

Author(s):

Marco Console ◽

Matthias Hofer ◽

Leonid Libkin

Keyword(s):

Incomplete Databases

Counting Problems over Incomplete Databases

Proceedings of the 39th ACM SIGMOD-SIGACT-SIGAI Symposium on Principles of Database Systems ◽

10.1145/3375395.3387656 ◽

2020 ◽

Author(s):

Marcelo Arenas ◽

Pablo Barceló ◽

Mikaël Monet

Keyword(s):

Counting Problems ◽

Incomplete Databases

A Model for Computing Skyline Data Items in Cloud Incomplete Databases

Procedia Computer Science ◽

10.1016/j.procs.2020.03.037 ◽

2020 ◽

Vol 170 ◽

pp. 249-256

Author(s):

Yonis Gulzar ◽

Ali A. Alwan ◽

Abedallah Zaid Abualkishik ◽

Abid Mehmood

Keyword(s):

Incomplete Databases

Functional Dependencies in Incomplete Databases with Limited Domains

Lecture Notes in Computer Science - Foundations of Information and Knowledge Systems ◽

10.1007/978-3-030-39951-1_1 ◽

2020 ◽

pp. 1-21 ◽

Cited By ~ 2

Author(s):

Munqath Alattar ◽

Attila Sali

Keyword(s):

Functional Dependencies ◽

Incomplete Databases

Approximation algorithms for querying incomplete databases

Information Systems ◽

10.1016/j.is.2019.03.010 ◽

2019 ◽

Vol 86 ◽

pp. 28-45 ◽

Cited By ~ 4

Author(s):

Sergio Greco ◽

Cristian Molinaro ◽

Irina Trubitsyna

Keyword(s):

Approximation Algorithms ◽

Incomplete Databases

incomplete databases
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

The Complexity of Counting Problems Over Incomplete Databases

Decision tree classifiers for unmanned aircraft configuration selection

Unholy goals and flawed methods

Toward an Axiomatization of Strongly Possible Functional Dependencies

Knowledge-Preserving Certain Answers for SQL-like Queries

Queries with Arithmetic on Incomplete Databases

Counting Problems over Incomplete Databases

A Model for Computing Skyline Data Items in Cloud Incomplete Databases

Functional Dependencies in Incomplete Databases with Limited Domains

Approximation algorithms for querying incomplete databases

Export Citation Format

incomplete databasesRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

The Complexity of Counting Problems Over Incomplete Databases

Decision tree classifiers for unmanned aircraft configuration selection

Unholy goals and flawed methods

Toward an Axiomatization of Strongly Possible Functional Dependencies

Knowledge-Preserving Certain Answers for SQL-like Queries

Queries with Arithmetic on Incomplete Databases

Counting Problems over Incomplete Databases

A Model for Computing Skyline Data Items in Cloud Incomplete Databases

Functional Dependencies in Incomplete Databases with Limited Domains

Approximation algorithms for querying incomplete databases

incomplete databases
Recently Published Documents