Mediated Data Integration Systems Using Functional Dependencies Embedded in Ontologies

This chapter addresses the problem of data integration in a P2P environment, where each peer stores schema of its local data, mappings between the schemas, and some schema constraints. The goal of the integration is to answer queries formulated against a chosen peer. The answer must consist of data stored in the queried peer as well as data of its direct and indirect partners. The chapter focuses on defining and using mappings, schema constraints, query propagation across the P2P system, and query answering in such scenario. Schemas, mappings, constraints (functional dependencies) and queries are all expressed using a unified approach based on tree-pattern formulas. The chapter discusses how functional dependencies can be exploited to increase information content of answers (by discovering missing values) and to control merging operations and propagation strategies. The chapter proposes algorithms for translating high-level specifications of mappings and queries into XQuery programs, and it shows how the discussed method has been implemented in SixP2P (or 6P2P) system.

Dealing with categorical missing data using CleanerR

10.5753/bresci.2019.6310 ◽

2019 ◽

Author(s):

Rafael Pereira ◽

Fabio Porto

Keyword(s):

Information Theory ◽

Data Analysis ◽

Missing Data ◽

Data Integration ◽

Missing Values ◽

Data Input ◽

Functional Dependencies ◽

The World ◽

Account Information

Missing data is a common problem in the world of data analysis. They appear in datasets due to a multitude of reasons, from data integration to poor data input. When faced with the problem, the analyst must decide what to do with the missing data since its not always advisable to discard these values from your analysis. On this paper we shall discuss a method that takes into account information theory and functional dependencies to best imput missing values.

Dealing with categorical missing data using CleanerR

10.5753/bresci.2019.10032 ◽

2019 ◽

Author(s):

Rafael S. Pereira ◽

Fabio Porto

Keyword(s):

Information Theory ◽

Data Analysis ◽

Missing Data ◽

Data Integration ◽

Missing Values ◽

Data Input ◽

Functional Dependencies ◽

The World ◽

Account Information

Missing data is a common problem in the world of data analysis. They appear in datasets due to a multitude of reasons, from data integration to poor data input. When faced with the problem, the analyst must decide what to do with the missing data since its not always advisable to discard these values from your analysis. On this paper we shall discuss a method that takes into account information theory and functional dependencies to best imput missing values.

Pay-As-You-Go Data Integration Using Functional Dependencies

Lecture Notes in Computer Science - Multidisciplinary Research and Practice for Information Systems ◽

10.1007/978-3-642-32498-7_28 ◽

2012 ◽

pp. 375-389 ◽

Cited By ~ 4

Author(s):

Naser Ayat ◽

Hamideh Afsarmanesh ◽

Reza Akbarinia ◽

Patrick Valduriez

Keyword(s):

Data Integration ◽

Functional Dependencies

Ontologies and Functional Dependencies for Data Integration and Reconciliation

Advances in Conceptual Modeling. Recent Developments and New Directions - Lecture Notes in Computer Science ◽

10.1007/978-3-642-24574-9_13 ◽

2011 ◽

pp. 98-107 ◽

Cited By ~ 4

Author(s):

Abdelghani Bakhtouchi ◽

Ladjel Bellatreche ◽

Yamine Ait-Ameur

Keyword(s):

Data Integration ◽

Functional Dependencies

Modified algorithm for fast bandwidth selection for kernel estimates of multidimensional probability densities

Izmeritel`naya Tekhnika ◽

10.32446/0368-1025it.2020-11-9-13 ◽

2020 ◽

pp. 9-13

Author(s):

A. V. Lapko ◽

V. A. Lapko

Keyword(s):

Probability Density ◽

Optimal Parameter ◽

Random Variables ◽

Kernel Functions ◽

Independent Random Variables ◽

Functional Dependencies ◽

Approximation Properties ◽

Probability Density Estimation ◽

Multidimensional Probability ◽

Selection Of

An original technique has been justified for the fast bandwidths selection of kernel functions in a nonparametric estimate of the multidimensional probability density of the Rosenblatt–Parzen type. The proposed method makes it possible to significantly increase the computational efficiency of the optimization procedure for kernel probability density estimates in the conditions of large-volume statistical data in comparison with traditional approaches. The basis of the proposed approach is the analysis of the optimal parameter formula for the bandwidths of a multidimensional kernel probability density estimate. Dependencies between the nonlinear functional on the probability density and its derivatives up to the second order inclusive of the antikurtosis coefficients of random variables are found. The bandwidths for each random variable are represented as the product of an undefined parameter and their mean square deviation. The influence of the error in restoring the established functional dependencies on the approximation properties of the kernel probability density estimation is determined. The obtained results are implemented as a method of synthesis and analysis of a fast bandwidths selection of the kernel estimation of the two-dimensional probability density of independent random variables. This method uses data on the quantitative characteristics of a family of lognormal distribution laws.

Multi-omic Data Integration in Oncology

10.3389/978-2-88966-151-0 ◽

2020 ◽

Keyword(s):

Data Integration ◽

Omic Data Integration ◽

Omic Data

Identifying linkages between EDCs in personal care products and breast cancer through data integration and gene network analysis

Endocrine Abstracts ◽

10.1530/endoabs.50.p244 ◽

2017 ◽

Author(s):

Hyeri Jeong ◽

Jongwoon Kim

Keyword(s):

Breast Cancer ◽

Network Analysis ◽

Data Integration ◽

Gene Network ◽

Personal Care Products ◽

Personal Care ◽

Gene Network Analysis

The Linked Data Enterprise as Enabler for both Intra- and Inter-organizational Business Data Integration and Usage

2020 43rd International Convention on Information, Communication and Electronic Technology (MIPRO) ◽

10.23919/mipro48935.2020.9245320 ◽

2020 ◽

Author(s):

A M. Tjoa

Keyword(s):

Data Integration ◽

Linked Data ◽

Business Data

Research and design of a semi-structured data-integration system on multiple Web sources

Advances in Computer Science and Technology ◽

10.2495/iccst140281 ◽

2014 ◽

Author(s):

Q. Yu ◽

Y. N. Wang

Keyword(s):

Data Integration ◽

Structured Data ◽

Integration System ◽

Data Integration System ◽

Research And Design