Schema Matching
Recently Published Documents

TOTAL DOCUMENTS: 361 (last five years: 42)
H-INDEX: 27 (last five years: 2)

2021
Author(s): Evan Shieh, Saul Simhon, Geetha Aluri, Giorgos Papachristoudis, Doa Yakut, et al.
Keyword(s):

2021, Vol 11 (3), pp. 119-129
Author(s): Rifqi Hammad, Azriel Christian Nurcahyo, Ahmad Zuli Amrullah, Pahrul Irfan, et al.

Universities require the integration of data between their information systems, because the same data must currently be entered into several different systems. Data integration generally faces several obstacles, one of which is the diversity of databases used by each information system. Schema matching is one method for overcoming integration problems caused by this diversity. The schema matching methods used in this research are linguistic matching and constraint matching. The matching results are used to optimize data integration at the database level. The optimization reduced the database by 13 tables and 492 attributes; these changes resulted from some tables and attributes being removed or normalized. This research shows that, after optimization, data integration improved: the amount of data connected to and used by other systems increased by 46.67% over the previous amount. As a result, duplicate data entry across different systems is reduced, and data inconsistencies caused by duplicating data across systems are minimized.
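The combination of linguistic and constraint-based matching that this abstract describes can be illustrated with a small sketch. The scoring functions, weights, and attribute records below are illustrative assumptions, not the paper's actual implementation: linguistic similarity is approximated with a string-similarity ratio, and constraint compatibility with simple type/key checks.

```python
from difflib import SequenceMatcher

def linguistic_score(a: str, b: str) -> float:
    """Name similarity between two attribute names (0..1)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()

def constraint_score(a: dict, b: dict) -> float:
    """Compatibility of data type and key constraints (0..1)."""
    score = 0.0
    if a["type"] == b["type"]:
        score += 0.5
    if a.get("primary_key") == b.get("primary_key"):
        score += 0.5
    return score

def match_score(a: dict, b: dict, w_ling: float = 0.6, w_cons: float = 0.4) -> float:
    """Weighted combination of linguistic and constraint evidence."""
    return w_ling * linguistic_score(a["name"], b["name"]) + \
           w_cons * constraint_score(a, b)

# Hypothetical attributes from two schemas that denote the same concept
stud_id = {"name": "student_id", "type": "int", "primary_key": True}
id_mhs  = {"name": "id_student", "type": "int", "primary_key": True}
address = {"name": "address", "type": "varchar", "primary_key": False}

print(match_score(stud_id, id_mhs) > match_score(stud_id, address))  # True
```

A matcher of this shape would pair each attribute with its highest-scoring counterpart above some threshold; the paper then uses such correspondences to merge and normalize redundant tables.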




2021, Vol 27 (1)
Author(s): Diego Rodrigues, Altigran da Silva

Abstract: Schema matching is the problem of finding semantic correspondences between elements from different schemas. This is a challenging problem, since disparate elements in the schemas often represent the same concept. Traditional instances of this problem involve a pair of schemas. Recently, however, there has been increasing interest in matching several related schemas at once, a problem known as schema matching networks. The goal is to identify elements from several schemas that correspond to a single concept. We propose a family of methods for schema matching networks based on machine learning, which has proved to be a competitive alternative to traditional matching in several domains. To overcome the need for a large amount of training data, we also propose a bootstrapping procedure that generates training data automatically. In addition, we leverage constraints that arise in network scenarios to improve the quality of this data. We also study a strategy for soliciting user feedback to verify some of the generated matches and, relying on this feedback, improve the quality of the final result. Our experiments show that our methods can outperform the baselines, reaching an F1-score of up to 0.83.
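One network-level constraint that such bootstrapping can exploit is (approximate) transitivity of correspondences: if attribute a matches b and b matches c, then a~c can be added as a positive training example. The sketch below is a hypothetical illustration of that idea, not the authors' actual procedure; the schema and attribute names are invented.

```python
from itertools import combinations

# Hypothetical seed correspondences produced by a simple base matcher:
# each pair of (schema, attribute) identifiers denotes the same concept.
seed_matches = [
    (("crm", "cust_name"), ("billing", "customer_name")),
    (("billing", "customer_name"), ("shipping", "client_name")),
]

def bootstrap_by_transitivity(matches):
    """Group attributes into concept clusters by merging overlapping
    correspondences; every pair inside a cluster then becomes a
    positive training example for the learned matcher."""
    groups = []
    for a, b in matches:
        ga = next((g for g in groups if a in g), None)
        gb = next((g for g in groups if b in g), None)
        if ga and gb and ga is not gb:
            ga |= gb              # merge two existing clusters
            groups.remove(gb)
        elif ga:
            ga.add(b)
        elif gb:
            gb.add(a)
        else:
            groups.append({a, b})
    return {frozenset(p) for g in groups for p in combinations(sorted(g), 2)}

positives = bootstrap_by_transitivity(seed_matches)
print(len(positives))  # 3 -- includes the inferred pair crm.cust_name ~ shipping.client_name
```

In a real network scenario the inferred pairs would be noisy, which is why the paper combines such constraints with user feedback on a subset of the generated matches.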


Author(s): Archana Patel, Narayan C. Debnath, Ambrish Kumar Mishra, Sarika Jain
Keyword(s):

2021, Vol 14 (8), pp. 1254-1261
Author(s): Nan Tang, Ju Fan, Fangyi Li, Jianhong Tu, Xiaoyong Du, et al.

Can AI help automate human-easy but computer-hard data preparation tasks that burden data scientists, practitioners, and crowd workers? We answer this question by presenting RPT, a denoising autoencoder for tuple-to-X models ("X" could be a tuple, a token, a label, JSON, and so on). RPT is pre-trained as a tuple-to-tuple model by corrupting the input tuple and then learning to reconstruct the original tuple. It adopts a Transformer-based neural translation architecture consisting of a bidirectional encoder (similar to BERT) and a left-to-right autoregressive decoder (similar to GPT), yielding a generalization of both BERT and GPT. The pre-trained RPT can already support several common data preparation tasks such as data cleaning, auto-completion, and schema matching. Better still, RPT can be fine-tuned on a wide range of data preparation tasks, such as value normalization, data transformation, and data annotation. To complement RPT, we also discuss several appealing techniques, such as collaborative training and few-shot learning for entity resolution, and few-shot learning and NLP question answering for information extraction. In addition, we identify a series of research opportunities to advance the field of data preparation.

