query translation Latest Research Papers

Modern web wants the data to be in Resource Description Framework (RDF) format, a machine-readable form that is easy to share and reuse data without human intervention. However, most of the information is still available in relational form. The existing conventional methods transform the data from RDB to RDF using instance-level mapping, which has not yielded the expected results because of poor mapping. Hence, in this paper, a novel schema-based RDB-RDF mapping method (relational database to Resource Description Framework) is proposed, which is an improvised version for transforming the relational database into the Resource Description Framework. It provides both data materialization and on-demand mapping. RDB-RDF reduces the data retrieval time for nonprimary key search by using schema-level mapping. The resultant mapped RDF graph presents the relational database in a conceptual schema and maintains the instance triples as data graph. This mechanism is known as data materialization, which suits well for the static dataset. To get the data in a dynamic environment, query translation (on-demand mapping) is best instead of whole data conversion. The proposed approach directly converts the SPARQL query into SQL query using the mapping descriptions available in the proposed system. The mapping description is the key component of this proposed system which is responsible for quick data retrieval and query translation. Join expression introduced in the proposed RDB-RDF mapping method efficiently handles all complex operations with primary and foreign keys. Experimental evaluation is done on the graphics designer database. It is observed from the result that the proposed schema-based RDB-RDF mapping method accomplishes more comprehensible mapping than conventional methods by dissolving structural and operational differences.

Download Full-text

Accessing OMOP Common Data Model Repositories with the i2b2 Webclient – Algorithm for Automatic Query Translation

German Medical Data Sciences: Bringing Data to Life - Studies in Health Technology and Informatics ◽

10.3233/shti210077 ◽

2021 ◽

Author(s):

Raphael W. Majeed ◽

Patrick Fischer ◽

Andreas Günther

Keyword(s):

User Interface ◽

Data Integration ◽

Data Model ◽

Research Data ◽

Feasibility Analysis ◽

Common Data Model ◽

Wide Spread ◽

Query Translation ◽

Fair Principles ◽

Intuitive User Interface

In the era of translational research, data integration and clinical data warehouses are important enabling technologies for clinical researchers. The OMOP common data model is a wide-spread choice as a target for data integration in medical informatics. It’s portability of queries and analyses across different institutions and data are ideal also from the viewpoint of the FAIR principles. Yet, the OMOP CDM lacks a simple and intuitive user interface for untrained users to run simple queries for feasibility analysis. Aim of this study is to provide an algorithm to translate any given i2b2 query to an equivalent query which can then be run on the OMOP CDM database. The provided algorithm is able to convert queries created in the i2b2 webclient to SQL statements which can be executed on a standard OMOP CDM database programmatically.

Download Full-text

Enhancing virtual ontology based access over tabular data with Morph-CSV

Semantic Web ◽

10.3233/sw-210432 ◽

2021 ◽

pp. 1-34

Author(s):

David Chaves-Fraga ◽

Edna Ruckhaus ◽

Freddy Priyatna ◽

Maria-Esther Vidal ◽

Oscar Corcho

Keyword(s):

Relational Databases ◽

Data Access ◽

Database Management System ◽

Tabular Data ◽

Translation Process ◽

Query Translation ◽

Unified View ◽

And Performance ◽

Referential Integrity ◽

Sql Query

Ontology-Based Data Access (OBDA) has traditionally focused on providing a unified view of heterogeneous datasets (e.g., relational databases, CSV and JSON files), either by materializing integrated data into RDF or by performing on-the-fly querying via SPARQL query translation. In the specific case of tabular datasets represented as several CSV or Excel files, query translation approaches have been applied by considering each source as a single table that can be loaded into a relational database management system (RDBMS). Nevertheless, constraints over these tables are not represented (e.g., referential integrity among sources, datatypes, or data integrity); thus, neither consistency among attributes nor indexes over tables are enforced. As a consequence, efficiency of the SPARQL-to-SQL translation process may be affected, as well as the completeness of the answers produced during the evaluation of the generated SQL query. Our work is focused on applying implicit constraints on the OBDA query translation process over tabular data. We propose Morph-CSV, a framework for querying tabular data that exploits information from typical OBDA inputs (e.g., mappings, queries) to enforce constraints that can be used together with any SPARQL-to-SQL OBDA engine. Morph-CSV relies on both a constraint component and a set of constraint operators. For a given set of constraints, the operators are applied to each type of constraint with the aim of enhancing query completeness and performance. We evaluate Morph-CSV in several domains: e-commerce with the BSBM benchmark; transportation with the GTFS-Madrid benchmark; and biology with a use case extracted from the Bio2RDF project. We compare and report the performance of two SPARQL-to-SQL OBDA engines, without and with the incorporation of Morph-CSV. The observed results suggest that Morph-CSV is able to speed up the total query execution time by up to two orders of magnitude, while it is able to produce all the query answers.

Download Full-text

Parallel sentence extraction to improve cross-language information retrieval from Wikipedia

Journal of Information Science ◽

10.1177/0165551521992754 ◽

2021 ◽

pp. 016555152199275

Author(s):

Juryong Cheon ◽

Youngjoong Ko

Keyword(s):

Information Retrieval ◽

Language Resources ◽

Query Translation ◽

Factors Affecting ◽

Parallel Corpora ◽

Parallel Corpus ◽

Bilingual Dictionary ◽

Sentence Extraction ◽

Cross Language Information Retrieval ◽

Cross Language

Translation language resources, such as bilingual word lists and parallel corpora, are important factors affecting the effectiveness of cross-language information retrieval (CLIR) systems. In particular, when large domain-appropriate parallel corpora are not available, developing an effective CLIR system is particularly difficult. Furthermore, creating a large parallel corpus is costly and requires considerable effort. Therefore, we here demonstrate the construction of parallel corpora from Wikipedia as well as improved query translation, wherein the queries are used for a CLIR system. To do so, we first constructed a bilingual dictionary, termed WikiDic. Then, we evaluated individual language resources and combinations of them in terms of their ability to extract parallel sentences; the combinations of our proposed WikiDic with the translation probability from the Web’s bilingual example sentence pairs and WikiDic was found to be best suited to parallel sentence extraction. Finally, to evaluate the parallel corpus generated from this best combination of language resources, we compared its performance in query translation for CLIR to that of a manually created English–Korean parallel corpus. As a result, the corpus generated by our proposed method achieved a better performance than did the manually created corpus, thus demonstrating the effectiveness of the proposed method for automatic parallel corpus extraction. Not only can the method demonstrated herein be used to inform the construction of other parallel corpora from language resources that are readily available, but also, the parallel sentence extraction method will naturally improve as Wikipedia continues to be used and its content develops.

Download Full-text

The Query Translation from MySQL to MongoDB Taking into Account the Structure of the Database

2021 IEEE Conference of Russian Young Researchers in Electrical and Electronic Engineering (ElConRus) ◽

10.1109/elconrus51938.2021.9396591 ◽

2021 ◽

Author(s):

Muon Ha ◽

Yulia Shichkina

Keyword(s):

Query Translation

Download Full-text

English-Hindi Cross Language Query Translation and Disambiguation Using Most Salient Seed Word

Advances in Intelligent Systems and Computing - Intelligent Systems Design and Applications ◽

10.1007/978-3-030-71187-0_5 ◽

2021 ◽

pp. 49-58

Author(s):

Pratibha Maurya

Keyword(s):

Query Translation ◽

Cross Language

Download Full-text

Information Retrieval System Based on Query Translation Approach for Cross-Languages

Advances in Automation, Signal Processing, Instrumentation, and Control - Lecture Notes in Electrical Engineering ◽

10.1007/978-981-15-8221-9_118 ◽

2021 ◽

pp. 1261-1269

Author(s):

Mangala Madankar ◽

Manoj Chandak ◽

Nekita Chavhan

Keyword(s):

Information Retrieval ◽

Retrieval System ◽

Information Retrieval System ◽

Query Translation

Download Full-text

Query Translation for Multilingual Content with Semantic Technique

Sains Malaysiana ◽

10.17576/jsm-2020-4909-09 ◽

2020 ◽

Vol 49 (09) ◽

pp. 2113-2118

Author(s):

Norita Md Norwawi ◽

Sundresan a/l Perumal ◽

Emran Huda ◽

Waka Jeng

Keyword(s):

Query Translation

Download Full-text

Domain Transfer based Data Augmentation for Neural Query Translation

10.18653/v1/2020.coling-main.399 ◽

2020 ◽

Author(s):

Liang Yao ◽

Baosong Yang ◽

Haibo Zhang ◽

Boxing Chen ◽

Weihua Luo

Keyword(s):

Data Augmentation ◽

Query Translation ◽

Domain Transfer

Download Full-text

Term Ordering-Based Query Expansion Technique for Hindi-English CLIR System

Handling Priority Inversion in Time-Constrained Distributed Databases - Advances in Data Mining and Database Management ◽

10.4018/978-1-7998-2491-6.ch016 ◽

2020 ◽

pp. 283-302

Author(s):

Ganesh Chandra ◽

Sanjay K. Dwivedi

Keyword(s):

Query Expansion ◽

Multiple Representations ◽

Poor Quality ◽

Query Translation ◽

Quality Of Results ◽

Expansion Technique ◽

Back Translation ◽

Accuracy Of Results ◽

Term Ordering

The quality of retrieval documents in CLIR is often poor compared to IR system due to (1) query mismatching, (2) multiple representations of query terms, and (3) un-translated query terms. The inappropriate translation may lead to poor quality of results. Hence, automated query translation is performed using the back-translation approach for improvement of query translation. This chapter mainly focuses on query expansion (Q.E) and proposes an algorithm to address the drift query issue for Hindi-English CLIR. The system uses FIRE datasets and a set of 50 queries of Hindi language for evaluation. The purpose of a term ordering-based algorithm is to resolve the drift query issue in Q.E. The result shows that the relevancy of Hindi-English CLIR is improved by performing Q.E. using a term ordering-based algorithm. The outcome achieved 60.18% accuracy of results where Q.E has been performed using a term ordering based algorithm, whereas the result of Q.E without a term ordering-based algorithm stands at 57.46%.

Download Full-text

query translation
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Schema-Based Mapping Approach for Data Transformation to Enrich Semantic Web

Accessing OMOP Common Data Model Repositories with the i2b2 Webclient – Algorithm for Automatic Query Translation

Enhancing virtual ontology based access over tabular data with Morph-CSV

Parallel sentence extraction to improve cross-language information retrieval from Wikipedia

The Query Translation from MySQL to MongoDB Taking into Account the Structure of the Database

English-Hindi Cross Language Query Translation and Disambiguation Using Most Salient Seed Word

Information Retrieval System Based on Query Translation Approach for Cross-Languages

Query Translation for Multilingual Content with Semantic Technique

Domain Transfer based Data Augmentation for Neural Query Translation

Term Ordering-Based Query Expansion Technique for Hindi-English CLIR System

Export Citation Format

query translationRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Schema-Based Mapping Approach for Data Transformation to Enrich Semantic Web

Accessing OMOP Common Data Model Repositories with the i2b2 Webclient – Algorithm for Automatic Query Translation

Enhancing virtual ontology based access over tabular data with Morph-CSV

Parallel sentence extraction to improve cross-language information retrieval from Wikipedia

The Query Translation from MySQL to MongoDB Taking into Account the Structure of the Database

English-Hindi Cross Language Query Translation and Disambiguation Using Most Salient Seed Word

Information Retrieval System Based on Query Translation Approach for Cross-Languages

Query Translation for Multilingual Content with Semantic Technique

Domain Transfer based Data Augmentation for Neural Query Translation

Term Ordering-Based Query Expansion Technique for Hindi-English CLIR System

query translation
Recently Published Documents