RYANSQL: Recursively Applying Sketch-based Slot Fillings for Complex Text-to-SQL in Cross-Domain Databases

Abstract: Text-to-SQL is the problem of converting a user question into an SQL query, given the question and a database. In this paper, we present a neural network approach called RYANSQL (Recursively Yielding Annotation Network for SQL) to solve complex Text-to-SQL tasks for cross-domain databases. A Statement Position Code (SPC) is defined to transform a nested SQL query into a set of non-nested SELECT statements; a sketch-based slot filling approach is proposed to synthesize each SELECT statement for its corresponding SPC. Additionally, two input manipulation methods are presented to further improve generation performance. RYANSQL achieved a competitive result of 58.2% accuracy on the challenging Spider benchmark. At the time of paper submission (April 2020), RYANSQL v2, a variant of the original RYANSQL, ranked 3rd among all systems and 1st among systems not using database content, with 60.6% exact matching accuracy. The source code is available at https://github.com/kakaoenterprise/RYANSQL.
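The idea of turning a nested query into a set of non-nested SELECT statements keyed by position can be illustrated with a small sketch. This is not the RYANSQL implementation: the `Select` class, the `[S]` placeholder for a nested statement, and the use of a tuple of clause labels as the position code are all assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class Select:
    """A SELECT statement; nested statements are replaced by the placeholder [S]."""
    text: str
    # Hypothetical mapping: clause label where the nesting occurs -> nested Select.
    children: dict = field(default_factory=dict)


def flatten(stmt, position=()):
    """Map each position code (a tuple of clause labels) to a non-nested statement."""
    flat = {position: stmt.text}
    for label, child in stmt.children.items():
        # A nested statement's code is its parent's code plus its own clause label.
        flat.update(flatten(child, position + (label,)))
    return flat


query = Select(
    "SELECT name FROM singer WHERE age > [S]",
    {"WHERE": Select(
        "SELECT avg(age) FROM singer WHERE country IN [S]",
        {"WHERE": Select("SELECT country FROM concert")},
    )},
)
flat = flatten(query)
for code, text in flat.items():
    print(code, text)
```

Each entry in `flat` can then be generated independently, which is what makes a per-statement slot filling approach applicable to nested queries.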

ICodeNet - A Hierarchical Neural Network Approach For Source Code Author Identification

2021 13th International Conference on Machine Learning and Computing ◽

10.1145/3457682.3457709 ◽

2021 ◽

Author(s):

Pranali Bora ◽

Tulika Awalgaonkar ◽

Himanshu Palve ◽

Raviraj Joshi ◽

Purvi Goel

Keyword(s):

Neural Network ◽

Source Code ◽

Network Approach ◽

Neural Network Approach ◽

Author Identification ◽

Hierarchical Neural Network

Capturing SQL Query Overlapping via Subtree Copy for Cross-Domain Context-Dependent SQL Generation

Advances in Knowledge Discovery and Data Mining - Lecture Notes in Computer Science ◽

10.1007/978-3-030-75765-6_53 ◽

2021 ◽

pp. 664-675

Author(s):

Ruizhuo Zhao ◽

Jinhua Gao ◽

Huawei Shen ◽

Xueqi Cheng

Keyword(s):

Cross Domain ◽

Context Dependent ◽

Sql Query

SEQ2SEQ VS SKETCH FILLING STRUCTURE FOR NATURAL LANGUAGE TO SQL TRANSLATION

ISPRS - International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences ◽

10.5194/isprs-archives-xliv-4-w3-2020-7-2020 ◽

2020 ◽

Vol XLIV-4/W3-2020 ◽

pp. 7-11

Author(s):

K. Ahkouk ◽

M. Machkour ◽

K. Majhadi ◽

R. Mama

Keyword(s):

Natural Language Processing ◽

Natural Language ◽

Language Processing ◽

Human Language ◽

Test Results ◽

Exact Matching ◽

Cross Domain ◽

Pros And Cons ◽

Time Required

Abstract. Sequence-to-sequence models have been widely used in recent years across natural language processing tasks. In particular, they have been widely adopted for translating human language questions to SQL, and many studies use sequence-to-sequence approaches to predict the target SQL queries on the available datasets. In this paper, we shed light on another way to solve natural language processing tasks, especially natural language to SQL: sketch-based decoding, which relies on a sketch with holes that the model fills incrementally. We present the pros and cons of each approach and how a sketch-based model can outperform existing solutions in predicting the desired SQL queries and generalizing to unseen input pairs in different contexts and cross-domain datasets. Finally, we discuss the test results of previously proposed models, using exact matching scores, error propagation, and training time as metrics.
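A sketch with holes that are filled incrementally can be illustrated as follows. This is a minimal sketch under stated assumptions, not a model from the paper: the sketch string, the slot names, and the hard-coded `predictions` (standing in for the outputs of per-slot predictors) are all hypothetical.

```python
from string import Template

# Hypothetical sketch: fixed SQL structure with named holes.
SKETCH = "SELECT $agg($column) FROM $table WHERE $cond_column $op $value"


def fill_sketch(sketch, predictions):
    """Fill one hole at a time and return every intermediate state."""
    partial = sketch
    steps = []
    for slot, value in predictions.items():
        # safe_substitute leaves holes that are not yet predicted untouched.
        partial = Template(partial).safe_substitute({slot: value})
        steps.append(partial)
    return steps


# Stand-in for the outputs of per-slot predictors (assumed values).
predictions = {
    "agg": "COUNT",
    "column": "*",
    "table": "singer",
    "cond_column": "age",
    "op": ">",
    "value": "30",
}
steps = fill_sketch(SKETCH, predictions)
print(steps[-1])
```

Because the structure is fixed by the sketch, each predictor only chooses a value for its own hole, in contrast to a sequence-to-sequence decoder that must emit every token of the query.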

Slot Transferability for Cross-domain Slot Filling

10.18653/v1/2021.findings-acl.440 ◽

2021 ◽

Author(s):

Hengtong Lu ◽

Zhuoxin Han ◽

Caixia Yuan ◽

Xiaojie Wang ◽

Shuyu Lei ◽

...

Keyword(s):

Cross Domain ◽

Slot Filling

Editing-Based SQL Query Generation for Cross-Domain Context-Dependent Questions

10.18653/v1/d19-1537 ◽

2019 ◽

Cited By ~ 2

Author(s):

Rui Zhang ◽

Tao Yu ◽

Heyang Er ◽

Sungrok Shim ◽

Eric Xue ◽

...

Keyword(s):

Cross Domain ◽

Query Generation ◽

Context Dependent ◽

Sql Query

A multi-label convolutional neural network approach to cross-domain action unit detection

2015 International Conference on Affective Computing and Intelligent Interaction (ACII) ◽

10.1109/acii.2015.7344632 ◽

2015 ◽

Cited By ~ 21

Author(s):

Sayan Ghosh ◽

Eugene Laksana ◽

Stefan Scherer ◽

Louis-Philippe Morency

Keyword(s):

Neural Network ◽

Convolutional Neural Network ◽

Network Approach ◽

Neural Network Approach ◽

Action Unit ◽

Cross Domain ◽

Action Unit Detection

Robot View Navigation Based on Level-Divided Strategy

Advanced Materials Research ◽

10.4028/www.scientific.net/amr.753-755.3108 ◽

2013 ◽

Vol 753-755 ◽

pp. 3108-3111

Author(s):

Yin Bing Li

Keyword(s):

Genetic Algorithm ◽

Real Time ◽

Image Matching ◽

Search Strategy ◽

Detection Algorithm ◽

Adaptive Genetic Algorithm ◽

Exact Matching ◽

Matching Accuracy ◽

Similarity Detection ◽

Time Requirements

To address the characteristics of color image matching in robot view navigation systems, the sequential similarity detection algorithm (SSDA) is improved and an adaptive genetic algorithm is introduced, together with a level-divided search strategy that combines rough and exact matching. The improved algorithm increases image matching speed without reducing matching accuracy, so that the real-time requirements of robot view navigation can be met and navigation becomes more robust.
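For readers unfamiliar with SSDA, the core idea is to accumulate the matching error pixel by pixel and abandon a candidate position as soon as it can no longer win. The following is a simplified grayscale variant for illustration only: it uses the best error found so far as the abandonment threshold, and it does not include the paper's improvements, the adaptive genetic algorithm, or the level-divided search. The function name `ssda_match` is an assumption.

```python
def ssda_match(image, template):
    """Return ((row, col), error) of the best template position in image.

    Both arguments are lists of equal-length rows of numbers.
    """
    th, tw = len(template), len(template[0])
    ih, iw = len(image), len(image[0])
    best_pos, best_err = None, float("inf")
    for r in range(ih - th + 1):
        for c in range(iw - tw + 1):
            err = 0
            abandoned = False
            for i in range(th):
                for j in range(tw):
                    err += abs(image[r + i][c + j] - template[i][j])
                    # Early termination: this position cannot beat the best one.
                    if err >= best_err:
                        abandoned = True
                        break
                if abandoned:
                    break
            if not abandoned:
                best_pos, best_err = (r, c), err
    return best_pos, best_err


image = [
    [1, 2, 3, 4],
    [5, 6, 7, 8],
    [9, 10, 11, 12],
]
template = [
    [6, 7],
    [10, 11],
]
position, error = ssda_match(image, template)
print(position, error)
```

Most candidate positions are rejected after only a few pixels, which is where the speed advantage over a full sum-of-absolute-differences scan comes from.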

Elastic CRFs for Open-Ontology Slot Filling

Applied Sciences ◽

10.3390/app112210675 ◽

2021 ◽

Vol 11 (22) ◽

pp. 10675

Author(s):

Yinpei Dai ◽

Yichi Zhang ◽

Hong Liu ◽

Zhijian Ou ◽

Yi Huang ◽

...

Keyword(s):

Random Field ◽

Conditional Random Field ◽

Dialog Systems ◽

Cross Domain ◽

Crucial Component ◽

Semantic Concepts ◽

The One ◽

Task Oriented ◽

Slot Filling ◽

Language Description

Slot filling is a crucial component in task-oriented dialog systems that is used to parse (user) utterances into semantic concepts called slots. An ontology is defined by the collection of slots and the values that each slot can take. The most widely used practice of treating slot filling as a sequence labeling task suffers from two main drawbacks. First, the ontology is usually pre-defined and fixed and therefore is not able to detect new labels for unseen slots. Second, the one-hot encoding of slot labels ignores the correlations between slots with similar semantics, which makes it difficult to share knowledge learned across different domains. To address these problems, we propose a new model called elastic conditional random field (eCRF), where each slot is represented by the embedding of its natural language description and modeled by a CRF layer. New slot values can be detected by eCRF whenever a language description is available for the slot. In our experiment, we show that eCRFs outperform existing models in both in-domain and cross-domain tasks, especially in predicting unseen slots and values.
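The second drawback, that one-hot slot labels carry no information about related slots while description-based representations can, is easy to see in a toy example. This is not the eCRF model: the slot names, their descriptions, and the bag-of-words embedding are assumptions chosen only to make the contrast concrete.

```python
import math
from collections import Counter


def one_hot(index, size):
    """One-hot label vector for the slot at the given index."""
    return [1.0 if i == index else 0.0 for i in range(size)]


def bow_embedding(description, vocab):
    """Toy bag-of-words embedding of a slot description (stand-in for a learned encoder)."""
    counts = Counter(description.lower().split())
    return [float(counts[word]) for word in vocab]


def cosine(u, v):
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


# Hypothetical slots with natural language descriptions.
descriptions = {
    "departure_city": "city the user departs from",
    "arrival_city": "city the user arrives at",
}
vocab = sorted({w for d in descriptions.values() for w in d.lower().split()})

one_hot_sim = cosine(one_hot(0, 2), one_hot(1, 2))
description_sim = cosine(
    bow_embedding(descriptions["departure_city"], vocab),
    bow_embedding(descriptions["arrival_city"], vocab),
)
print(one_hot_sim, description_sim)
```

The one-hot similarity is zero for any two distinct slots, whereas the description-based vectors share the words "city", "the", and "user" and are therefore similar. A previously unseen slot can also be embedded as soon as a description exists for it.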

Cross-domain Slot Filling with Distinct Slot Entity and Type Prediction

10.1007/978-3-030-88480-2_41 ◽

2021 ◽

pp. 517-528

Author(s):

Shudong Liu ◽

Peijie Huang ◽

Zhanbiao Zhu ◽

Hualin Zhang ◽

Jianying Tan

Keyword(s):

Cross Domain ◽

Slot Filling

Struo: a pipeline for building custom databases for common metagenome profilers

Bioinformatics ◽

10.1093/bioinformatics/btz899 ◽

2019 ◽

Vol 36 (7) ◽

pp. 2314-2315 ◽

Cited By ~ 7

Author(s):

Jacobo de la Cuesta-Zuluaga ◽

Ruth E Ley ◽

Nicholas D Youngblut

Keyword(s):

Microbial Diversity ◽

Microbial Communities ◽

Source Code ◽

Supplementary Information ◽

Functional Information ◽

Supplementary Data ◽

Microbial Genomes ◽

Public Repositories ◽

Database Content

Abstract

Summary: Taxonomic and functional information from microbial communities can be efficiently obtained by metagenome profiling, which requires databases of genes and genomes to which sequence reads are mapped. However, the databases that accompany metagenome profilers are not updated at a pace that matches the increase in available microbial genomes, and unifying database content across metagenome profiling tools can be cumbersome. To address this, we developed Struo, a modular pipeline that automatizes the acquisition of genomes from public repositories and the construction of custom databases for multiple metagenome profilers. The use of custom databases that broadly represent the known microbial diversity by incorporating novel genomes results in a substantial increase in mappability of reads in synthetic and real metagenome datasets.

Availability and implementation: Source code available for download at https://github.com/leylabmpi/Struo. Custom Genome Taxonomy Database (GTDB) databases available at http://ftp.tue.mpg.de/ebio/projects/struo/.

Supplementary information: Supplementary data are available at Bioinformatics online.
