Learning to Map Frequent Phrases to Sub-Structures of Meaning Representation for Neural Semantic Parsing

Neural semantic parsers usually generate meaning representation tokens from natural language tokens via an encoder-decoder model. However, there is often a vocabulary-mismatch problem between natural language utterances and logical forms. That is, one word maps to several atomic logical tokens, which need to be handled as a whole, rather than individual logical tokens at multiple steps. In this paper, we propose that the vocabulary-mismatch problem can be effectively resolved by leveraging appropriate logical tokens. Specifically, we exploit macro actions, which are of the same granularity of words/phrases, and allow the model to learn mappings from frequent phrases to corresponding sub-structures of meaning representation. Furthermore, macro actions are compact, and therefore utilizing them can significantly reduce the search space, which brings a great benefit to weakly supervised semantic parsing. Experiments show that our method leads to substantial performance improvement on three benchmarks, in both supervised and weakly supervised settings.

Download Full-text

A New Corpus and Imitation Learning Framework for Context-Dependent Semantic Parsing

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00202 ◽

2014 ◽

Vol 2 ◽

pp. 547-560 ◽

Cited By ~ 4

Author(s):

Andreas Vlachos ◽

Stephen Clark

Keyword(s):

Information System ◽

Natural Language ◽

Learning Algorithm ◽

Imitation Learning ◽

Semantic Parsing ◽

Learning Framework ◽

Test Sets ◽

Context Dependent ◽

Meaning Representation ◽

Tourist Information

Semantic parsing is the task of translating natural language utterances into a machine-interpretable meaning representation. Most approaches to this task have been evaluated on a small number of existing corpora which assume that all utterances must be interpreted according to a database and typically ignore context. In this paper we present a new, publicly available corpus for context-dependent semantic parsing. The MRL used for the annotation was designed to support a portable, interactive tourist information system. We develop a semantic parser for this corpus by adapting the imitation learning algorithm DAgger without requiring alignment information during training. DAgger improves upon independently trained classifiers by 9.0 and 4.8 points in F-score on the development and test sets respectively.

Download Full-text

Semantic Parsing of Ambiguous Input through Paraphrasing and Verification

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00159 ◽

2015 ◽

Vol 3 ◽

pp. 571-584

Author(s):

Philip Arthur ◽

Graham Neubig ◽

Sakriani Sakti ◽

Tomoki Toda ◽

Satoshi Nakamura

Keyword(s):

Natural Language ◽

Language Model ◽

Input String ◽

New Method ◽

Semantic Parsing ◽

Search Queries ◽

Input Sentence ◽

Meaning Representation ◽

Context Free ◽

Do So

We propose a new method for semantic parsing of ambiguous and ungrammatical input, such as search queries. We do so by building on an existing semantic parsing framework that uses synchronous context free grammars (SCFG) to jointly model the input sentence and output meaning representation. We generalize this SCFG framework to allow not one, but multiple outputs. Using this formalism, we construct a grammar that takes an ambiguous input string and jointly maps it into both a meaning representation and a natural language paraphrase that is less ambiguous than the original input. This paraphrase can be used to disambiguate the meaning representation via verification using a language model that calculates the probability of each paraphrase.

Download Full-text

Effective Search of Logical Forms for Weakly Supervised Knowledge-Based Question Answering

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/308 ◽

2020 ◽

Author(s):

Tao Shen ◽

Xiubo Geng ◽

Guodong Long ◽

Jing Jiang ◽

Chengqi Zhang ◽

...

Keyword(s):

Question Answering ◽

Logical Form ◽

Computation Time ◽

Search Space ◽

Training Data ◽

Semantic Parsing ◽

Weak Supervision ◽

Knowledge Based ◽

Model Training ◽

Weakly Supervised

Many algorithms for Knowledge-Based Question Answering (KBQA) depend on semantic parsing, which translates a question to its logical form. When only weak supervision is provided, it is usually necessary to search valid logical forms for model training. However, a complex question typically involves a huge search space, which creates two main problems: 1) the solutions limited by computation time and memory usually reduce the success rate of the search, and 2) spurious logical forms in the search results degrade the quality of training data. These two problems lead to a poorly-trained semantic parsing model. In this work, we propose an effective search method for weakly supervised KBQA based on operator prediction for questions. With search space constrained by predicted operators, sufficient search paths can be explored, more valid logical forms can be derived, and operators possibly causing spurious logical forms can be avoided. As a result, a larger proportion of questions in a weakly supervised training set are equipped with logical forms, and fewer spurious logical forms are generated. Such high-quality training data directly contributes to a better semantic parsing model. Experimental results on one of the largest KBQA datasets (i.e., CSQA) verify the effectiveness of our approach and deliver a new state-of-the-art performance.

Download Full-text

Break It Down: A Question Understanding Benchmark

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00309 ◽

2020 ◽

Vol 8 ◽

pp. 183-198

Author(s):

Tomer Wolfson ◽

Mor Geva ◽

Ankit Gupta ◽

Matt Gardner ◽

Yoav Goldberg ◽

...

Keyword(s):

Natural Language ◽

Formal Language ◽

Question Answering ◽

Semantic Parsing ◽

Open Domain ◽

Meaning Representation

Understanding natural language questions entails the ability to break down a question into the requisite steps for computing its answer. In this work, we introduce a Question Decomposition Meaning Representation (QDMR) for questions. QDMR constitutes the ordered list of steps, expressed through natural language, that are necessary for answering a question. We develop a crowdsourcing pipeline, showing that quality QDMRs can be annotated at scale, and release the Break dataset, containing over 83K pairs of questions and their QDMRs. We demonstrate the utility of QDMR by showing that (a) it can be used to improve open-domain question answering on the HotpotQA dataset, (b) it can be deterministically converted to a pseudo-SQL formal language, which can alleviate annotation in semantic parsing applications. Last, we use Break to train a sequence-to-sequence model with copying that parses questions into QDMR structures, and show that it substantially outperforms several natural baselines.

Download Full-text

Weakly Supervised Learning of Semantic Parsers for Mapping Instructions to Actions

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00209 ◽

2013 ◽

Vol 1 ◽

pp. 49-62 ◽

Cited By ~ 51

Author(s):

Yoav Artzi ◽

Luke Zettlemoyer

Keyword(s):

Natural Language ◽

Supervised Learning ◽

State Of The Art ◽

Semantic Parsing ◽

Weak Supervision ◽

Instruction Sets ◽

Strong Signal ◽

Previous State ◽

Strong Performance ◽

Weakly Supervised

The context in which language is used provides a strong signal for learning to recover its meaning. In this paper, we show it can be used within a grounded CCG semantic parsing approach that learns a joint model of meaning and context for interpreting and executing natural language instructions, using various types of weak supervision. The joint nature provides crucial benefits by allowing situated cues, such as the set of visible objects, to directly influence learning. It also enables algorithms that learn while executing instructions, for example by trying to replicate human actions. Experiments on a benchmark navigational dataset demonstrate strong performance under differing forms of supervision, including correctly executing 60% more instruction sets relative to the previous state of the art.

Download Full-text

Large-scale Semantic Parsing without Question-Answer Pairs

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00190 ◽

2014 ◽

Vol 2 ◽

pp. 377-392 ◽

Cited By ~ 40

Author(s):

Siva Reddy ◽

Mirella Lapata ◽

Mark Steedman

Keyword(s):

Natural Language ◽

Large Scale ◽

Graph Matching ◽

State Of The Art ◽

The State ◽

Semantic Parsing ◽

Matching Problem ◽

Weak Supervision ◽

Benchmark Datasets

In this paper we introduce a novel semantic parsing approach to query Freebase in natural language without requiring manual annotations or question-answer pairs. Our key insight is to represent natural language via semantic graphs whose topology shares many commonalities with Freebase. Given this representation, we conceptualize semantic parsing as a graph matching problem. Our model converts sentences to semantic graphs using CCG and subsequently grounds them to Freebase guided by denotations as a form of weak supervision. Evaluation experiments on a subset of the Free917 and WebQuestions benchmark datasets show our semantic parser improves over the state of the art.

Download Full-text

Transforming meaning representation grammars to improve semantic parsing

Proceedings of the Twelfth Conference on Computational Natural Language Learning - CoNLL '08 ◽

10.3115/1596324.1596331 ◽

2008 ◽

Cited By ~ 1

Author(s):

Rohit J. Kate

Keyword(s):

Semantic Parsing ◽

Meaning Representation

Download Full-text

Proceedings of the Shared Task on Cross-Framework Meaning Representation Parsing at the 2019 Conference on Natural Language Learning

10.18653/v1/k19-2 ◽

2019 ◽

Keyword(s):

Natural Language ◽

Language Learning ◽

Shared Task ◽

Natural Language Learning ◽

Meaning Representation

Download Full-text

RECPARSER: A Recursive Semantic Parsing Framework for Text-to-SQL Task

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/504 ◽

2020 ◽

Author(s):

Yu Zeng ◽

Yan Gao ◽

Jiaqi Guo ◽

Bei Chen ◽

Qian Liu ◽

...

Keyword(s):

State Of The Art ◽

Search Space ◽

Layer By Layer ◽

Semantic Parsing ◽

Query Generation ◽

Sql Query ◽

Generation Problem

Neural semantic parsers usually fail to parse long and complicated utterances into nested SQL queries, due to the large search space. In this paper, we propose a novel recursive semantic parsing framework called RECPARSER to generate the nested SQL query layer-by-layer. It decomposes the complicated nested SQL query generation problem into several progressive non-nested SQL query generation problems. Furthermore, we propose a novel Question Decomposer module to explicitly encourage RECPARSER to focus on different components of an utterance when predicting SQL queries of different layers. Experiments on the Spider dataset show that our approach is more effective compared to the previous works at predicting the nested SQL queries. In addition, we achieve an overall accuracy that is comparable with state-of-the-art approaches.

Download Full-text

Natural Language as the Basis for Meaning Representation and Inference

Computational Linguistics and Intelligent Text Processing - Lecture Notes in Computer Science ◽

10.1007/978-3-540-78135-6_14 ◽

2008 ◽

pp. 151-170 ◽

Cited By ~ 2

Author(s):

Ido Dagan ◽

Roy Bar-Haim ◽

Idan Szpektor ◽

Iddo Greental ◽

Eyal Shnarch

Keyword(s):

Natural Language ◽

Meaning Representation

Download Full-text