Learning to Parse Natural Language to Grounded Reward Functions with Weak Supervision

In this paper we introduce a novel semantic parsing approach to query Freebase in natural language without requiring manual annotations or question-answer pairs. Our key insight is to represent natural language via semantic graphs whose topology shares many commonalities with Freebase. Given this representation, we conceptualize semantic parsing as a graph matching problem. Our model converts sentences to semantic graphs using CCG and subsequently grounds them to Freebase guided by denotations as a form of weak supervision. Evaluation experiments on a subset of the Free917 and WebQuestions benchmark datasets show our semantic parser improves over the state of the art.

Download Full-text

Deep Distant Supervision: Learning Statistical Relational Models for Weak Supervision in Natural Language Extraction

Solving Large Scale Learning Tasks. Challenges and Algorithms - Lecture Notes in Computer Science ◽

10.1007/978-3-319-41706-6_18 ◽

2016 ◽

pp. 331-345

Author(s):

Sriraam Natarajan ◽

Ameet Soni ◽

Anurag Wazalwar ◽

Dileep Viswanathan ◽

Kristian Kersting

Keyword(s):

Natural Language ◽

Relational Models ◽

Weak Supervision ◽

Distant Supervision

Download Full-text

Sketch-Driven Regular Expression Generation from Natural Language and Examples

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00339 ◽

2020 ◽

Vol 8 ◽

pp. 679-694

Author(s):

Xi Ye ◽

Qiaochu Chen ◽

Xinyu Wang ◽

Isil Dillig ◽

Greg Durrett

Keyword(s):

Natural Language ◽

Real World ◽

Regular Expression ◽

State Of The Art ◽

Neural Systems ◽

Regular Expressions ◽

Weak Supervision ◽

Stack Overflow ◽

Neural Maps ◽

The Given

Recent systems for converting natural language descriptions into regular expressions (regexes) have achieved some success, but typically deal with short, formulaic text and can only produce simple regexes. Real-world regexes are complex, hard to describe with brief sentences, and sometimes require examples to fully convey the user’s intent. We present a framework for regex synthesis in this setting where both natural language (NL) and examples are available. First, a semantic parser (either grammar-based or neural) maps the natural language description into an intermediate sketch, which is an incomplete regex containing holes to denote missing components. Then a program synthesizer searches over the regex space defined by the sketch and finds a regex that is consistent with the given string examples. Our semantic parser can be trained purely from weak supervision based on correctness of the synthesized regex, or it can leverage heuristically derived sketches. We evaluate on two prior datasets (Kushman and Barzilay 2013 ; Locascio et al. 2016 ) and a real-world dataset from Stack Overflow. Our system achieves state-of-the-art performance on the prior datasets and solves 57% of the real-world dataset, which existing neural systems completely fail on. 1

Download Full-text

Weakly Supervised Learning of Semantic Parsers for Mapping Instructions to Actions

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00209 ◽

2013 ◽

Vol 1 ◽

pp. 49-62 ◽

Cited By ~ 51

Author(s):

Yoav Artzi ◽

Luke Zettlemoyer

Keyword(s):

Natural Language ◽

Supervised Learning ◽

State Of The Art ◽

Semantic Parsing ◽

Weak Supervision ◽

Instruction Sets ◽

Strong Signal ◽

Previous State ◽

Strong Performance ◽

Weakly Supervised

The context in which language is used provides a strong signal for learning to recover its meaning. In this paper, we show it can be used within a grounded CCG semantic parsing approach that learns a joint model of meaning and context for interpreting and executing natural language instructions, using various types of weak supervision. The joint nature provides crucial benefits by allowing situated cues, such as the set of visible objects, to directly influence learning. It also enables algorithms that learn while executing instructions, for example by trying to replicate human actions. Experiments on a benchmark navigational dataset demonstrate strong performance under differing forms of supervision, including correctly executing 60% more instruction sets relative to the previous state of the art.

Download Full-text

Using Natural Language for Reward Shaping in Reinforcement Learning

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/331 ◽

2019 ◽

Cited By ~ 4

Author(s):

Prasoon Goyal ◽

Scott Niekum ◽

Raymond J. Mooney

Keyword(s):

Reinforcement Learning ◽

Video Games ◽

Natural Language ◽

Learning Algorithm ◽

Interaction Time ◽

Free Form ◽

Reward Shaping ◽

Strong Performance ◽

Reward Functions ◽

Reinforcement Learning Algorithm

Recent reinforcement learning (RL) approaches have shown strong performance in complex domains, such as Atari games, but are highly sample inefficient. A common approach to reduce interaction time with the environment is to use reward shaping, which involves carefully designing reward functions that provide the agent intermediate rewards for progress towards the goal. Designing such rewards remains a challenge, though. In this work, we use natural language instructions to perform reward shaping. We propose a framework that maps free-form natural language instructions to intermediate rewards, that can seamlessly be integrated into any standard reinforcement learning algorithm. We experiment with Montezuma's Revenge from the Atari video games domain, a popular benchmark in RL. Our experiments on a diverse set of 15 tasks demonstrate that for the same number of interactions with the environment, using language-based rewards can successfully complete the task 60% more often, averaged across all tasks, compared to learning without language.

Download Full-text