Project-then-Transfer: Effective Two-stage Cross-lingual Transfer for Semantic Dependency Parsing

In recent years, the research on dependency parsing focuses on improving the accuracy of the domain-specific (in-domain) test datasets and has made remarkable progress. However, there are innumerable scenarios in the real world that are not covered by the dataset, namely, the out-of-domain dataset. As a result, parsers that perform well on the in-domain data usually suffer from significant performance degradation on the out-of-domain data. Therefore, to adapt the existing in-domain parsers with high performance to a new domain scenario, cross-domain transfer learning methods are essential to solve the domain problem in parsing. This paper examines two scenarios for cross-domain transfer learning: semi-supervised and unsupervised cross-domain transfer learning. Specifically, we adopt a pre-trained language model BERT for training on the source domain (in-domain) data at the subword level and introduce self-training methods varied from tri-training for these two scenarios. The evaluation results on the NLPCC-2019 shared task and universal dependency parsing task indicate the effectiveness of the adopted approaches on cross-domain transfer learning and show the potential of self-learning to cross-lingual transfer learning.

Download Full-text

Perturbation Based Learning for Structured NLP Tasks with Application to Dependency Parsing

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00291 ◽

2019 ◽

Vol 7 ◽

pp. 643-659

Author(s):

Amichay Doitch ◽

Ram Yazdi ◽

Tamir Hazan ◽

Roi Reichart

Keyword(s):

Prediction Models ◽

Learning Algorithm ◽

Solution Space ◽

Optimal Strategies ◽

Expressive Power ◽

Entire Solution ◽

Dependency Parsing ◽

Computationally Efficient ◽

Cross Lingual

The best solution of structured prediction models in NLP is often inaccurate because of limited expressive power of the model or to non-exact parameter estimation. One way to mitigate this problem is sampling candidate solutions from the model’s solution space, reasoning that effective exploration of this space should yield high-quality solutions. Unfortunately, sampling is often computationally hard and many works hence back-off to sub-optimal strategies, such as extraction of the best scoring solutions of the model, which are not as diverse as sampled solutions. In this paper we propose a perturbation-based approach where sampling from a probabilistic model is computationally efficient. We present a learning algorithm for the variance of the perturbations, and empirically demonstrate its importance. Moreover, while finding the argmax in our model is intractable, we propose an efficient and effective approximation. We apply our framework to cross-lingual dependency parsing across 72 corpora from 42 languages and to lightly supervised dependency parsing across 13 corpora from 12 languages, and demonstrate strong results in terms of both the quality of the entire solution list and of the final solution. 1

Download Full-text

OSU_CHGCG at SemEval-2016 Task 9 : Chinese Semantic Dependency Parsing with Generalized Categorial Grammar

10.18653/v1/s16-1189 ◽

2016 ◽

Author(s):

Manjuan Duan ◽

Lifeng Jin ◽

William Schuler

Keyword(s):

Categorial Grammar ◽

Dependency Parsing ◽

Semantic Dependency

Download Full-text

Cross-Lingual Alignment of Contextual Word Embeddings, with Applications to Zero-shot Dependency Parsing

10.18653/v1/n19-1162 ◽

2019 ◽

Cited By ~ 11

Author(s):

Tal Schuster ◽

Ori Ram ◽

Regina Barzilay ◽

Amir Globerson

Keyword(s):

Word Embeddings ◽

Dependency Parsing ◽

Cross Lingual

Download Full-text

Deep Multitask Learning for Semantic Dependency Parsing

10.18653/v1/p17-1186 ◽

2017 ◽

Cited By ~ 8

Author(s):

Hao Peng ◽

Sam Thomson ◽

Noah A. Smith

Keyword(s):

Multitask Learning ◽

Dependency Parsing ◽

Semantic Dependency

Download Full-text

Cross-Lingual Dependency Parsing with Unlabeled Auxiliary Languages

10.18653/v1/k19-1035 ◽

2019 ◽

Author(s):

Wasi Uddin Ahmad ◽

Zhisong Zhang ◽

Xuezhe Ma ◽

Kai-Wei Chang ◽

Nanyun Peng

Keyword(s):

Dependency Parsing ◽

Cross Lingual

Download Full-text

Integrative Semantic Dependency Parsing via Efficient Large-scale Feature Selection

Journal of Artificial Intelligence Research ◽

10.1613/jair.3717 ◽

2013 ◽

Vol 46 ◽

pp. 203-233 ◽

Cited By ~ 7

Author(s):

H. Zhao ◽

X. Zhang ◽

C. Kit

Keyword(s):

Feature Selection ◽

Large Scale ◽

Critical Role ◽

Integrative Approach ◽

Pipeline System ◽

Dependency Parsing ◽

Semantic Parsing ◽

Data Set ◽

Scale Feature ◽

Semantic Dependency

Semantic parsing, i.e., the automatic derivation of meaning representation such as an instantiated predicate-argument structure for a sentence, plays a critical role in deep processing of natural language. Unlike all other top systems of semantic dependency parsing that have to rely on a pipeline framework to chain up a series of submodels each specialized for a specific subtask, the one presented in this article integrates everything into one model, in hopes of achieving desirable integrity and practicality for real applications while maintaining a competitive performance. This integrative approach tackles semantic parsing as a word pair classification problem using a maximum entropy classifier. We leverage adaptive pruning of argument candidates and large-scale feature selection engineering to allow the largest feature space ever in use so far in this field, it achieves a state-of-the-art performance on the evaluation data set for CoNLL-2008 shared task, on top of all but one top pipeline system, confirming its feasibility and effectiveness.

Download Full-text

Semi-Supervised Semantic Dependency Parsing Using CRF Autoencoders

10.18653/v1/2020.acl-main.607 ◽

2020 ◽

Author(s):

Zixia Jia ◽

Youmi Ma ◽

Jiong Cai ◽

Kewei Tu

Keyword(s):

Dependency Parsing ◽

Semantic Dependency

Download Full-text