Cross-Lingual Dependency Parsing with Unlabeled Auxiliary Languages

2019 · Author(s): Wasi Uddin Ahmad, Zhisong Zhang, Xuezhe Ma, Kai-Wei Chang, Nanyun Peng

2014 · Author(s): Željko Agić, Jörg Tiedemann, Danijela Merkler, Simon Krek, Kaja Dobrovoljc, et al.

Author(s): Shu Jiang, Zuchao Li, Hai Zhao, Bao-Liang Lu, Rui Wang

In recent years, research on dependency parsing has focused on improving accuracy on in-domain test sets and has made remarkable progress. However, the real world contains innumerable scenarios that such datasets do not cover, i.e., out-of-domain data. As a result, parsers that perform well on in-domain data usually suffer significant performance degradation on out-of-domain data. Adapting a high-performing in-domain parser to a new domain therefore requires cross-domain transfer learning. This paper examines two scenarios for cross-domain transfer learning: semi-supervised and unsupervised. Specifically, we adopt the pre-trained language model BERT for training on the source-domain (in-domain) data at the subword level and introduce self-training methods derived from tri-training for the two scenarios. Evaluation results on the NLPCC-2019 shared task and the universal dependency parsing task indicate the effectiveness of the adopted approaches for cross-domain transfer learning and show the potential of self-training for cross-lingual transfer learning.
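The tri-training variant of self-training mentioned in the abstract can be illustrated with a minimal sketch. The train_fn/parse_fn callables and the exact-agreement criterion below are assumptions for illustration only, not the authors' implementation:

    def tri_train(train_fn, parse_fn, source_treebank, target_sentences, rounds=3):
        # Three parsers bootstrap one another: a raw target-domain sentence is
        # pseudo-labeled for parser i whenever the other two parsers agree on
        # its tree; parser i is then retrained on gold + pseudo-labeled data.
        # source_treebank: list of (sentence, gold_tree) pairs
        # target_sentences: list of raw (unlabeled) target-domain sentences
        models = [train_fn(source_treebank) for _ in range(3)]
        for _ in range(rounds):
            new_models = []
            for i in range(3):
                j, k = (i + 1) % 3, (i + 2) % 3
                pseudo = []
                for sent in target_sentences:
                    tree_j = parse_fn(models[j], sent)
                    tree_k = parse_fn(models[k], sent)
                    if tree_j == tree_k:              # the two "teachers" agree
                        pseudo.append((sent, tree_j))
                new_models.append(train_fn(source_treebank + pseudo))
            models = new_models
        return models

In practice the three parsers are diversified (e.g., by different initializations or data views) so that their agreement carries information.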


2019 · Vol. 7, pp. 643–659 · Author(s): Amichay Doitch, Ram Yazdi, Tamir Hazan, Roi Reichart

The best solution of structured prediction models in NLP is often inaccurate because of the limited expressive power of the model or non-exact parameter estimation. One way to mitigate this problem is to sample candidate solutions from the model's solution space, reasoning that effective exploration of this space should yield high-quality solutions. Unfortunately, sampling is often computationally hard, and many works hence back off to sub-optimal strategies, such as extracting the best-scoring solutions of the model, which are not as diverse as sampled solutions. In this paper we propose a perturbation-based approach in which sampling from a probabilistic model is computationally efficient. We present a learning algorithm for the variance of the perturbations and empirically demonstrate its importance. Moreover, while finding the argmax in our model is intractable, we propose an efficient and effective approximation. We apply our framework to cross-lingual dependency parsing across 72 corpora from 42 languages and to lightly supervised dependency parsing across 13 corpora from 12 languages, and demonstrate strong results in terms of both the quality of the entire solution list and of the final solution.
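The perturb-and-sample idea can be sketched for first-order dependency parsing: add Gumbel noise to the arc scores and decode the perturbed scores. In this minimal sketch the decoder greedily picks each word's best head (real parsers use Chu-Liu/Edmonds MST decoding to enforce the tree constraint), and noise_scale stands in for the perturbation variance that the paper learns; all names are illustrative:

    import numpy as np

    def sample_parses(arc_scores, n_samples=10, noise_scale=1.0, seed=0):
        # arc_scores[h, d]: score of the arc head h -> dependent d,
        # with index 0 reserved for the artificial ROOT token.
        rng = np.random.default_rng(seed)
        samples = []
        for _ in range(n_samples):
            perturbed = arc_scores + rng.gumbel(scale=noise_scale,
                                                size=arc_scores.shape)
            np.fill_diagonal(perturbed, -np.inf)  # a word cannot head itself
            # Greedy decoding: each real word (columns 1..n-1) takes its
            # highest-scoring head under the perturbed scores.
            heads = perturbed[:, 1:].argmax(axis=0)
            samples.append(heads)
        return samples

A larger noise_scale yields a more diverse solution list, while smaller values concentrate the samples around the model's best parse; this is the trade-off the learned variance controls.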


2019 · Author(s): Meishan Zhang, Yue Zhang, Guohong Fu

2019 · Author(s): Wasi Ahmad, Zhisong Zhang, Xuezhe Ma, Eduard Hovy, Kai-Wei Chang, et al.

2018 · Author(s): Niko Partanen, Kyungtae Lim, Michael Rießler, Thierry Poibeau

2015 · Author(s): Jiang Guo, Wanxiang Che, David Yarowsky, Haifeng Wang, Ting Liu

2016 · Vol. 55, pp. 209–248 · Author(s): Jörg Tiedemann, Željko Agić

How do we parse languages for which no treebanks are available? This contribution addresses the cross-lingual viewpoint on statistical dependency parsing, in which we attempt to exploit resource-rich source-language treebanks to build and adapt models for under-resourced target languages. We outline the benefits and indicate the drawbacks of the current major approaches. We emphasize synthetic treebanking: the automatic creation of target-language treebanks by means of annotation projection and machine translation. We present competitive results in cross-lingual dependency parsing using a combination of various techniques that contribute to the overall success of the method. We further include a detailed discussion of the impact of part-of-speech label accuracy on parsing results, which provides guidance for practical applications of cross-lingual methods to truly under-resourced languages.
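The annotation-projection half of synthetic treebanking can be sketched under strong simplifying assumptions, here 1-to-1 word alignments (the function and conventions are illustrative; practical systems also handle many-to-many and missing alignments and repair non-tree output):

    def project_tree(src_heads, alignment, tgt_len):
        # src_heads[d]: head index of source word d, -1 for the root.
        # alignment: dict mapping source word indices to target word indices.
        # Returns target-side heads; unprojected words keep head None.
        tgt_heads = [None] * tgt_len
        for d, h in enumerate(src_heads):
            if d not in alignment:
                continue                          # unaligned dependent: skip
            if h == -1:
                tgt_heads[alignment[d]] = -1      # source root -> target root
            elif h in alignment:
                tgt_heads[alignment[d]] = alignment[h]  # project arc h -> d
        return tgt_heads

    # e.g., project_tree([-1, 0, 0], {0: 1, 1: 0, 2: 2}, 3) returns [1, -1, 1]

The projected trees are then filtered or repaired and used as training data for a target-language parser, optionally combined with trees obtained by machine-translating the source treebank.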

