Cross-lingual Intermediate Fine-tuning improves Dialogue State Tracking

2020 ◽

Vol 34 (05) ◽

pp. 8058-8065

Author(s):

Katharina Kann ◽

Samuel R. Bowman ◽

Kyunghyun Cho

Keyword(s):

Fine Tuning ◽

Target Language ◽

Model Parameters ◽

Transfer Model ◽

Absolute Accuracy ◽

Learning To Learn ◽

Suggested Approach ◽

High Resource ◽

Resource Poor ◽

Cross Lingual

We propose to cast the task of morphological inflection—mapping a lemma to an indicated inflected form—for resource-poor languages as a meta-learning problem. Treating each language as a separate task, we use data from high-resource source languages to learn a set of model parameters that can serve as a strong initialization point for fine-tuning on a resource-poor target language. Experiments with two model architectures on 29 target languages from 3 families show that our suggested approach outperforms all baselines. In particular, it obtains a 31.7% higher absolute accuracy than a previously proposed cross-lingual transfer model and outperforms the previous state of the art by 1.7% absolute accuracy on average over languages.

Download Full-text

Consistency Regularization for Cross-Lingual Fine-Tuning

10.18653/v1/2021.acl-long.264 ◽

2021 ◽

Author(s):

Bo Zheng ◽

Li Dong ◽

Shaohan Huang ◽

Wenhui Wang ◽

Zewen Chi ◽

...

Keyword(s):

Fine Tuning ◽

Cross Lingual

Download Full-text

Semantic Specialization of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints

Transactions of the Association for Computational Linguistics ◽

10.1162/tacl_a_00063 ◽

2017 ◽

Vol 5 ◽

pp. 309-324 ◽

Cited By ~ 11

Author(s):

Nikola Mrkšić ◽

Ivan Vulić ◽

Diarmuid Ó Séaghdha ◽

Ira Leviant ◽

Roi Reichart ◽

...

Keyword(s):

State Of The Art ◽

Vector Spaces ◽

Lexical Resources ◽

High Quality ◽

Performance Improvements ◽

State Tracking ◽

Cross Lingual ◽

Semantic Transfer ◽

Multiple Languages

We present Attract-Repel, an algorithm for improving the semantic quality of word vectors by injecting constraints extracted from lexical resources. Attract-Repel facilitates the use of constraints from mono- and cross-lingual resources, yielding semantically specialized cross-lingual vector spaces. Our evaluation shows that the method can make use of existing cross-lingual lexicons to construct high-quality vector spaces for a plethora of different languages, facilitating semantic transfer from high- to lower-resource ones. The effectiveness of our approach is demonstrated with state-of-the-art results on semantic similarity datasets in six languages. We next show that Attract-Repel-specialized vectors boost performance in the downstream task of dialogue state tracking (DST) across multiple languages. Finally, we show that cross-lingual vector spaces produced by our algorithm facilitate the training of multilingual DST models, which brings further performance improvements.

Download Full-text

The Impact of Pretrained Language Models on Negation and Speculation Detection in Cross-Lingual Medical Text: Comparative Study

JMIR Medical Informatics ◽

10.2196/18953 ◽

2020 ◽

Vol 8 (12) ◽

pp. e18953

Author(s):

Renzo Rivera Zavala ◽

Paloma Martinez

Keyword(s):

Machine Learning ◽

Information Extraction ◽

Fine Tuning ◽

Entity Recognition ◽

Language Models ◽

Special Focus ◽

Rule Based ◽

Clinical Narrative ◽

Cross Lingual ◽

The Impact

Background Negation and speculation are critical elements in natural language processing (NLP)-related tasks, such as information extraction, as these phenomena change the truth value of a proposition. In the clinical narrative that is informal, these linguistic facts are used extensively with the objective of indicating hypotheses, impressions, or negative findings. Previous state-of-the-art approaches addressed negation and speculation detection tasks using rule-based methods, but in the last few years, models based on machine learning and deep learning exploiting morphological, syntactic, and semantic features represented as spare and dense vectors have emerged. However, although such methods of named entity recognition (NER) employ a broad set of features, they are limited to existing pretrained models for a specific domain or language. Objective As a fundamental subsystem of any information extraction pipeline, a system for cross-lingual and domain-independent negation and speculation detection was introduced with special focus on the biomedical scientific literature and clinical narrative. In this work, detection of negation and speculation was considered as a sequence-labeling task where cues and the scopes of both phenomena are recognized as a sequence of nested labels recognized in a single step. Methods We proposed the following two approaches for negation and speculation detection: (1) bidirectional long short-term memory (Bi-LSTM) and conditional random field using character, word, and sense embeddings to deal with the extraction of semantic, syntactic, and contextual patterns and (2) bidirectional encoder representations for transformers (BERT) with fine tuning for NER. Results The approach was evaluated for English and Spanish languages on biomedical and review text, particularly with the BioScope corpus, IULA corpus, and SFU Spanish Review corpus, with F-measures of 86.6%, 85.0%, and 88.1%, respectively, for NeuroNER and 86.4%, 80.8%, and 91.7%, respectively, for BERT. Conclusions These results show that these architectures perform considerably better than the previous rule-based and conventional machine learning–based systems. Moreover, our analysis results show that pretrained word embedding and particularly contextualized embedding for biomedical corpora help to understand complexities inherent to biomedical text.

Download Full-text

Cross-lingual Fine-tuning for Abstractive Arabic Text Summarization

10.26615/978-954-452-072-4_074 ◽

2021 ◽

Author(s):

Mram Kahla ◽

◽

Zijian Győző Yang ◽

Attila Novák ◽

◽

...

Keyword(s):

Text Summarization ◽

Fine Tuning ◽

Arabic Text ◽

Arabic Text Summarization ◽

Cross Lingual

Download Full-text

ECO-DST: An Efficient Cross-lingual Dialogue State Tracking Framework

10.1145/3490725.3490737 ◽

2021 ◽

Author(s):

Chao Huang ◽

Hui Di ◽

Lina Wang ◽

Kazushige Ouchi

Keyword(s):

State Tracking ◽

Cross Lingual

Download Full-text

Low-Resource Text Classification via Cross-Lingual Language Model Fine-Tuning

Lecture Notes in Computer Science - Chinese Computational Linguistics ◽

10.1007/978-3-030-63031-7_17 ◽

2020 ◽

pp. 231-246

Author(s):

Xiuhong Li ◽

Zhe Li ◽

Jiabao Sheng ◽

Wushour Slamu

Keyword(s):

Text Classification ◽

Language Model ◽

Fine Tuning ◽

Low Resource ◽

Cross Lingual

Download Full-text

Monolingual and Cross-Lingual Intent Detection without Training Data in Target Languages

Electronics ◽

10.3390/electronics10121412 ◽

2021 ◽

Vol 10 (12) ◽

pp. 1412

Author(s):

Jurgita Kapočiūtė-Dzikienė ◽

Askars Salimbajevs ◽

Raivis Skadiņš

Keyword(s):

Experimental Investigation ◽

Training Data ◽

Fine Tuning ◽

Target Language ◽

Learning Approach ◽

Lazy Learning ◽

Detection Problem ◽

Target Languages ◽

Cross Lingual ◽

Similar Accuracy

Due to recent DNN advancements, many NLP problems can be effectively solved using transformer-based models and supervised data. Unfortunately, such data is not available in some languages. This research is based on assumptions that (1) training data can be obtained by the machine translating it from another language; (2) there are cross-lingual solutions that work without the training data in the target language. Consequently, in this research, we use the English dataset and solve the intent detection problem for five target languages (German, French, Lithuanian, Latvian, and Portuguese). When seeking the most accurate solutions, we investigate BERT-based word and sentence transformers together with eager learning classifiers (CNN, BERT fine-tuning, FFNN) and lazy learning approach (Cosine similarity as the memory-based method). We offer and evaluate several strategies to overcome the data scarcity problem with machine translation, cross-lingual models, and a combination of the previous two. The experimental investigation revealed the robustness of sentence transformers under various cross-lingual conditions. The accuracy equal to ~0.842 is achieved with the English dataset with completely monolingual models is considered our top-line. However, cross-lingual approaches demonstrate similar accuracy levels reaching ~0.831, ~0.829, ~0.853, ~0.831, and ~0.813 on German, French, Lithuanian, Latvian, and Portuguese languages.

Download Full-text

The Impact of Pretrained Language Models on Negation and Speculation Detection in Cross-Lingual Medical Text: Comparative Study (Preprint)

10.2196/preprints.18953 ◽

2020 ◽

Author(s):

Renzo Rivera Zavala ◽

Paloma Martinez

Keyword(s):

Machine Learning ◽

Information Extraction ◽

Fine Tuning ◽

Entity Recognition ◽

Language Models ◽

Special Focus ◽

Rule Based ◽

Clinical Narrative ◽

Cross Lingual ◽

The Impact

BACKGROUND Negation and speculation are critical elements in natural language processing (NLP)-related tasks, such as information extraction, as these phenomena change the truth value of a proposition. In the clinical narrative that is informal, these linguistic facts are used extensively with the objective of indicating hypotheses, impressions, or negative findings. Previous state-of-the-art approaches addressed negation and speculation detection tasks using rule-based methods, but in the last few years, models based on machine learning and deep learning exploiting morphological, syntactic, and semantic features represented as spare and dense vectors have emerged. However, although such methods of named entity recognition (NER) employ a broad set of features, they are limited to existing pretrained models for a specific domain or language. OBJECTIVE As a fundamental subsystem of any information extraction pipeline, a system for cross-lingual and domain-independent negation and speculation detection was introduced with special focus on the biomedical scientific literature and clinical narrative. In this work, detection of negation and speculation was considered as a sequence-labeling task where cues and the scopes of both phenomena are recognized as a sequence of nested labels recognized in a single step. METHODS We proposed the following two approaches for negation and speculation detection: (1) bidirectional long short-term memory (Bi-LSTM) and conditional random field using character, word, and sense embeddings to deal with the extraction of semantic, syntactic, and contextual patterns and (2) bidirectional encoder representations for transformers (BERT) with fine tuning for NER. RESULTS The approach was evaluated for English and Spanish languages on biomedical and review text, particularly with the BioScope corpus, IULA corpus, and SFU Spanish Review corpus, with F-measures of 86.6%, 85.0%, and 88.1%, respectively, for NeuroNER and 86.4%, 80.8%, and 91.7%, respectively, for BERT. CONCLUSIONS These results show that these architectures perform considerably better than the previous rule-based and conventional machine learning–based systems. Moreover, our analysis results show that pretrained word embedding and particularly contextualized embedding for biomedical corpora help to understand complexities inherent to biomedical text.

Download Full-text

Cross-Lingual Lemmatization and Morphology Tagging with Two-Stage Multilingual BERT Fine-Tuning

10.18653/v1/w19-4203 ◽

2019 ◽

Cited By ~ 1

Author(s):

Dan Kondratyuk

Keyword(s):

Fine Tuning ◽

Two Stage ◽

Cross Lingual

Download Full-text

Cross-lingual Intermediate Fine-tuning improves Dialogue State Tracking

Learning to Learn Morphological Inflection for Resource-Poor Languages

Consistency Regularization for Cross-Lingual Fine-Tuning

Semantic Specialization of Distributional Word Vector Spaces using Monolingual and Cross-Lingual Constraints

The Impact of Pretrained Language Models on Negation and Speculation Detection in Cross-Lingual Medical Text: Comparative Study

Cross-lingual Fine-tuning for Abstractive Arabic Text Summarization

ECO-DST: An Efficient Cross-lingual Dialogue State Tracking Framework

Low-Resource Text Classification via Cross-Lingual Language Model Fine-Tuning

Monolingual and Cross-Lingual Intent Detection without Training Data in Target Languages

The Impact of Pretrained Language Models on Negation and Speculation Detection in Cross-Lingual Medical Text: Comparative Study (Preprint)

Cross-Lingual Lemmatization and Morphology Tagging with Two-Stage Multilingual BERT Fine-Tuning

Export Citation Format