Combination of Multi-view Multi-source Language Classifiers for Cross-Lingual Sentiment Classification

In recent years great success has been achieved in sentiment classification for English, thanks in part to the availability of copious annotated resources. Unfortunately, most languages do not enjoy such an abundance of labeled data. To tackle the sentiment classification problem in low-resource languages without adequate annotated data, we propose an Adversarial Deep Averaging Network (ADAN 1 ) to transfer the knowledge learned from labeled data on a resource-rich source language to low-resource languages where only unlabeled data exist. ADAN has two discriminative branches: a sentiment classifier and an adversarial language discriminator. Both branches take input from a shared feature extractor to learn hidden representations that are simultaneously indicative for the classification task and invariant across languages. Experiments on Chinese and Arabic sentiment classification demonstrate that ADAN significantly outperforms state-of-the-art systems.

Download Full-text

Reinforced Transformer with Cross-Lingual Distillation for Cross-Lingual Aspect Sentiment Classification

Electronics ◽

10.3390/electronics10030270 ◽

2021 ◽

Vol 10 (3) ◽

pp. 270

Author(s):

Hanqian Wu ◽

Zhike Wang ◽

Feng Qing ◽

Shoushan Li

Keyword(s):

General Purpose ◽

Sentiment Classification ◽

Training Data ◽

Target Language ◽

Source Language ◽

Domain Specific ◽

Novel Approach ◽

The Rich ◽

Target Languages ◽

Cross Lingual

Though great progress has been made in the Aspect-Based Sentiment Analysis(ABSA) task through research, most of the previous work focuses on English-based ABSA problems, and there are few efforts on other languages mainly due to the lack of training data. In this paper, we propose an approach for performing a Cross-Lingual Aspect Sentiment Classification (CLASC) task which leverages the rich resources in one language (source language) for aspect sentiment classification in a under-resourced language (target language). Specifically, we first build a bilingual lexicon for domain-specific training data to translate the aspect category annotated in the source-language corpus and then translate sentences from the source language to the target language via Machine Translation (MT) tools. However, most MT systems are general-purpose, it non-avoidably introduces translation ambiguities which would degrade the performance of CLASC. In this context, we propose a novel approach called Reinforced Transformer with Cross-Lingual Distillation (RTCLD) combined with target-sensitive adversarial learning to minimize the undesirable effects of translation ambiguities in sentence translation. We conduct experiments on different language combinations, treating English as the source language and Chinese, Russian, and Spanish as target languages. The experimental results show that our proposed approach outperforms the state-of-the-art methods on different target languages.

Download Full-text

Emoji-Powered Representation Learning for Cross-Lingual Sentiment Classification (Extended Abstract)

Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2020/649 ◽

2020 ◽

Author(s):

Zhenpeng Chen ◽

Sheng Shen ◽

Ziniu Hu ◽

Xuan Lu ◽

Qiaozhu Mei ◽

...

Keyword(s):

Machine Translation ◽

Representation Learning ◽

Sentiment Classification ◽

Target Language ◽

Learning Method ◽

Source Language ◽

Translation Tools ◽

Target Languages ◽

Cross Lingual ◽

Cross Language

Sentiment classification typically relies on a large amount of labeled data. In practice, the availability of labels is highly imbalanced among different languages. To tackle this problem, cross-lingual sentiment classification approaches aim to transfer knowledge learned from one language that has abundant labeled examples (i.e., the source language, usually English) to another language with fewer labels (i.e., the target language). The source and the target languages are usually bridged through off-the-shelf machine translation tools. Through such a channel, cross-language sentiment patterns can be successfully learned from English and transferred into the target languages. This approach, however, often fails to capture sentiment knowledge specific to the target language. In this paper, we employ emojis, which are widely available in many languages, as a new channel to learn both the cross-language and the language-specific sentiment patterns. We propose a novel representation learning method that uses emoji prediction as an instrument to learn respective sentiment-aware representations for each language. The learned representations are then integrated to facilitate cross-lingual sentiment classification.

Download Full-text

Deep Learning in Cross-Lingual English-Vietnamese Sentiment Classification

2018 10th International Conference on Knowledge and Systems Engineering (KSE) ◽

10.1109/kse.2018.8573366 ◽

2018 ◽

Author(s):

Alexander Sedunov ◽

Hady Salloum ◽

Alexander Sutin ◽

Nikolay Sedunov

Keyword(s):

Deep Learning ◽

Sentiment Classification ◽

Cross Lingual

Download Full-text

Co-training for cross-lingual sentiment classification

Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - ACL-IJCNLP '09 ◽

10.3115/1687878.1687913 ◽

2009 ◽

Cited By ~ 127

Author(s):

Xiaojun Wan

Keyword(s):

Sentiment Classification ◽

Cross Lingual

Download Full-text

Zero-Shot Learning for Cross-Lingual News Sentiment Classification

Applied Sciences ◽

10.3390/app10175993 ◽

2020 ◽

Vol 10 (17) ◽

pp. 5993

Author(s):

Andraž Pelicon ◽

Marko Pranjić ◽

Dragana Miljković ◽

Blaž Škrlj ◽

Senja Pollak

Keyword(s):

Classification System ◽

State Of The Art ◽

Sentiment Classification ◽

Training Data ◽

Test Set ◽

Novel Technique ◽

Analysis Task ◽

Negative News ◽

Cross Lingual ◽

News Sentiment

In this paper, we address the task of zero-shot cross-lingual news sentiment classification. Given the annotated dataset of positive, neutral, and negative news in Slovene, the aim is to develop a news classification system that assigns the sentiment category not only to Slovene news, but to news in another language without any training data required. Our system is based on the multilingual BERTmodel, while we test different approaches for handling long documents and propose a novel technique for sentiment enrichment of the BERT model as an intermediate training step. With the proposed approach, we achieve state-of-the-art performance on the sentiment analysis task on Slovenian news. We evaluate the zero-shot cross-lingual capabilities of our system on a novel news sentiment test set in Croatian. The results show that the cross-lingual approach also largely outperforms the majority classifier, as well as all settings without sentiment enrichment in pre-training.

Download Full-text

Transfer Learning for Cross-Lingual Sentiment Classification with Weakly Shared Deep Neural Networks

Proceedings of the 39th International ACM SIGIR conference on Research and Development in Information Retrieval - SIGIR '16 ◽

10.1145/2911451.2911490 ◽

2016 ◽

Cited By ~ 8

Author(s):

Guangyou Zhou ◽

Zhao Zeng ◽

Jimmy Xiangji Huang ◽

Tingting He

Keyword(s):

Neural Networks ◽

Transfer Learning ◽

Deep Neural Networks ◽

Sentiment Classification ◽

Cross Lingual

Download Full-text

Data Quality Controlling for Cross-Lingual Sentiment Classification

2013 International Conference on Asian Language Processing ◽

10.1109/ialp.2013.43 ◽

2013 ◽

Cited By ~ 1

Author(s):

Shoushan Li ◽

Yunxia Xue ◽

Zhongqing Wang ◽

Sophia Yat Mei Lee ◽

Chu-Ren Huang

Keyword(s):

Data Quality ◽

Sentiment Classification ◽

Cross Lingual

Download Full-text

Distributional Correspondence Indexing for Cross-Lingual and Cross-Domain Sentiment Classification (Extended Abstract)

Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2018/802 ◽

2018 ◽

Author(s):

Alejandro Moreo Fernández ◽

Andrea Esuli ◽

Fabrizio Sebastiani

Keyword(s):

Domain Adaptation ◽

State Of The Art ◽

Sentiment Classification ◽

Training Data ◽

Target Domain ◽

Source Domain ◽

Machine Learning Methods ◽

Cross Domain ◽

Current State ◽

Cross Lingual

Domain Adaptation (DA) techniques aim at enabling machine learning methods learn effective classifiers for a “target” domain when the only available training data belongs to a different “source” domain. In this extended abstract, we briefly describe our new DA method called Distributional Correspondence Indexing (DCI) for sentiment classification. DCI derives term representations in a vector space common to both domains where each dimension reflects its distributional correspondence to a pivot, i.e., to a highly predictive term that behaves similarly across domains. The experiments we have conducted show that DCI obtains better performance than current state-of-the-art techniques for cross-lingual and cross-domain sentiment classification.

Download Full-text

Cross-Lingual Sentiment Classification from English to Arabic using Machine Translation

International Journal of Advanced Computer Science and Applications ◽

10.14569/ijacsa.2017.081257 ◽

2017 ◽

Vol 8 (12) ◽

Cited By ~ 3

Author(s):

Adel Al-Shabi ◽

Aisah Adel ◽

Nazlia Omar ◽

Tareq Al-Moslmi

Keyword(s):

Machine Translation ◽

Sentiment Classification ◽

Cross Lingual

Download Full-text

Combination of Multi-view Multi-source Language Classifiers for Cross-Lingual Sentiment Classification

Adversarial Deep Averaging Networks for Cross-Lingual Sentiment Classification

Reinforced Transformer with Cross-Lingual Distillation for Cross-Lingual Aspect Sentiment Classification

Emoji-Powered Representation Learning for Cross-Lingual Sentiment Classification (Extended Abstract)

Deep Learning in Cross-Lingual English-Vietnamese Sentiment Classification

Co-training for cross-lingual sentiment classification

Zero-Shot Learning for Cross-Lingual News Sentiment Classification

Transfer Learning for Cross-Lingual Sentiment Classification with Weakly Shared Deep Neural Networks

Data Quality Controlling for Cross-Lingual Sentiment Classification

Distributional Correspondence Indexing for Cross-Lingual and Cross-Domain Sentiment Classification (Extended Abstract)

Cross-Lingual Sentiment Classification from English to Arabic using Machine Translation

Export Citation Format