Distant Domain Adaptation for Text Classification

Domain adaptation is an important problem in natural language processing (NLP) due to the distributional difference between the labeled source domain and the target domain. In this paper, we study the domain adaptation problem from the instance weighting perspective. By using density ratio as the instance weight, the traditional instance weighting approaches can potentially correct the sample selection bias in domain adaptation. However, researchers often failed to achieve good performance when applying instance weighting to domain adaptation in NLP and many negative results were reported in the literature. In this work, we conduct an in-depth study on the causes of the failure, and find that previous work only focused on reducing the sample selection bias, but ignored another important factor, sample selection variance, in domain adaptation. On this basis, we propose a new instance weighting framework by trading off two factors in instance weight learning. We evaluate our approach on two cross-domain text classification tasks and compare it with eight instance weighting methods. The results prove our approach's advantages in domain adaptation performance, optimization efficiency and parameter stability.

Download Full-text

Representation Learning for Improved Generalization of Adversarial Domain Adaptation with Text Classification

2020 IEEE International Conference on Informatics, IoT, and Enabling Technologies (ICIoT) ◽

10.1109/iciot48696.2020.9089430 ◽

2020 ◽

Author(s):

Alaa Khaddaj ◽

Hazem Hajj

Keyword(s):

Text Classification ◽

Domain Adaptation ◽

Representation Learning

Download Full-text

Unsupervised Energy-based Adversarial Domain Adaptation for Cross-domain Text Classification

10.18653/v1/2021.findings-acl.103 ◽

2021 ◽

Author(s):

Han Zou ◽

Jianfei Yang ◽

Xiaojian Wu

Keyword(s):

Text Classification ◽

Domain Adaptation ◽

Cross Domain

Download Full-text

Domain Adaptation for Text Classification with Weird Embeddings

10.4000/books.aaccademia.8250 ◽

2020 ◽

pp. 37-43

Author(s):

Valerio Basile

Keyword(s):

Text Classification ◽

Domain Adaptation

Download Full-text

Leveraging Accident Investigation Reports as Leading Indicators of Construction Safety Using Text Classification

Construction Research Congress 2020 ◽

10.1061/9780784482872.053 ◽

2020 ◽

Author(s):

Shraddha Shrestha ◽

Syed Ahnaf Morshed ◽

Nipesh Pradhananga ◽

Xuan Lv

Keyword(s):

Text Classification ◽

Construction Safety ◽

Leading Indicators ◽

Accident Investigation

Download Full-text

Domain Adaptation for Visual Recognition

10.1561/9781680830316 ◽

2015 ◽

Author(s):

Raghuraman Gopalan ◽

Ruonan Li ◽

Vishal M. Patel ◽

Rama Chellappa

Keyword(s):

Visual Recognition ◽

Domain Adaptation

Download Full-text

A Brief Survey on Text Classification Using Various Machine Learning Techniques

International Journal of Advanced Research in Computer Science and Software Engineering ◽

10.23956/ijarcsse.v8i1.521 ◽

2018 ◽

Vol 8 (1) ◽

pp. 14

Author(s):

Padmavathi .S ◽

M. Chidambaram

Keyword(s):

Machine Learning ◽

Text Classification ◽

Fixed Number ◽

Machine Learning Techniques ◽

Online Information ◽

Rule Based ◽

Learning Techniques ◽

Machine Learning Approach ◽

Rule Based Approach

Text classification has grown into more significant in managing and organizing the text data due to tremendous growth of online information. It does classification of documents in to fixed number of predefined categories. Rule based approach and Machine learning approach are the two ways of text classification. In rule based approach, classification of documents is done based on manually defined rules. In Machine learning based approach, classification rules or classifier are defined automatically using example documents. It has higher recall and quick process. This paper shows an investigation on text classification utilizing different machine learning techniques.

Download Full-text