Automatic construction of domain-specific sentiment lexicon for unsupervised domain adaptation and sentiment classification

2020 ◽  
pp. 106423
Author(s):  
Omid Mohamad Beigi ◽  
Mohammad H. Moattar
2018 ◽  
Vol 6 ◽  
pp. 269-285 ◽  
Author(s):  
Andrius Mudinas ◽  
Dell Zhang ◽  
Mark Levene

There is often the need to perform sentiment classification in a particular domain where no labeled document is available. Although we could make use of a general-purpose off-the-shelf sentiment classifier or a pre-built one for a different domain, the effectiveness would be inferior. In this paper, we explore the possibility of building domain-specific sentiment classifiers with unlabeled documents only. Our investigation indicates that in the word embeddings learned from the unlabeled corpus of a given domain, the distributed word representations (vectors) for opposite sentiments form distinct clusters, though those clusters are not transferable across domains. Exploiting such a clustering structure, we are able to utilize machine learning algorithms to induce a quality domain-specific sentiment lexicon from just a few typical sentiment words (“seeds”). An important finding is that simple linear model based supervised learning algorithms (such as linear SVM) can actually work better than more sophisticated semi-supervised/transductive learning algorithms which represent the state-of-the-art technique for sentiment lexicon induction. The induced lexicon could be applied directly in a lexicon-based method for sentiment classification, but a higher performance could be achieved through a two-phase bootstrapping method which uses the induced lexicon to assign positive/negative sentiment scores to unlabeled documents first, and then uses those documents found to have clear sentiment signals as pseudo-labeled examples to train a document sentiment classifier via supervised learning algorithms (such as LSTM). On several benchmark datasets for document sentiment classification, our end-to-end pipelined approach which is overall unsupervised (except for a tiny set of seed words) outperforms existing unsupervised approaches and achieves an accuracy comparable to that of fully supervised approaches.
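The lexicon induction step described above can be sketched in a few lines: given word vectors from an in-domain embedding and a handful of seed words, a simple linear model separates the two sentiment clusters and scores every other word. The toy vectors, seed lists and the centroid-difference scorer below are illustrative assumptions; the paper itself uses embeddings trained on the unlabeled corpus and a linear SVM over the seed vectors.

```python
import numpy as np

# Toy word vectors standing in for embeddings learned from an unlabeled
# domain corpus (hypothetical values; a real pipeline would use word2vec
# or similar trained on in-domain text).
vectors = {
    "excellent": np.array([ 0.9,  0.8]),
    "great":     np.array([ 0.8,  0.7]),
    "terrible":  np.array([-0.9, -0.8]),
    "awful":     np.array([-0.8, -0.9]),
    "reliable":  np.array([ 0.7,  0.6]),
    "buggy":     np.array([-0.7, -0.6]),
}
pos_seeds = ["excellent", "great"]
neg_seeds = ["terrible", "awful"]

# Simplest possible linear model: the difference between the seed
# centroids defines a sentiment direction; projecting any word vector
# onto it yields a domain-specific polarity score.
direction = (np.mean([vectors[w] for w in pos_seeds], axis=0)
             - np.mean([vectors[w] for w in neg_seeds], axis=0))

def polarity(word):
    """Signed sentiment score: positive means positive sentiment."""
    return float(vectors[word] @ direction)

# Induced domain-specific lexicon: every vocabulary word gets a score.
lexicon = {w: polarity(w) for w in vectors}
```

A linear SVM trained on the seed vectors plays this role in the paper; the centroid-difference direction here is a minimal stand-in that shares its key property of being a linear separator in the embedding space.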


2020 ◽  
Vol 34 (04) ◽  
pp. 6243-6250 ◽  
Author(s):  
Qian Wang ◽  
Toby Breckon

Unsupervised domain adaptation aims to address the problem of classifying unlabeled samples from the target domain whilst labeled samples are only available from the source domain and the data distributions are different in these two domains. As a result, classifiers trained from labeled samples in the source domain suffer from significant performance drop when directly applied to the samples from the target domain. To address this issue, different approaches have been proposed to learn domain-invariant features or domain-specific classifiers. In either case, the lack of labeled samples in the target domain can be an issue which is usually overcome by pseudo-labeling. Inaccurate pseudo-labeling, however, could result in catastrophic error accumulation during learning. In this paper, we propose a novel selective pseudo-labeling strategy based on structured prediction. The idea of structured prediction is inspired by the fact that samples in the target domain are well clustered within the deep feature space so that unsupervised clustering analysis can be used to facilitate accurate pseudo-labeling. Experimental results on four datasets (i.e. Office-Caltech, Office31, ImageCLEF-DA and Office-Home) validate that our approach outperforms contemporary state-of-the-art methods.
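The selective pseudo-labeling idea, cluster the target features, label each cluster by the classifier's majority vote inside it, and keep only the samples nearest their cluster center, can be sketched as follows. This is a simplified stand-in for the paper's structured-prediction formulation; the plain k-means, the majority-vote rule and the distance-based selection fraction are all assumptions.

```python
import numpy as np

def kmeans(X, k, iters=20, seed=0):
    """Plain Lloyd's k-means over target deep features."""
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), k, replace=False)]
    for _ in range(iters):
        dists = np.linalg.norm(X[:, None] - centers[None], axis=2)
        assign = dists.argmin(axis=1)
        for j in range(k):
            if np.any(assign == j):
                centers[j] = X[assign == j].mean(axis=0)
    return assign, centers

def selective_pseudo_labels(X_tgt, clf_pred, k, frac=0.5):
    """Cluster target features, pseudo-label each cluster with the
    majority prediction of a source-trained classifier inside it, and
    keep only the fraction of samples closest to their cluster center
    (the 'selective' part that limits error accumulation)."""
    assign, centers = kmeans(X_tgt, k)
    labels = np.empty(len(X_tgt), dtype=int)
    for j in range(k):
        members = np.flatnonzero(assign == j)
        if len(members):
            labels[members] = np.bincount(clf_pred[members]).argmax()
    dist = np.linalg.norm(X_tgt - centers[assign], axis=1)
    keep = np.argsort(dist)[: int(frac * len(X_tgt))]
    return keep, labels[keep]
```

Because cluster-level majority voting overrides individual noisy predictions, a few classifier errors inside a well-formed cluster are corrected rather than propagated.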


2021 ◽  
Vol ahead-of-print (ahead-of-print) ◽  
Author(s):  
P. Padmavathy ◽  
S. Pakkir Mohideen ◽  
Zameer Gulzar

Purpose – The purpose of this paper is to perform Senti-WordNet (SWN)- and pointwise mutual information (PMI)-based polarity computation and polarity updating. When the SWN polarity and the PMI-based polarity mismatch, the vote flipping algorithm (VFA) is employed.
Design/methodology/approach – Research on sentiment analysis (SA) has recently increased massively in domains such as social media (SM), healthcare, hotels, cars and product data. However, there is no approach for analyzing the positive or negative orientation of every single aspect in a document (a tweet, a review or a piece of news, among others). For SA as well as polarity classification, several researchers have used SWN as a lexical resource. Nevertheless, these lexicons show lower performance for sentiment classification (SC) than domain-specific lexicons (DSL). Likewise, in some scenarios, the same term is used differently in domain and general-knowledge lexicons. Across different domains, most words have a single sentiment class in SWN, while their occurrence in the annotated dataset signals a strong inclination toward the other sentiment class. Hence, this paper concentrates on the drawbacks of adapting a domain-dependent sentiment lexicon (DDSL) from a collection of labeled user reviews and a domain-independent lexicon (DIL), and proposes a framework centered on information theory that can predict the correct polarity of words (positive, neutral and negative). The proposed work first performs SWN- and PMI-based polarity computation and polarity updating, applying the vote flipping algorithm (VFA) when the two polarities mismatch. Finally, the predicted polarity is input to the mtf-idf-based SVM-NN classifier for the SC of reviews.
Findings – The outcomes are examined and compared with other existing techniques, verifying that the proposed work predicts the class of the reviews more effectively across different datasets.
Originality/value – The framework combines SWN- and PMI-based polarity computation with polarity updating, and employs the vote flipping algorithm (VFA) whenever the two polarity estimates disagree.
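A minimal sketch of the PMI side of the polarity computation, assuming add-one smoothing and the standard reduction of PMI(w, pos) − PMI(w, neg) to a log-ratio of the word's relative frequencies in positive versus negative documents; the exact smoothing, the SWN lookup and the VFA step are not reproduced here.

```python
import math
from collections import Counter

def pmi_polarity(docs):
    """PMI-based word polarity from labeled reviews (a sketch).

    docs : list of (tokens, label) pairs with label in {"pos", "neg"}.
    polarity(w) = PMI(w, pos) - PMI(w, neg), which simplifies to the
    log of the ratio of the word's smoothed relative frequency in
    positive vs. negative documents.
    """
    counts = {"pos": Counter(), "neg": Counter()}
    totals = {"pos": 0, "neg": 0}
    for tokens, label in docs:
        counts[label].update(tokens)
        totals[label] += len(tokens)
    vocab = set(counts["pos"]) | set(counts["neg"])
    polarity = {}
    for w in vocab:
        p_pos = (counts["pos"][w] + 1) / (totals["pos"] + len(vocab))
        p_neg = (counts["neg"][w] + 1) / (totals["neg"] + len(vocab))
        polarity[w] = math.log(p_pos / p_neg)
    return polarity
```

In the proposed framework, a score like this would be compared against the SWN prior for the same word, with the VFA resolving the cases where the two signs disagree.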


Author(s):  
Yongchun Zhu ◽  
Fuzhen Zhuang ◽  
Deqing Wang

While Unsupervised Domain Adaptation (UDA), in which labeled data are available only from source domains, has been actively studied in recent years, most algorithms and theoretical results focus on Single-source Unsupervised Domain Adaptation (SUDA). However, in the practical scenario, labeled data can typically be collected from multiple diverse sources, and they might differ not only from the target domain but also from each other. Thus, domain adapters from multiple sources should not be modeled in the same way. Recent deep learning based Multi-source Unsupervised Domain Adaptation (MUDA) algorithms focus on extracting common domain-invariant representations for all domains by aligning the distributions of all pairs of source and target domains in a common feature space. However, it is often very hard to extract the same domain-invariant representations for all domains in MUDA. In addition, these methods match distributions without considering domain-specific decision boundaries between classes. To solve these problems, we propose a new framework with two alignment stages for MUDA which not only respectively aligns the distributions of each pair of source and target domains in multiple specific feature spaces, but also aligns the outputs of classifiers by utilizing the domain-specific decision boundaries. Extensive experiments demonstrate that our method can achieve remarkable results on popular benchmark datasets for image classification.
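The pairwise distribution alignment described above is commonly implemented with a Maximum Mean Discrepancy (MMD) term per source-target pair. The sketch below uses the simplest linear-kernel MMD (squared distance between feature means); the paper's exact alignment loss and the per-pair feature extractors are not reproduced, so treat this as an illustration of the general technique.

```python
import numpy as np

def linear_mmd(X, Y):
    """Squared MMD with a linear kernel: the squared Euclidean distance
    between the feature means of two samples. Zero when the two sets
    share the same mean, growing as their distributions drift apart."""
    return float(np.sum((X.mean(axis=0) - Y.mean(axis=0)) ** 2))

def multi_source_alignment_loss(source_feats, target_feats):
    """Sum of pairwise alignment terms, one per source domain.

    source_feats : list of (n_i, d) arrays, one per source domain
    target_feats : (m, d) array of target features (here assumed to
                   already live in each pair's specific feature space)
    """
    return sum(linear_mmd(Xs, target_feats) for Xs in source_feats)
```

In a full MUDA pipeline each term would be computed on features from that pair's own extractor and minimized jointly with the classification loss, pulling every source distribution toward the target rather than forcing all domains into one shared space.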


Author(s):  
Si Wu ◽  
Jian Zhong ◽  
Wenming Cao ◽  
Rui Li ◽  
Zhiwen Yu ◽  
...  

For unsupervised domain adaptation, the process of learning domain-invariant representations could be dominated by the labeled source data, such that the specific characteristics of the target domain may be ignored. In order to improve the performance in inferring target labels, we propose a target-specific network which is capable of learning collaboratively with a domain adaptation network, instead of directly minimizing domain discrepancy. A clustering regularization is also utilized to improve the generalization capability of the target-specific network by forcing target data points to be close to accumulated class centers. As this network learns and specializes to the target domain, its performance in inferring target labels improves, which in turn facilitates the learning process of the adaptation network. Therefore, there is a mutually beneficial relationship between these two networks. We perform extensive experiments on multiple digit and object datasets, and the effectiveness and superiority of the proposed approach are demonstrated on multiple visual adaptation benchmarks, e.g., we improve the state-of-the-art on the task of MNIST→SVHN from 76.5% to 84.9% without specific augmentation.
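The clustering regularization, forcing target features toward accumulated class centers, can be sketched with an exponential-moving-average center bank. The class names, the EMA decay value and the use of predicted labels below are assumptions for illustration, not the paper's exact formulation.

```python
import numpy as np

class ClusterRegularizer:
    """Sketch of a clustering regularization: per-class centers are
    accumulated with an exponential moving average over training, and
    the penalty is the mean squared distance from each target feature
    to the center of its (predicted) class."""

    def __init__(self, n_classes, dim, momentum=0.9):
        self.centers = np.zeros((n_classes, dim))
        self.momentum = momentum

    def update(self, feats, labels):
        """Accumulate class centers from the current batch."""
        for c in np.unique(labels):
            batch_mean = feats[labels == c].mean(axis=0)
            self.centers[c] = (self.momentum * self.centers[c]
                               + (1 - self.momentum) * batch_mean)

    def loss(self, feats, labels):
        """Mean squared distance of each feature to its class center."""
        diffs = feats - self.centers[labels]
        return float(np.mean(np.sum(diffs ** 2, axis=1)))
```

Minimizing this loss through the feature extractor pulls target samples of the same (predicted) class together, which is what lets the target-specific network sharpen its decision regions as the centers stabilize.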

