Does BERT need domain adaptation for clinical negation detection?

2020
Vol 27 (4)
pp. 584-591
Author(s):  
Chen Lin
Steven Bethard
Dmitriy Dligach
Farig Sadeque
Guergana Savova
...  

Abstract

Introduction: Classifying whether concepts in unstructured clinical text are negated is an important, unsolved task. New domain adaptation and transfer learning methods can potentially address this issue.

Objective: We examine neural unsupervised domain adaptation methods, introducing a novel combination of domain adaptation with transformer-based transfer learning methods to improve negation detection. We also want to better understand the interaction between the widely used bidirectional encoder representations from transformers (BERT) system and domain adaptation methods.

Materials and Methods: We use 4 clinical text datasets that are annotated with negation status. We evaluate a neural unsupervised domain adaptation algorithm and BERT, a transformer-based model that is pretrained on massive general text datasets. We develop an extension to BERT that uses domain adversarial training, a neural domain adaptation method that adds an objective to the negation task: the classifier should not be able to distinguish between instances from 2 different domains.

Results: The domain adaptation methods we describe show positive results, but, on average, the best performance is obtained by plain BERT (without the extension). We provide evidence that the gains from BERT are likely not additive with the gains from domain adaptation.

Discussion: Our results suggest that, at least for the task of clinical negation detection, BERT subsumes domain adaptation, implying that BERT already learns very general representations of negation phenomena, such that fine-tuning even on a specific corpus does not lead to much overfitting.

Conclusion: Despite being trained on nonclinical text, models like BERT, thanks to their large training sets, deliver large performance gains on the clinical negation detection task.
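To make the domain-adversarial extension concrete, here is a minimal PyTorch sketch of how a gradient reversal layer can be bolted onto a BERT-style encoder. It is an illustration under assumptions, not the authors' code: the names (`GradReverse`, `AdversarialNegationModel`), the [CLS]-pooling choice, and the loss weight `lambda_` are hypothetical stand-ins.

```python
import torch
import torch.nn as nn

class GradReverse(torch.autograd.Function):
    """Identity on the forward pass; flips the gradient sign on the backward pass."""
    @staticmethod
    def forward(ctx, x, lambda_):
        ctx.lambda_ = lambda_
        return x.clone()

    @staticmethod
    def backward(ctx, grad_output):
        # Reversed (and scaled) gradient flows back into the encoder
        return -ctx.lambda_ * grad_output, None

class AdversarialNegationModel(nn.Module):
    def __init__(self, encoder, hidden_size=768, num_domains=2, lambda_=1.0):
        super().__init__()
        self.encoder = encoder  # e.g., a pretrained transformers.BertModel
        self.lambda_ = lambda_
        self.negation_head = nn.Linear(hidden_size, 2)          # negated vs. not negated
        self.domain_head = nn.Linear(hidden_size, num_domains)  # which corpus?

    def forward(self, input_ids, attention_mask):
        # Use the [CLS] representation as the instance encoding
        h = self.encoder(input_ids, attention_mask=attention_mask).last_hidden_state[:, 0]
        negation_logits = self.negation_head(h)
        # Gradient reversal pushes the encoder toward domain-invariant features
        domain_logits = self.domain_head(GradReverse.apply(h, self.lambda_))
        return negation_logits, domain_logits
```

Training minimizes the sum of the negation loss and the domain loss; because of the reversed gradient, the encoder is simultaneously pushed to maximize domain confusion.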

Sensors
2021
Vol 21 (24)
pp. 8475
Author(s):  
Yuh-Shyan Chen
Yu-Chi Chang
Chun-Yu Li

Human activity recognition without equipment plays a vital role in smart home applications, freeing humans from the shackles of wearable devices. In this paper, semi-supervised transfer learning with dynamic associate domain adaptation is proposed for human activity recognition using the channel state information (CSI) of the WiFi signal. To improve CSI quality and denoise the signal, we carry out missing-packet filling, burst-noise removal, background estimation, feature extraction, feature enhancement, and data augmentation in the data pre-processing stage. This paper considers the problem of environment-independent human activity recognition, also known as domain adaptation. The pre-trained model is trained on the source domain, for which a fully labeled dataset of the CSI of human activity patterns is collected. The pre-trained model is then transferred to the target environment through the semi-supervised transfer learning stage, so when humans move to a different target domain, only a partially labeled dataset from that domain is required for fine-tuning. We propose a dynamic associate domain adaptation scheme, called DADA. By modifying the existing associate domain adaptation algorithm, DADA lets the target domain provide a dynamic ratio of labeled to unlabeled data, whereas the existing algorithm only allows target domains with a fully unlabeled dataset. The advantage of DADA is that it provides a dynamic strategy to counter the differing effects of different environments. In addition, we designed an attention-based DenseNet model, or AD, as our training network; it modifies an existing DenseNet by adding an attention function. The combined solution is abbreviated as DADA-AD throughout the paper. The experimental results show that, for adaptation across different domains, the DADA-AD scheme achieves a human activity recognition accuracy of 97.4%. They also show that DADA-AD has advantages over existing semi-supervised learning schemes.
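As a rough illustration of the attention-based DenseNet ("AD") idea, the sketch below attaches a generic squeeze-and-excitation style channel-attention block to a DenseNet backbone. The attention design, the DenseNet-121 choice, and the 6 activity classes are assumptions for illustration; the paper's exact attention function and CSI input shape are not specified in the abstract.

```python
import torch.nn as nn
import torchvision

class ChannelAttention(nn.Module):
    """Squeeze-and-excitation style block: reweights feature channels."""
    def __init__(self, channels, reduction=16):
        super().__init__()
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.fc = nn.Sequential(
            nn.Linear(channels, channels // reduction), nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels), nn.Sigmoid())

    def forward(self, x):
        b, c, _, _ = x.shape
        w = self.fc(self.pool(x).view(b, c)).view(b, c, 1, 1)
        return x * w  # channel-wise recalibration

# Attention bolted onto a DenseNet backbone for activity classification
backbone = torchvision.models.densenet121(weights=None)
model = nn.Sequential(
    backbone.features,            # DenseNet-121 feature extractor (1024 channels out)
    ChannelAttention(1024),
    nn.ReLU(inplace=True),
    nn.AdaptiveAvgPool2d(1),
    nn.Flatten(),
    nn.Linear(1024, 6))           # e.g., 6 activity classes
```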


Information
2020
Vol 11 (2)
pp. 68
Author(s):  
Liquan Zhao
Yan Liu

Transfer learning is used to extend an existing model to more difficult scenarios, thereby accelerating the training process and improving learning performance. The conditional adversarial domain adaptation method proposed in 2018 is a particular type of transfer learning: it uses a domain discriminator to identify which domain the extracted features belong to, the features being obtained from a feature extraction network. The stability of the domain discriminator directly affects classification accuracy. Here, we propose a new algorithm to improve predictive accuracy. First, we introduce the Lipschitz constraint into domain adaptation; if the constraint is satisfied, the method is stable. Second, we analyze how to make the gradient satisfy the constraint, thereby deriving a modified gradient via the spectrum regularization method. The modified gradient is then used to update the parameter matrix. The proposed method is compared to the ResNet-50, deep adaptation network, domain adversarial neural network, joint adaptation network, and conditional domain adversarial network methods on the Office-31, ImageCLEF-DA, and Office-Home datasets. The experiments demonstrate that the proposed method achieves better accuracy than the other methods.
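PyTorch ships a built-in spectral-normalization hook that realizes exactly this kind of Lipschitz constraint on a discriminator's weight matrices. The sketch below is a generic example of constraining a domain discriminator this way, with illustrative layer sizes; the paper's precise architecture is not given in the abstract.

```python
import torch.nn as nn
from torch.nn.utils import spectral_norm

# Each linear map is spectrally normalized, so its largest singular value is ~1
# and the layer is approximately 1-Lipschitz; composing with 1-Lipschitz ReLU
# activations keeps the whole discriminator Lipschitz-constrained.
discriminator = nn.Sequential(
    spectral_norm(nn.Linear(256, 1024)), nn.ReLU(inplace=True),
    spectral_norm(nn.Linear(1024, 1024)), nn.ReLU(inplace=True),
    spectral_norm(nn.Linear(1024, 1)),  # source-vs-target logit
)
```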


Author(s):  
Wen Xu
Jing He
Yanfeng Shu

Transfer learning is an emerging technique in machine learning by which a new task can be solved with knowledge obtained from an old task, addressing the lack of labeled data. In particular, deep domain adaptation (a branch of transfer learning) has received the most attention in recently published articles. The intuition behind this is that deep neural networks usually have a large capacity to learn representations from one dataset, and part of that information can be reused for a new task. In this research, we first present the complete set of transfer learning scenarios according to their domains and tasks. Second, we conduct a comprehensive survey of deep domain adaptation and categorize the recent advances into three types based on the implementation approach: fine-tuning networks, adversarial domain adaptation, and sample-reconstruction approaches. Third, we discuss the details of these methods and introduce some typical real-world applications. Finally, we conclude our work and explore some potential issues to be further addressed.
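Of the three categories, fine-tuning is the simplest to illustrate. The sketch below is a generic example, not drawn from the survey itself: the ResNet-50 backbone and the 10 target classes are assumptions. It freezes a pretrained representation and retrains only a new task head.

```python
import torch
import torch.nn as nn
import torchvision

# Fine-tuning: reuse a network pretrained on the source task and retrain
# only a new task-specific head on the (smaller) target dataset.
model = torchvision.models.resnet50(weights="IMAGENET1K_V2")
for p in model.parameters():
    p.requires_grad = False                        # freeze the pretrained body
model.fc = nn.Linear(model.fc.in_features, 10)     # new head: 10 target classes
optimizer = torch.optim.Adam(model.fc.parameters(), lr=1e-3)
```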


Author(s):  
Yuan Zhang
Regina Barzilay
Tommi Jaakkola

We introduce a neural method for transfer learning between two (source and target) classification tasks or aspects over the same domain. Rather than training on target labels, we use a few keywords pertaining to the source and target aspects, which indicate sentence relevance rather than document class labels. Documents are encoded by learning to embed and softly select relevant sentences in an aspect-dependent manner. A shared classifier is trained on the encoded source documents and their labels, and then applied to the encoded target documents. We ensure transfer through aspect-adversarial training, so that encoded documents are, as sets, aspect-invariant. Experimental results demonstrate that our approach outperforms different baselines and model variants on two datasets, yielding an improvement of 27% on a pathology dataset and 5% on a review dataset.
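The sketch below illustrates the soft, aspect-dependent sentence selection at the heart of this encoder. It assumes sentence embeddings and an aspect embedding have already been computed; the class name and the linear relevance scorer are illustrative stand-ins for the paper's keyword-supervised module.

```python
import torch
import torch.nn as nn

class AspectDocumentEncoder(nn.Module):
    """Softly select sentences relevant to a given aspect, then pool them."""
    def __init__(self, sent_dim, aspect_dim):
        super().__init__()
        self.relevance = nn.Linear(sent_dim + aspect_dim, 1)  # relevance scorer

    def forward(self, sent_embs, aspect_emb):
        # sent_embs: (num_sentences, sent_dim); aspect_emb: (aspect_dim,)
        a = aspect_emb.expand(sent_embs.size(0), -1)
        scores = self.relevance(torch.cat([sent_embs, a], dim=-1)).squeeze(-1)
        weights = torch.softmax(scores, dim=0)        # soft sentence selection
        return (weights.unsqueeze(-1) * sent_embs).sum(dim=0)  # document vector
```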


2020
Vol 34 (05)
pp. 7618-7625
Author(s):  
Yong Dai
Jian Liu
Xiancong Ren
Zenglin Xu

Multi-source unsupervised domain adaptation (MS-UDA) for sentiment analysis (SA) aims to leverage useful information in multiple source domains to perform SA in an unlabeled target domain that has no supervised information. Existing MS-UDA algorithms either exploit only the shared features, i.e., the domain-invariant information, or rely on weak assumptions in NLP, e.g., the smoothness assumption. To avoid these problems, we propose two transfer learning frameworks based on the multi-source domain adaptation methodology for SA, combining the source hypotheses to derive a good target hypothesis. The key feature of the first framework is a novel Weighting Scheme based Unsupervised Domain Adaptation framework (WS-UDA), which combines the source classifiers to acquire pseudo labels for target instances directly. The second framework is a Two-Stage Training based Unsupervised Domain Adaptation framework (2ST-UDA), which further exploits these pseudo labels to train a target private extractor. Importantly, the weights assigned to each source classifier are based on the relations between target instances and source domains, which are measured by a discriminator through adversarial training. Furthermore, through the same discriminator, we also achieve the separation of shared features and private features. Experimental results on two SA datasets demonstrate the promising performance of our frameworks, which outperform state-of-the-art unsupervised competitors.
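A minimal sketch of the WS-UDA idea of combining source classifiers into pseudo labels follows. It assumes the instance-wise source weights have already been produced by the adversarial discriminator; the function name and tensor layout are illustrative, not the authors' implementation.

```python
import torch

def pseudo_label(target_feats, source_classifiers, domain_weights):
    """Mix source classifiers, weighted per target instance, into pseudo labels.

    target_feats:       (n, d) target-domain features
    source_classifiers: list of k modules mapping (n, d) -> (n, c) logits
    domain_weights:     (n, k) affinity of each target sample to each source,
                        as produced by the adversarial discriminator
    """
    probs = torch.stack(
        [clf(target_feats).softmax(dim=-1) for clf in source_classifiers], dim=1)  # (n, k, c)
    mixed = (domain_weights.unsqueeze(-1) * probs).sum(dim=1)  # (n, c)
    return mixed.argmax(dim=-1)                                # hard pseudo labels
```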


Algorithms
2019
Vol 12 (5)
pp. 96
Author(s):  
Imad Eddine Ibrahim Bekkouch
Youssef Youssry
Rustam Gafarov
Adil Khan
Asad Masood Khattak

Domain adaptation is a sub-field of transfer learning that aims at bridging the dissimilarity gap between domains by transferring and re-using the knowledge obtained in the source domain in the target domain. Many methods have been proposed to solve this problem, using techniques such as generative adversarial networks (GANs), but the complexity of such methods makes it hard to apply them to different problems, as fine-tuning such networks is usually a time-consuming task. In this paper, we propose a method for unsupervised domain adaptation that is both simple and effective. Our model (referred to as TripNet) harnesses the idea of a discriminator and Linear Discriminant Analysis (LDA) to push the encoder to generate domain-invariant features that are category-informative. At the same time, pseudo-labelling is used on the target data to train the classifier and to bring the same classes from both domains together. We evaluate TripNet against several existing, state-of-the-art methods on three image classification tasks: digit classification (MNIST, SVHN, and USPS datasets), object recognition (Office31 dataset), and traffic sign recognition (GTSRB and Synthetic Signs datasets). Our experimental results demonstrate that (i) TripNet beats almost all existing methods of comparable simplicity on all of these tasks; and (ii) in some cases it even beats methods that are significantly more complex (or harder to train) than TripNet. Hence, the results confirm the effectiveness of TripNet for unsupervised domain adaptation in image classification.
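The pseudo-labelling step can be sketched generically as follows: predict on the unlabeled target data and keep only the confident predictions for retraining. This is a stand-in under assumptions, not TripNet's exact procedure; the function name and the 0.95 confidence threshold are illustrative.

```python
import torch

def confident_pseudo_labels(model, target_loader, threshold=0.95, device="cpu"):
    """Keep only target samples whose top predicted probability clears a threshold."""
    model.eval()
    kept_x, kept_y = [], []
    with torch.no_grad():
        for x in target_loader:          # unlabeled target batches
            x = x.to(device)
            probs = model(x).softmax(dim=-1)
            conf, pred = probs.max(dim=-1)
            mask = conf > threshold
            kept_x.append(x[mask])
            kept_y.append(pred[mask])
    return torch.cat(kept_x), torch.cat(kept_y)
```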


Author(s):  
Xiaobin Chang
Yongxin Yang
Tao Xiang
Timothy M. Hospedales

In this paper, a unified approach to transfer learning is presented that addresses several source- and target-domain label-space and annotation assumptions with a single model. It is particularly effective in the challenging case where source and target label-spaces are disjoint, and it outperforms alternatives in both unsupervised and semi-supervised settings. The key ingredient is a common representation termed the Common Factorised Space. It is shared between source and target domains, and trained with an unsupervised factorisation loss and a graph-based loss. With a wide range of experiments, we demonstrate the flexibility, relevance and efficacy of our method, both in the challenging cases with disjoint label spaces and in more conventional cases such as unsupervised domain adaptation, where the source and target domains share the same label-sets.
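The skeleton below shows the structural idea of a single encoder shared by both domains mapping into a common space. The abstract does not give the form of the factorisation loss, so the reconstruction objective here is an explicitly hypothetical stand-in, and the layer sizes are illustrative.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class CommonFactorisedSpace(nn.Module):
    """One encoder shared by source and target, mapping into a common space."""
    def __init__(self, in_dim, common_dim):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(in_dim, 512), nn.ReLU(),
                                     nn.Linear(512, common_dim))
        self.decoder = nn.Sequential(nn.Linear(common_dim, 512), nn.ReLU(),
                                     nn.Linear(512, in_dim))

    def unsupervised_loss(self, x):
        # Reconstruction through the shared space, used here as a stand-in for
        # the paper's factorisation loss (its exact form is not given above);
        # the graph-based loss over related samples would be added on top.
        z = self.encoder(x)
        return F.mse_loss(self.decoder(z), x)
```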

