Enhancing Semantic Understanding with Self-Supervised Methods for Abstractive Dialogue Summarization

Adversarial training for few-shot text classification

Intelligenza Artificiale ◽

10.3233/ia-200051 ◽

2021 ◽

Vol 14 (2) ◽

pp. 201-214

Author(s):

Danilo Croce ◽

Giuseppe Castellucci ◽

Roberto Basili

Keyword(s):

Supervised Learning ◽

Language Processing ◽

Reproducing Kernel ◽

Generative Adversarial Networks ◽

Training Material ◽

Semantic Classification ◽

Universal Sentence ◽

Kernel Hilbert Spaces ◽

Supervised Methods ◽

Low Dimensional

In recent years, Deep Learning methods have become very popular in classification tasks for Natural Language Processing (NLP); this is mainly due to their ability to reach high performances by relying on very simple input representations, i.e., raw tokens. One of the drawbacks of deep architectures is the large amount of annotated data required for an effective training. Usually, in Machine Learning this problem is mitigated by the usage of semi-supervised methods or, more recently, by using Transfer Learning, in the context of deep architectures. One recent promising method to enable semi-supervised learning in deep architectures has been formalized within Semi-Supervised Generative Adversarial Networks (SS-GANs) in the context of Computer Vision. In this paper, we adopt the SS-GAN framework to enable semi-supervised learning in the context of NLP. We demonstrate how an SS-GAN can boost the performances of simple architectures when operating in expressive low-dimensional embeddings; these are derived by combining the unsupervised approximation of linguistic Reproducing Kernel Hilbert Spaces and the so-called Universal Sentence Encoders. We experimentally evaluate the proposed approach over a semantic classification task, i.e., Question Classification, by considering different sizes of training material and different numbers of target classes. By applying such adversarial schema to a simple Multi-Layer Perceptron, a classifier trained over a subset derived from 1% of the original training material achieves 92% of accuracy. Moreover, when considering a complex classification schema, e.g., involving 50 classes, the proposed method outperforms state-of-the-art alternatives such as BERT.

Download Full-text

Real-World Person Re-Identification via Super-Resolution and Semi-Supervised Methods

IEEE Access ◽

10.1109/access.2021.3063000 ◽

2021 ◽

Vol 9 ◽

pp. 35834-35845

Author(s):

Limin Xia ◽

Jiahui Zhu ◽

Zhimin Yu

Keyword(s):

Real World ◽

Super Resolution ◽

Supervised Methods

Download Full-text

A Survey on Contrastive Self-Supervised Learning

Technologies ◽

10.3390/technologies9010002 ◽

2020 ◽

Vol 9 (1) ◽

pp. 2

Author(s):

Ashish Jaiswal ◽

Ashwin Ramesh Babu ◽

Mohammad Zaki Zadeh ◽

Debapriya Banerjee ◽

Fillia Makedon

Keyword(s):

Computer Vision ◽

Supervised Learning ◽

Language Processing ◽

Large Scale ◽

Performance Comparison ◽

Extensive Review ◽

Future Directions ◽

Dominant Component ◽

Supervised Methods ◽

The Cost

Self-supervised learning has gained popularity because of its ability to avoid the cost of annotating large-scale datasets. It is capable of adopting self-defined pseudolabels as supervision and use the learned representations for several downstream tasks. Specifically, contrastive learning has recently become a dominant component in self-supervised learning for computer vision, natural language processing (NLP), and other domains. It aims at embedding augmented versions of the same sample close to each other while trying to push away embeddings from different samples. This paper provides an extensive review of self-supervised methods that follow the contrastive approach. The work explains commonly used pretext tasks in a contrastive learning setup, followed by different architectures that have been proposed so far. Next, we present a performance comparison of different methods for multiple downstream tasks such as image classification, object detection, and action recognition. Finally, we conclude with the limitations of the current methods and the need for further techniques and future directions to make meaningful progress.

Download Full-text

Semi-Supervised Methods to Identify Individual Crowns of Lowland Tropical Canopy Species Using Imaging Spectroscopy and LiDAR

Remote Sensing ◽

10.3390/rs4082457 ◽

2012 ◽

Vol 4 (8) ◽

pp. 2457-2476 ◽

Cited By ~ 33

Author(s):

Jean-Baptiste Féret ◽

Gregory P. Asner

Keyword(s):

Imaging Spectroscopy ◽

Supervised Methods ◽

Canopy Species

Download Full-text

Analysis of Neural Machine Translation KANGRI Language by Unsupervised and Semi Supervised Methods

IETE Journal of Research ◽

10.1080/03772063.2021.2016506 ◽

2022 ◽

pp. 1-11

Author(s):

Shweta Chauhan ◽

Shefali Saxena ◽

Philemon Daniel

Keyword(s):

Machine Translation ◽

Neural Machine Translation ◽

Supervised Methods

Download Full-text

Multiple Saliency and Channel Sensitivity Network for Aggregated Convolutional Feature

Proceedings of the AAAI Conference on Artificial Intelligence ◽

10.1609/aaai.v33i01.33019013 ◽

2019 ◽

Vol 33 ◽

pp. 9013-9020

Author(s):

Xuanlu Xiang ◽

Zhipeng Wang ◽

Zhicheng Zhao ◽

Fei Su

Keyword(s):

State Of The Art ◽

Image Representation ◽

Training Data ◽

Gram Matrix ◽

Redundant Information ◽

Deep Architecture ◽

Benchmark Datasets ◽

Supervised Methods ◽

Ranking Loss ◽

Effective Channel

In this paper, aiming at two key problems of instance-level image retrieval, i.e., the distinctiveness of image representation and the generalization ability of the model, we propose a novel deep architecture - Multiple Saliency and Channel Sensitivity Network(MSCNet). Specifically, to obtain distinctive global descriptors, an attention-based multiple saliency learning is first presented to highlight important details of the image, and then a simple but effective channel sensitivity module based on Gram matrix is designed to boost the channel discrimination and suppress redundant information. Additionally, in contrast to most existing feature aggregation methods, employing pre-trained deep networks, MSCNet can be trained in two modes: the first one is an unsupervised manner with an instance loss, and another is a supervised manner, which combines classification and ranking loss and only relies on very limited training data. Experimental results on several public benchmark datasets, i.e., Oxford buildings, Paris buildings and Holidays, indicate that the proposed MSCNet outperforms the state-of-the-art unsupervised and supervised methods.

Download Full-text

A study on cross-language text summarization using supervised methods

2009 International Conference on Natural Language Processing and Knowledge Engineering ◽

10.1109/nlpke.2009.5313809 ◽

2009 ◽

Cited By ~ 4

Author(s):

Lei Yu ◽

Fuji Ren

Keyword(s):

Text Summarization ◽

Supervised Methods ◽

Cross Language ◽

Language Text

Download Full-text

Social network extraction based on Web: A Review about Supervised Methods

Journal of Physics Conference Series ◽

10.1088/1742-6596/1898/1/012046 ◽

2021 ◽

Vol 1898 (1) ◽

pp. 012046

Author(s):

Mahyuddin K. M. Nasution ◽

Shahrul Azman Noah

Keyword(s):

Social Network ◽

Supervised Methods

Download Full-text

3DLEB-Net: Label-Efficient Deep Learning-Based Semantic Segmentation of Building Point Clouds at LoD3 Level

Applied Sciences ◽

10.3390/app11198996 ◽

2021 ◽

Vol 11 (19) ◽

pp. 8996

Author(s):

Yuwei Cao ◽

Marco Scaioni

Keyword(s):

Deep Learning ◽

Semantic Segmentation ◽

Point Clouds ◽

Training Data ◽

Second Step ◽

Point Cloud Data ◽

Dynamic Graph ◽

Cloud Data ◽

Supervised Methods ◽

Global And Local

In current research, fully supervised Deep Learning (DL) techniques are employed to train a segmentation network to be applied to point clouds of buildings. However, training such networks requires large amounts of fine-labeled buildings’ point-cloud data, presenting a major challenge in practice because they are difficult to obtain. Consequently, the application of fully supervised DL for semantic segmentation of buildings’ point clouds at LoD3 level is severely limited. In order to reduce the number of required annotated labels, we proposed a novel label-efficient DL network that obtains per-point semantic labels of LoD3 buildings’ point clouds with limited supervision, named 3DLEB-Net. In general, it consists of two steps. The first step (Autoencoder, AE) is composed of a Dynamic Graph Convolutional Neural Network (DGCNN) encoder and a folding-based decoder. It is designed to extract discriminative global and local features from input point clouds by faithfully reconstructing them without any label. The second step is the semantic segmentation network. By supplying a small amount of task-specific supervision, a segmentation network is proposed for semantically segmenting the encoded features acquired from the pre-trained AE. Experimentally, we evaluated our approach based on the Architectural Cultural Heritage (ArCH) dataset. Compared to the fully supervised DL methods, we found that our model achieved state-of-the-art results on the unseen scenes, with only 10% of labeled training data from fully supervised methods as input. Moreover, we conducted a series of ablation studies to show the effectiveness of the design choices of our model.

Download Full-text

Zero-Shot Feature Selection via Transferring Supervised Knowledge

International Journal of Data Warehousing and Mining ◽

10.4018/ijdwm.2021040101 ◽

2021 ◽

Vol 17 (2) ◽

pp. 1-20

Author(s):

Zheng Wang ◽

Qiao Wang ◽

Tingzhang Zhao ◽

Chaokun Wang ◽

Xiaojun Ye

Keyword(s):

Machine Learning ◽

Feature Selection ◽

Dimensionality Reduction ◽

Real World ◽

Rapid Growth ◽

Learning Systems ◽

Training Data ◽

Effective Technique ◽

Supervised Methods ◽

Real World Datasets

Feature selection, an effective technique for dimensionality reduction, plays an important role in many machine learning systems. Supervised knowledge can significantly improve the performance. However, faced with the rapid growth of newly emerging concepts, existing supervised methods might easily suffer from the scarcity and validity of labeled data for training. In this paper, the authors study the problem of zero-shot feature selection (i.e., building a feature selection model that generalizes well to “unseen” concepts with limited training data of “seen” concepts). Specifically, they adopt class-semantic descriptions (i.e., attributes) as supervision for feature selection, so as to utilize the supervised knowledge transferred from the seen concepts. For more reliable discriminative features, they further propose the center-characteristic loss which encourages the selected features to capture the central characteristics of seen concepts. Extensive experiments conducted on various real-world datasets demonstrate the effectiveness of the method.

Download Full-text