A semi-supervised learning detection method for vision-based monitoring of construction sites by integrating teacher-student networks and data augmentation

2021 ◽

Author(s):

Arjit Jain ◽

Pranay Reddy Samala ◽

Preethi Jyothi ◽

Deepak Mittal ◽

Maneesh Singh

Keyword(s):

Supervised Learning ◽

Data Augmentation ◽

State Of The Art ◽

Image Captioning ◽

Original Algorithm ◽

Teacher Student ◽

Depth Analysis ◽

Classification Tasks ◽

Multi Class Classification ◽

Structured Outputs

Recent semi-supervised learning (SSL) methods are predominantly focused on multi-class classification tasks. Classification tasks allow for easy mixing of class labels during augmentation which does not trivially extend to structured outputs such as word sequences that appear in tasks like image captioning. Noisy Student Training is a recent SSL paradigm proposed for image classification that is an extension of self-training and teacher-student learning. In this work, we provide an in-depth analysis of the noisy student SSL framework for the task of image captioning and derive state-of-the-art results. The original algorithm relies on computationally expensive data augmentation steps that involve perturbing the raw images and computing features for each perturbed image. We show that, even in the absence of raw image augmentation, the use of simple model and feature perturbations to the input images for the student model are beneficial to SSL training. We also show how a paraphrase generator could be effectively used for label augmentation to improve the quality of pseudo labels and significantly improve performance. Our final results in the limited labeled data setting (1% of the MS-COCO labeled data) outperform previous state-of-the-art approaches by 2.5 on BLEU4 and 11.5 on CIDEr scores.

Download Full-text

ClassMix: Segmentation-Based Data Augmentation for Semi-Supervised Learning

2021 IEEE Winter Conference on Applications of Computer Vision (WACV) ◽

10.1109/wacv48630.2021.00141 ◽

2021 ◽

Author(s):

Viktor Olsson ◽

Wilhelm Tranheden ◽

Juliano Pinto ◽

Lennart Svensson

Keyword(s):

Supervised Learning ◽

Data Augmentation

Download Full-text

Data augmentation and semi-supervised learning for deep neural networks-based text classifier

Proceedings of the 35th Annual ACM Symposium on Applied Computing ◽

10.1145/3341105.3373992 ◽

2020 ◽

Author(s):

Heereen Shim ◽

Stijn Luca ◽

Dietwig Lowet ◽

Bart Vanrumste

Keyword(s):

Neural Networks ◽

Supervised Learning ◽

Deep Neural Networks ◽

Data Augmentation

Download Full-text

Consistency regularization teacher–student semi-supervised learning method for target recognition in SAR images

The Visual Computer ◽

10.1007/s00371-021-02287-z ◽

2021 ◽

Author(s):

Ye Tian ◽

Liguo Zhang ◽

Jianguo Sun ◽

Guisheng Yin ◽

Yuxin Dong

Keyword(s):

Supervised Learning ◽

Target Recognition ◽

Learning Method ◽

Sar Images ◽

Teacher Student

Download Full-text

Simplifying the Supervised Learning of Kerr Nonlinearity Compensation Algorithms by Data Augmentation

2020 European Conference on Optical Communications (ECOC) ◽

10.1109/ecoc48923.2020.9333417 ◽

2020 ◽

Author(s):

Vladislav Neskorniuk ◽

Pedro J. Freire ◽

Antonio Napoli ◽

Bernhard Spinnler ◽

Wolfgang Schairer ◽

...

Keyword(s):

Supervised Learning ◽

Data Augmentation ◽

Kerr Nonlinearity ◽

Nonlinearity Compensation

Download Full-text

A review: preprocessing techniques and data augmentation for sentiment analysis

Computational Social Networks ◽

10.1186/s40649-020-00080-x ◽

2021 ◽

Vol 8 (1) ◽

Author(s):

Huu-Thanh Duong ◽

Tram-Anh Nguyen-Thi

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Supervised Learning ◽

Data Augmentation ◽

Original Data ◽

Training Data ◽

Unseen Data ◽

Augmentation Techniques ◽

User Intervention

AbstractIn literature, the machine learning-based studies of sentiment analysis are usually supervised learning which must have pre-labeled datasets to be large enough in certain domains. Obviously, this task is tedious, expensive and time-consuming to build, and hard to handle unseen data. This paper has approached semi-supervised learning for Vietnamese sentiment analysis which has limited datasets. We have summarized many preprocessing techniques which were performed to clean and normalize data, negation handling, intensification handling to improve the performances. Moreover, data augmentation techniques, which generate new data from the original data to enrich training data without user intervention, have also been presented. In experiments, we have performed various aspects and obtained competitive results which may motivate the next propositions.

Download Full-text

Impact of data augmentation on supervised learning for a moving mid-frequency source

The Journal of the Acoustical Society of America ◽

10.1121/10.0007284 ◽

2021 ◽

Vol 150 (5) ◽

pp. 3914-3928

Author(s):

J. A. Castro-Correa ◽

M. Badiey ◽

T. B. Neilsen ◽

D. P. Knobles ◽

W. S. Hodgkiss

Keyword(s):

Supervised Learning ◽

Data Augmentation ◽

Frequency Source

Download Full-text

Semi supervised inspection algorithm of automatic packaging curve based on deep learning

Journal of Computational Methods in Sciences and Engineering ◽

10.3233/jcm-215690 ◽

2021 ◽

pp. 1-10

Author(s):

Yong He

Keyword(s):

Deep Learning ◽

Supervised Learning ◽

Optimization Algorithm ◽

Posterior Probability ◽

Detection Method ◽

Detection System ◽

Experimental Results ◽

Detection Accuracy ◽

Data Set ◽

Packaging Process

The current automatic packaging process is complex, requires high professional knowledge, poor universality, and difficult to apply in multi-objective and complex background. In view of this problem, automatic packaging optimization algorithm has been widely paid attention to. However, the traditional automatic packaging detection accuracy is low, the practicability is poor. Therefore, a semi-supervised detection method of automatic packaging curve based on deep learning and semi-supervised learning is proposed. Deep learning is used to extract features and posterior probability to classify unlabeled data. KDD CUP99 data set was used to verify the accuracy of the algorithm. Experimental results show that this method can effectively improve the performance of automatic packaging curve semi-supervised detection system.

Download Full-text

Teacher/Student Deep Semi-Supervised Learning for Training with Noisy Labels

2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA) ◽

10.1109/icmla.2018.00147 ◽

2018 ◽

Author(s):

Zeyad Hailat ◽

Xue-Wen Chen

Keyword(s):

Supervised Learning ◽

Teacher Student ◽

Noisy Labels

Download Full-text

Self-Supervised Contextual Data Augmentation for Natural Language Processing

Symmetry ◽

10.3390/sym11111393 ◽

2019 ◽

Vol 11 (11) ◽

pp. 1393

Author(s):

Dongju Park ◽

Chang Wook Ahn

Keyword(s):

Supervised Learning ◽

Language Processing ◽

Recurrent Neural Networks ◽

Question Answering ◽

Data Augmentation ◽

Language Model ◽

Contextual Data ◽

External Data ◽

Label Information ◽

Benchmark Datasets

In this paper, we propose a novel data augmentation method with respect to the target context of the data via self-supervised learning. Instead of looking for the exact synonyms of masked words, the proposed method finds words that can replace the original words considering the context. For self-supervised learning, we can employ the masked language model (MLM), which masks a specific word within a sentence and obtains the original word. The MLM learns the context of a sentence through asymmetrical inputs and outputs. However, without using the existing MLM, we propose a label-masked language model (LMLM) that can include label information for the mask tokens used in the MLM to effectively use the MLM in data with label information. The augmentation method performs self-supervised learning using LMLM and then implements data augmentation through the trained model. We demonstrate that our proposed method improves the classification accuracy of recurrent neural networks and convolutional neural network-based classifiers through several experiments for text classification benchmark datasets, including the Stanford Sentiment Treebank-5 (SST5), the Stanford Sentiment Treebank-2 (SST2), the subjectivity (Subj), the Multi-Perspective Question Answering (MPQA), the Movie Reviews (MR), and the Text Retrieval Conference (TREC) datasets. In addition, since the proposed method does not use external data, it can eliminate the time spent collecting external data, or pre-training using external data.

Download Full-text

A semi-supervised learning detection method for vision-based monitoring of construction sites by integrating teacher-student networks and data augmentation

Perturb, Predict & Paraphrase: Semi-Supervised Learning using Noisy Student for Image Captioning

ClassMix: Segmentation-Based Data Augmentation for Semi-Supervised Learning

Data augmentation and semi-supervised learning for deep neural networks-based text classifier

Consistency regularization teacher–student semi-supervised learning method for target recognition in SAR images

Simplifying the Supervised Learning of Kerr Nonlinearity Compensation Algorithms by Data Augmentation

A review: preprocessing techniques and data augmentation for sentiment analysis

Impact of data augmentation on supervised learning for a moving mid-frequency source

Semi supervised inspection algorithm of automatic packaging curve based on deep learning

Teacher/Student Deep Semi-Supervised Learning for Training with Noisy Labels

Self-Supervised Contextual Data Augmentation for Natural Language Processing

Export Citation Format