Revisiting Superpixels for Active Learning in Semantic Segmentation with Realistic Annotation Costs

Author(s):  
Lile Cai
Xun Xu
Jun Hao Liew
Chuan Sheng Foo

Author(s):  
M. Kölle
V. Walter
S. Schmohl
U. Soergel

Abstract. Automated semantic interpretation of 3D point clouds is crucial for many tasks in the domain of geospatial data analysis. For this purpose, labeled training data is required, which often has to be provided manually by experts. One approach to minimizing the cost of human interaction is Active Learning (AL), which aims to process only the subset of an unlabeled dataset that is particularly helpful with respect to class separation: a machine identifies informative instances, which are then labeled by humans, thereby increasing the performance of the machine. To avoid involving an expert entirely, this time-consuming annotation can be outsourced via crowdsourcing. We therefore propose an approach combining AL with paid crowdsourcing. Although incorporating human interaction, our method can run fully automatically, so that only an unlabeled dataset and a fixed financial budget for paying the crowdworkers need to be provided. We conduct multiple iteration steps of the AL process on the ISPRS Vaihingen 3D Semantic Labeling benchmark dataset (V3D) and, in particular, evaluate the performance of the crowd when labeling 3D points. We prove our concept by using labels derived from our crowd-based AL method to classify the test dataset. The analysis shows that, with only 0.4% of the training dataset labeled by the crowd and less than $145 spent, both our trained Random Forest and our sparse 3D CNN classifier differ in Overall Accuracy by less than 3 percentage points from the same classifiers trained on the complete V3D training set.
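
The budget-constrained loop described above can be sketched in a few lines: a classifier is trained on the crowd labels collected so far, the most uncertain points are posted to the crowd, and the process repeats until the budget is exhausted. The sketch below is an illustration only; the per-label cost, batch size, and the `crowd_label_fn` oracle are hypothetical placeholders, and the paper's actual point-cloud features and crowd quality control are omitted.

```python
# Hypothetical sketch of budget-constrained active learning with crowd labels.
# X_pool: (n_points, n_features) array of per-point features (assumed given).
import numpy as np
from sklearn.ensemble import RandomForestClassifier

def entropy(proba):
    """Shannon entropy of per-point class probabilities (higher = less certain)."""
    p = np.clip(proba, 1e-12, 1.0)
    return -(p * np.log(p)).sum(axis=1)

def crowd_active_learning(X_pool, crowd_label_fn, budget_usd,
                          cost_per_label_usd=0.05, batch_size=100, seed=0):
    rng = np.random.default_rng(seed)
    # Seed the loop with a random batch labeled by the crowd.
    init = rng.choice(len(X_pool), size=batch_size, replace=False)
    y = {i: crowd_label_fn(i) for i in init}
    spent = len(y) * cost_per_label_usd
    clf = RandomForestClassifier(n_estimators=100, random_state=seed)
    while spent + batch_size * cost_per_label_usd <= budget_usd:
        idx = np.fromiter(y.keys(), dtype=int)
        clf.fit(X_pool[idx], np.fromiter(y.values(), dtype=int))
        unlabeled = np.setdiff1d(np.arange(len(X_pool)), idx)
        # Query the points whose predicted class distribution is most uncertain.
        scores = entropy(clf.predict_proba(X_pool[unlabeled]))
        query = unlabeled[np.argsort(scores)[-batch_size:]]
        for i in query:
            y[i] = crowd_label_fn(i)          # paid crowd annotation
        spent += batch_size * cost_per_label_usd
    return clf, y, spent
```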


2021
Author(s):
Benjamin Kellenberger
Devis Tuia
Dan Morris

Ecological research like wildlife censuses increasingly relies on data at the terabyte scale. For example, modern camera trap datasets contain millions of images that would require prohibitive amounts of manual labour to annotate with species, bounding boxes, and the like. Machine learning, especially deep learning [3], can greatly accelerate this task through automated predictions, but typically involves extensive coding and expert knowledge.

In this abstract we present AIDE, the Annotation Interface for Data-driven Ecology [2]. First, AIDE is a web-based annotation suite for image labelling with support for concurrent access and scalability, up to the cloud. Second, it tightly integrates deep learning models into the annotation process through active learning [7], where models learn from user-provided labels and in turn select the most relevant images for review from the large pool of unlabelled ones (Fig. 1). The result is a system where users only need to label what is required, which saves time and decreases errors due to fatigue.

[Fig. 1: AIDE offers concurrent web image labelling support and uses annotations and deep learning models in an active learning loop.]

AIDE includes a comprehensive set of built-in models, such as ResNet [1] for image classification, Faster R-CNN [5] and RetinaNet [4] for object detection, and U-Net [6] for semantic segmentation. All models can be customised and used without writing a single line of code. Furthermore, AIDE accepts any third-party model with minimal implementation requirements. To complete the package, AIDE offers user annotation and model prediction evaluation, access control, customisable model training, and more, all through the web browser.

AIDE is fully open source and available at https://github.com/microsoft/aerial_wildlife_detection.
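
The annotate-train-select loop in Fig. 1 can be illustrated with a short sketch. The function and method names below are hypothetical placeholders, not AIDE's actual API (the real interfaces are documented in the repository); the sketch also assumes the model starts from a pretrained state so that its first uncertainty estimates are meaningful.

```python
# Generic illustration of an annotate-train-select active learning loop.
# `model`, `request_labels_fn`, and their signatures are assumed placeholders.
import numpy as np

def least_confidence(proba):
    """1 - max class probability: higher means the model is less certain."""
    return 1.0 - proba.max(axis=1)

def annotation_loop(model, images, request_labels_fn, rounds=5, batch=32):
    labeled, labels = [], []
    pool = list(range(len(images)))
    for _ in range(rounds):
        if labeled:
            # Retrain on everything the annotators have labelled so far.
            model.fit([images[i] for i in labeled], labels)
        proba = np.stack([model.predict_proba(images[i]) for i in pool])
        # Surface the most uncertain images to the annotators first.
        ranked = [pool[j] for j in np.argsort(-least_confidence(proba))]
        query = ranked[:batch]
        labels += request_labels_fn(query)    # human annotation in the web UI
        labeled += query
        pool = [i for i in pool if i not in query]
    return model
```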


Author(s):  
Shaolei Wang
Zhongyuan Wang
Wanxiang Che
Sendong Zhao
Ting Liu

Spoken language differs fundamentally from written language in that it contains frequent disfluencies, i.e., parts of an utterance that are corrected by the speaker. Disfluency detection (removing these disfluencies) is desirable to clean the input for downstream NLP tasks. Most existing approaches to disfluency detection rely heavily on human-annotated data, which is scarce and expensive to obtain in practice. To tackle this training data bottleneck, we investigate methods for combining self-supervised learning and active learning for disfluency detection. First, we construct large-scale pseudo training data by randomly adding or deleting words from unlabeled data and propose two self-supervised pre-training tasks: (i) a tagging task to detect the added noisy words and (ii) a sentence classification task to distinguish original sentences from grammatically incorrect ones. We then combine these two tasks to jointly pre-train a neural network, which is subsequently fine-tuned on human-annotated disfluency detection data. The self-supervised learning method captures task-specific knowledge for disfluency detection and, when fine-tuned on a small annotated dataset, achieves better performance than other supervised methods. However, because the pseudo training data are generated by simple heuristics and cannot cover all disfluency patterns, a performance gap remains compared to supervised models trained on the full training dataset. We further explore how to bridge this gap by integrating active learning into the fine-tuning process. Active learning strives to reduce annotation costs by choosing the most critical examples to label, and can thus address the weakness of self-supervised learning with a small annotated dataset. We show that by combining self-supervised learning with active learning, our model matches state-of-the-art performance with only about 10% of the original training data on both the commonly used English Switchboard test set and a set of in-house annotated Chinese data.
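
A minimal sketch of the pseudo-data construction step is given below: words are randomly inserted from a small vocabulary or deleted, each surviving token is tagged as original (O) or added noise (ADD) for the tagging task, and a sentence-level edit flag serves the sentence classification task. The probabilities and the noise vocabulary are illustrative choices, not the paper's exact settings.

```python
# Illustrative pseudo-data generation for the two pre-training tasks.
import random

def make_pseudo_example(tokens, vocab, p_add=0.15, p_del=0.1, rng=random):
    """Return (noisy tokens, per-token tags, sentence-level 'was edited' label)."""
    noisy, tags = [], []
    for tok in tokens:
        if rng.random() < p_add:
            noisy.append(rng.choice(vocab))   # inserted noise word
            tags.append("ADD")                # positive tag for the tagging task
        if rng.random() < p_del:
            continue                          # silently delete the original word
        noisy.append(tok)
        tags.append("O")                      # original word kept
    edited = noisy != list(tokens)            # label for sentence classification
    return noisy, tags, edited

sentence = "i want to book a flight to boston".split()
noise_vocab = ["uh", "well", "the", "to", "i"]
print(make_pseudo_example(sentence, noise_vocab))
```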


Author(s):  
Yi-Fan Yan
Sheng-Jun Huang

Active learning reduces the labeling cost by actively querying labels for the most valuable data. It is particularly important for multi-label learning, where the annotation cost is rather high because each instance may have multiple labels simultaneously. In many multi-label tasks, the labels are organized into hierarchies from coarse to fine. The labels at different levels of the hierarchy contribute differently to the model training, and also have diverse annotation costs. In this paper, we propose a multi-label active learning approach to exploit the label hierarchies for cost-effective queries. By incorporating the potential contribution of ancestor and descendant labels, a novel criterion is proposed to estimate the informativeness of each candidate query. Further, a subset selection method is introduced to perform active batch selection by balancing the informativeness and cost of each instance-label pair. Experimental results validate the effectiveness of both the proposed criterion and the selection method.
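
To make the idea concrete, the sketch below scores each instance-label pair by its own uncertainty plus a discounted sum over the uncertainties of that label's ancestors and descendants, normalised by the label's annotation cost, and then greedily fills a batch under a budget. This scoring rule is an illustrative stand-in consistent with the description above, not the paper's exact criterion; `alpha`, the cost vector, and the hierarchy encoding are all assumptions.

```python
# Hedged sketch of cost-aware query selection over instance-label pairs
# in a label hierarchy.
import numpy as np

def pair_score(u, label, parents, children, cost, alpha=0.5):
    """u[l]: model uncertainty for label l on one instance; cost[l]: its price."""
    relatives = parents.get(label, []) + children.get(label, [])
    info = u[label] + alpha * sum(u[r] for r in relatives)
    return info / cost[label]

def select_batch(U, parents, children, cost, budget):
    """U: (n_instances, n_labels) uncertainty matrix. Greedily pick the
    instance-label pairs with the best informativeness-per-cost ratio
    until the annotation budget is exhausted."""
    pairs = [(i, l) for i in range(U.shape[0]) for l in range(U.shape[1])]
    pairs.sort(key=lambda p: pair_score(U[p[0]], p[1], parents, children, cost),
               reverse=True)
    chosen, spent = [], 0.0
    for i, l in pairs:
        if spent + cost[l] <= budget:
            chosen.append((i, l))
            spent += cost[l]
    return chosen
```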

