Birdsong Phrase Verification and Classification Using Siamese Neural Networks
Bird vocalizations have been the focus of a wide variety of interdisciplinary studies in bioacoustics and neuroethology, since they serve as models of motor control, learning, and auditory perception. Yet researchers have only begun to shed light on the structure and function of birdsong. Hypotheses abound, but there is still little agreement as to how songs should be analyzed. One of the main challenges is classifying the acoustic units (syllables) in birdsong recordings, a task that requires robust classification algorithms capable of generalizing to unseen instances and coping with data scarcity. Systematically detecting changes in syllable repertoires can help biologists understand the origin and evolution of birdsong. Learning good features to discriminate among numerous, distinct sound classes is computationally expensive; moreover, acceptable performance may be impossible to achieve when training data are scarce and classes are imbalanced. To address this issue, we pose a few-shot learning task in which an algorithm must make predictions given only a few instances of each class. We compared the performance of different Siamese Neural Networks at metric learning over the set of Cassin's Vireo syllables, and then reused the learned network features for the few-shot classification task. With this approach we overcame the limitations of data scarcity and class imbalance while achieving state-of-the-art performance.
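The two-stage approach described above can be illustrated with a minimal sketch: a Siamese network applies one shared embedding to both inputs, is trained with a pairwise (contrastive) loss so that same-class syllables embed close together, and the resulting distance is then reused to classify a query from only a few labelled support examples. All dimensions, weights, and function names below are hypothetical placeholders, not the paper's actual architecture.

```python
import math
import random

random.seed(0)

# Hypothetical sizes for illustration: 8-dim input features, 4-dim embeddings.
IN_DIM, EMB_DIM = 8, 4

# Shared weights: both branches of a Siamese network apply the SAME
# parameters to each input, so the learned distance is symmetric.
W = [[random.gauss(0, 1) for _ in range(IN_DIM)] for _ in range(EMB_DIM)]

def embed(x):
    """Shared branch: a single linear layer with ReLU, purely illustrative."""
    return [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in W]

def distance(x1, x2):
    """Euclidean distance between the two branch embeddings."""
    e1, e2 = embed(x1), embed(x2)
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(e1, e2)))

def contrastive_loss(x1, x2, same_class, margin=1.0):
    """Pull same-class pairs together; push different-class pairs
    at least `margin` apart."""
    d = distance(x1, x2)
    if same_class:
        return 0.5 * d ** 2
    return 0.5 * max(0.0, margin - d) ** 2

def nearest_support(query, support):
    """Few-shot reuse of the metric: label a query syllable with the label
    of its nearest support example. `support` is a list of (vector, label)."""
    return min(support, key=lambda s: distance(query, s[0]))[1]
```

In practice the embedding would be a trained deep network and the support set would hold the few labelled instances per class; the point of the sketch is that the classifier itself needs no retraining, only distances under the learned metric.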