Automated identification of chicken distress vocalisations using deep learning models

The annual global production of chickens exceeds 25 billion birds, and they are often housed in very large groups, numbering thousands. Distress calling triggered by various sources of stress has been suggested as an "iceberg indicator" of chicken welfare. However, to date, the identification of distress calls largely relies on manual annotations, which is very labour-intensive and time-consuming. Thus, a novel light-VGG11 was developed to automatically identify chicken distress calls using recordings (3,363 distress calls and 1,973 natural barn sounds) collected on intensive chicken farms. The light-VGG11 was modified from VGG11 with a significantly smaller size in parameters (9.3 million vs 128 million) and 55.88% faster detection speed while displaying comparable performance, i.e., precision (94.58%), recall (94.89%), F1-score (94.73%), and accuracy (95.07%), therefore more useful for model deployment in practice. To further improve the light-VGG11's performance, we investigated the impacts of different data augmentation techniques (i.e., time masking, frequency masking, mixed spectrograms of the same class, and Gaussian noise) and found that they could improve distress calls detection by up to 1.52%. In terms of precision livestock farming, our research opens new opportunities for developing technologies used to monitor the output of distress calls in large, commercial chicken flocks.

Download Full-text

Data Augmentation for Deep Learning based Cattle Segmentation in Precision Livestock Farming

2020 IEEE 16th International Conference on Automation Science and Engineering (CASE) ◽

10.1109/case48305.2020.9216758 ◽

2020 ◽

Author(s):

Yongliang Qiao ◽

Daobilige Su ◽

He Kong ◽

Salah Sukkarieh ◽

Sabrina Lomax ◽

...

Keyword(s):

Deep Learning ◽

Data Augmentation ◽

Livestock Farming ◽

Precision Livestock Farming

Download Full-text

The Data-augmentation Techniques in Item Response Modeling: Current Approaches and New Developments

Advances in Psychological Science ◽

10.3724/sp.j.1042.2014.01036 ◽

2014 ◽

Vol 22 (6) ◽

pp. 1036

Author(s):

Wei TIAN ◽

Tao XIN ◽

Chunhua KANG

Keyword(s):

Item Response ◽

Data Augmentation ◽

New Developments ◽

Item Response Modeling ◽

Augmentation Techniques ◽

Response Modeling

Download Full-text

Automatic Labeled Dialogue Generation for Nursing Record Systems

Journal of Personalized Medicine ◽

10.3390/jpm10030062 ◽

2020 ◽

Vol 10 (3) ◽

pp. 62

Author(s):

Tittaya Mairittha ◽

Nattaya Mairittha ◽

Sozo Inoue

Keyword(s):

Data Augmentation ◽

Short Term Memory ◽

Generative Models ◽

Abstract Knowledge ◽

Augmentation Techniques ◽

Nursing Record ◽

Long Short Term Memory ◽

The Individual ◽

High Level ◽

Embedding Methods

The integration of digital voice assistants in nursing residences is becoming increasingly important to facilitate nursing productivity with documentation. A key idea behind this system is training natural language understanding (NLU) modules that enable the machine to classify the purpose of the user utterance (intent) and extract pieces of valuable information present in the utterance (entity). One of the main obstacles when creating robust NLU is the lack of sufficient labeled data, which generally relies on human labeling. This process is cost-intensive and time-consuming, particularly in the high-level nursing care domain, which requires abstract knowledge. In this paper, we propose an automatic dialogue labeling framework of NLU tasks, specifically for nursing record systems. First, we apply data augmentation techniques to create a collection of variant sample utterances. The individual evaluation result strongly shows a stratification rate, with regard to both fluency and accuracy in utterances. We also investigate the possibility of applying deep generative models for our augmented dataset. The preliminary character-based model based on long short-term memory (LSTM) obtains an accuracy of 90% and generates various reasonable texts with BLEU scores of 0.76. Secondly, we introduce an idea for intent and entity labeling by using feature embeddings and semantic similarity-based clustering. We also empirically evaluate different embedding methods for learning good representations that are most suitable to use with our data and clustering tasks. Experimental results show that fastText embeddings produce strong performances both for intent labeling and on entity labeling, which achieves an accuracy level of 0.79 and 0.78 f1-scores and 0.67 and 0.61 silhouette scores, respectively.

Download Full-text

Generative Adversarial Networks to Improve the Robustness of Visual Defect Segmentation by Semantic Networks in Manufacturing Components

Applied Sciences ◽

10.3390/app11146368 ◽

2021 ◽

Vol 11 (14) ◽

pp. 6368

Author(s):

Fátima A. Saiz ◽

Garazi Alfaro ◽

Iñigo Barandiaran ◽

Manuel Graña

Keyword(s):

Ad Hoc ◽

Data Augmentation ◽

Semantic Network ◽

Semantic Networks ◽

Stereo Image ◽

Generative Adversarial Networks ◽

Specific Class ◽

Adversarial Networks ◽

Augmentation Techniques ◽

Image Acquisition System

This paper describes the application of Semantic Networks for the detection of defects in images of metallic manufactured components in a situation where the number of available samples of defects is small, which is rather common in real practical environments. In order to overcome this shortage of data, the common approach is to use conventional data augmentation techniques. We resort to Generative Adversarial Networks (GANs) that have shown the capability to generate highly convincing samples of a specific class as a result of a game between a discriminator and a generator module. Here, we apply the GANs to generate samples of images of metallic manufactured components with specific defects, in order to improve training of Semantic Networks (specifically DeepLabV3+ and Pyramid Attention Network (PAN) networks) carrying out the defect detection and segmentation. Our process carries out the generation of defect images using the StyleGAN2 with the DiffAugment method, followed by a conventional data augmentation over the entire enriched dataset, achieving a large balanced dataset that allows robust training of the Semantic Network. We demonstrate the approach on a private dataset generated for an industrial client, where images are captured by an ad-hoc photometric-stereo image acquisition system, and a public dataset, the Northeastern University surface defect database (NEU). The proposed approach achieves an improvement of 7% and 6% in an intersection over union (IoU) measure of detection performance on each dataset over the conventional data augmentation.

Download Full-text

Data Augmentation Techniques on Arabic Data for Named Entity Recognition

Procedia Computer Science ◽

10.1016/j.procs.2021.05.092 ◽

2021 ◽

Vol 189 ◽

pp. 292-299

Author(s):

Caroline Sabty ◽

Islam Omar ◽

Fady Wasfalla ◽

Mohamed Islam ◽

Slim Abdennadher

Keyword(s):

Data Augmentation ◽

Named Entity Recognition ◽

Entity Recognition ◽

Named Entity ◽

Augmentation Techniques

Download Full-text

Neural Data Augmentation Techniques for Time Series Data and its Benefits

2020 19th IEEE International Conference on Machine Learning and Applications (ICMLA) ◽

10.1109/icmla51294.2020.00026 ◽

2020 ◽

Author(s):

Anindya Sarkar ◽

Anirudh Sunder Raj ◽

Raghu Sesha Iyengar

Keyword(s):

Time Series ◽

Data Augmentation ◽

Time Series Data ◽

Series Data ◽

Neural Data ◽

Augmentation Techniques

Download Full-text

A review: preprocessing techniques and data augmentation for sentiment analysis

Computational Social Networks ◽

10.1186/s40649-020-00080-x ◽

2021 ◽

Vol 8 (1) ◽

Author(s):

Huu-Thanh Duong ◽

Tram-Anh Nguyen-Thi

Keyword(s):

Machine Learning ◽

Sentiment Analysis ◽

Supervised Learning ◽

Data Augmentation ◽

Original Data ◽

Training Data ◽

Unseen Data ◽

Augmentation Techniques ◽

User Intervention

AbstractIn literature, the machine learning-based studies of sentiment analysis are usually supervised learning which must have pre-labeled datasets to be large enough in certain domains. Obviously, this task is tedious, expensive and time-consuming to build, and hard to handle unseen data. This paper has approached semi-supervised learning for Vietnamese sentiment analysis which has limited datasets. We have summarized many preprocessing techniques which were performed to clean and normalize data, negation handling, intensification handling to improve the performances. Moreover, data augmentation techniques, which generate new data from the original data to enrich training data without user intervention, have also been presented. In experiments, we have performed various aspects and obtained competitive results which may motivate the next propositions.

Download Full-text

A blueprint for developing and applying precision livestock farming tools: A key output of the EU-PLF project

Animal Frontiers ◽

10.2527/af.2017.0103 ◽

2017 ◽

Vol 7 (1) ◽

pp. 12-17 ◽

Cited By ~ 11

Author(s):

Marcella Guarino ◽

Tomas Norton ◽

Dries Berckmans ◽

Erik Vranken ◽

Daniel Berckmans

Keyword(s):

Livestock Farming ◽

Precision Livestock Farming ◽

The Eu

Download Full-text

Image Data Augmentation techniques for Deep Learning -A Mirror Review

10.1109/icrito51393.2021.9596262 ◽

2021 ◽

Author(s):

Dipen Saini ◽

Rahul Malik

Keyword(s):

Deep Learning ◽

Data Augmentation ◽

Image Data ◽

Augmentation Techniques

Download Full-text

GaitSense: Towards Ubiquitous Gait-Based Human Identification with Wi-Fi

ACM Transactions on Sensor Networks ◽

10.1145/3466638 ◽

2022 ◽

Vol 18 (1) ◽

pp. 1-24

Author(s):

Yi Zhang ◽

Yue Zheng ◽

Guidong Zhang ◽

Kun Qian ◽

Chen Qian ◽

...

Keyword(s):

Data Augmentation ◽

Gait Recognition ◽

Wearable Sensors ◽

Human Identification ◽

Training Data ◽

Identification Accuracy ◽

Identification System ◽

Gait Patterns ◽

Training Samples ◽

Augmentation Techniques

Gait, the walking manner of a person, has been perceived as a physical and behavioral trait for human identification. Compared with cameras and wearable sensors, Wi-Fi-based gait recognition is more attractive because Wi-Fi infrastructure is almost available everywhere and is able to sense passively without the requirement of on-body devices. However, existing Wi-Fi sensing approaches impose strong assumptions of fixed user walking trajectories, sufficient training data, and identification of already known users. In this article, we present GaitSense , a Wi-Fi-based human identification system, to overcome the above unrealistic assumptions. To deal with various walking trajectories and speeds, GaitSense first extracts target specific features that best characterize gait patterns and applies novel normalization algorithms to eliminate gait irrelevant perturbation in signals. On this basis, GaitSense reduces the training efforts in new deployment scenarios by transfer learning and data augmentation techniques. GaitSense also enables a distinct feature of illegal user identification by anomaly detection, making the system readily available for real-world deployment. Our implementation and evaluation with commodity Wi-Fi devices demonstrate a consistent identification accuracy across various deployment scenarios with little training samples, pushing the limit of gait recognition with Wi-Fi signals.

Download Full-text