Bee Swarm Activity Acoustic Classification for an IoT-Based Farm Service

Sensors ◽  
2019 ◽  
Vol 20 (1) ◽  
pp. 21 ◽  
Author(s):  
Andrej Zgank

Beekeeping is one of the most widespread and traditional fields in agriculture, where Internet of Things (IoT)-based solutions and machine learning approaches can ease and improve beehive management significantly. A particularly important activity is bee swarming. A beehive monitoring system can be applied in digital farming to alert the user, via a service, about the beginning of swarming, which requires a response. An IoT-based bee activity acoustic classification system is proposed in this paper. The audio data needed for acoustic training was collected from the Open Source Beehives Project. The input audio signal was converted into feature vectors using Mel-Frequency Cepstral Coefficients (with cepstral mean normalization) and Linear Predictive Coding. The influence of acoustic background noise and of a denoising procedure was evaluated in an additional step. Different Hidden Markov Model and Gaussian Mixture Model topologies were developed for acoustic modeling, with the objective of determining the most suitable one for the proposed IoT-based solution. The evaluation was carried out with a separate test set, in order to classify sounds as either normal or swarming conditions in a beehive. The evaluation results showed that good acoustic classification performance can be achieved with the proposed system.
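The cepstral mean normalization mentioned above can be sketched in a few lines: per coefficient, the mean over all frames of an utterance is subtracted, which suppresses stationary channel effects. This is a minimal illustrative sketch; the frame values below are made up, not features from the paper.

```python
# Cepstral mean normalization (CMN): subtract the per-coefficient mean,
# computed over all frames, from every MFCC frame of an utterance.

def cepstral_mean_normalize(frames):
    """frames: list of equal-length MFCC vectors (one per audio frame)."""
    n = len(frames)
    dim = len(frames[0])
    means = [sum(f[i] for f in frames) / n for i in range(dim)]
    return [[f[i] - means[i] for i in range(dim)] for f in frames]

mfcc_frames = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]   # toy MFCC vectors
normalized = cepstral_mean_normalize(mfcc_frames)
# After CMN, each coefficient has zero mean across the frames.
```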

Sensors ◽  
2021 ◽  
Vol 21 (3) ◽  
pp. 676
Author(s):  
Andrej Zgank

Animal activity acoustic monitoring is becoming one of the necessary tools in agriculture, including beekeeping. It can assist in the control of beehives in remote locations, and such approaches make it possible to classify bee swarm activity from audio signals. A deep neural network (DNN) IoT-based acoustic swarm classification system is proposed in this paper. Audio recordings were obtained from the Open Source Beehives Project. Mel-frequency cepstral coefficient features were extracted from the audio signal. The lossless WAV and lossy MP3 audio formats were compared for IoT-based solutions. An analysis was made of the impact of the deep neural network parameters on the classification results. The best overall classification accuracy with uncompressed audio was 94.09%, while MP3 compression degraded the DNN accuracy by over 10%. The evaluation of the proposed DNN IoT-based bee activity acoustic classification system showed improved results compared to the previous hidden Markov model system.
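The scoring step of such a DNN classifier amounts to a forward pass over the MFCC features followed by a softmax over the two classes. The sketch below shows that step only; the network shape, weights, and feature values are made-up placeholders, not the trained model from the paper.

```python
import math

# Minimal forward pass of a tiny fully connected network producing
# swarm/normal class probabilities from one MFCC feature vector.

def relu(x):
    return max(0.0, x)

def dense(vec, weights, biases):
    # One fully connected layer: weighted sum per output unit, plus bias.
    return [sum(w * v for w, v in zip(row, vec)) + b
            for row, b in zip(weights, biases)]

def softmax(logits):
    m = max(logits)                      # shift for numerical stability
    exps = [math.exp(z - m) for z in logits]
    s = sum(exps)
    return [e / s for e in exps]

mfcc = [0.5, -1.2, 0.3]                  # one feature vector (toy values)
hidden = [relu(z) for z in dense(mfcc,
                                 [[0.4, -0.1, 0.2], [0.1, 0.3, -0.5]],
                                 [0.0, 0.1])]
probs = softmax(dense(hidden, [[1.0, -1.0], [-1.0, 1.0]], [0.0, 0.0]))
label = "swarm" if probs[0] > probs[1] else "normal"
```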


2020 ◽  
Vol 7 (4) ◽  
pp. 1-12
Author(s):  
Zrar Khalid Abdul

Automatic recognition of spoken letters is one of the most challenging tasks in the area of speech recognition. In this paper, different machine learning approaches, such as SVM and k-NN, are used to classify the Kurdish alphabet, where both classifiers are fed by two different features: Linear Predictive Coding (LPC) and Mel Frequency Cepstral Coefficients (MFCCs). Moreover, the features are combined to train the classifiers. The experiments are evaluated on a dataset collected by the authors, as there is no standard Kurdish dataset. The dataset consists of 2720 samples in total. The results show that the MFCC features outperform the LPC features, as the MFCCs carry more information about the vocal tract. Furthermore, fusing the features (MFCC and LPC) does not improve the classification rate significantly.
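Feature fusion here means simply concatenating the MFCC and LPC vectors before classification; a 1-nearest-neighbour classifier then assigns the label of the closest training sample. The sketch below uses toy two- and one-dimensional "features" and hypothetical labels, purely to illustrate the mechanics.

```python
import math

# Fuse MFCC and LPC features by concatenation, then classify with 1-NN.

def fuse(mfcc, lpc):
    return mfcc + lpc                    # simple vector concatenation

def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def knn_1(query, train):
    """train: list of (feature_vector, label) pairs; return nearest label."""
    return min(train, key=lambda item: euclidean(query, item[0]))[1]

train = [
    (fuse([0.1, 0.2], [1.0]), "alef"),   # toy fused sample for one letter
    (fuse([0.9, 0.8], [0.2]), "be"),     # toy fused sample for another
]
predicted = knn_1(fuse([0.15, 0.25], [0.9]), train)  # nearest: "alef" sample
```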


2021 ◽  
Vol 39 (1B) ◽  
pp. 30-40
Author(s):  
Ahmed M. Ahmed ◽  
Aliaa K. Hassan

Speaker recognition is the process of recognizing a person by his/her voice through specific features extracted from the voice signal. Automatic Speaker Recognition (ASP) is a biometric authentication system. In the last decade, many advances in the speaker recognition field have been attained, along with many techniques in the feature extraction and modeling phases. In this paper, we present an overview of the most recent works in ASP technology. The study makes an effort to discuss several ASP modeling techniques, such as the Gaussian Mixture Model (GMM), Vector Quantization (VQ), and clustering algorithms. Also, several feature extraction techniques, such as Linear Predictive Coding (LPC) and Mel frequency cepstral coefficients (MFCC), are examined. Finally, as a result of this study, we found that MFCC and GMM could be considered the most successful techniques in the field of speaker recognition so far.
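The Vector Quantization approach surveyed above represents each speaker by a small codebook of centroids; a test utterance is attributed to the speaker whose codebook gives the lowest average distortion over its frames. A minimal sketch, with made-up two-dimensional codebooks and frames standing in for real trained feature centroids:

```python
import math

# VQ speaker scoring: average distance of each test frame to its nearest
# codeword, per speaker codebook; lowest average distortion wins.

def distortion(frame, codebook):
    return min(math.dist(frame, code) for code in codebook)

def score(frames, codebook):
    return sum(distortion(f, codebook) for f in frames) / len(frames)

codebooks = {
    "speaker_a": [[0.0, 0.0], [1.0, 1.0]],   # toy trained centroids
    "speaker_b": [[5.0, 5.0], [6.0, 6.0]],
}
test_frames = [[0.2, 0.1], [0.9, 1.1]]        # toy test utterance
best = min(codebooks, key=lambda s: score(test_frames, codebooks[s]))
```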


Author(s):  
Tshilidzi Marwala ◽  
Christina Busisiwe Vilakazi

Condition monitoring techniques are described in this chapter. Two aspects of the condition monitoring process are considered: (1) feature extraction; and (2) condition classification. The feature extraction methods described and implemented are fractals, kurtosis, and Mel-frequency cepstral coefficients. The classification methods described and implemented are support vector machines (SVM), hidden Markov models (HMM), Gaussian mixture models (GMM), and extension neural networks (ENN). The effectiveness of these features was tested using SVM, HMM, GMM, and ENN on condition monitoring of bearings, and they were found to give good results.
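Of the features listed, kurtosis is the simplest to compute: impulsive transients from a defective bearing raise the fourth-moment statistic of the vibration signal. A pure-Python sketch with made-up signals (a flat square wave versus one containing a single large transient):

```python
# Kurtosis of a signal: fourth central moment divided by squared variance.
# Impulsive faults produce heavier tails, hence higher kurtosis.

def kurtosis(signal):
    n = len(signal)
    mean = sum(signal) / n
    var = sum((x - mean) ** 2 for x in signal) / n
    return sum((x - mean) ** 4 for x in signal) / (n * var ** 2)

smooth = [1.0, -1.0, 1.0, -1.0]       # toy signal with no impulses
impulsive = [0.1, -0.1, 0.1, 10.0]    # toy signal with one large transient
# The impulsive signal yields the higher kurtosis.
```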


Author(s):  
Mahboubeh Farahat ◽  
Ramin Halavati

Most current speech recognition systems use Hidden Markov Models (HMMs) to deal with the temporal variability of speech, and Gaussian mixture models (GMMs) to determine how well each state of each HMM fits a frame, or a short window of frames, of coefficients that represents the acoustic input. In these systems, acoustic inputs are represented by a temporal sequence of Mel Frequency Cepstral Coefficient (MFCC) vectors known as frames. But MFCCs are not robust to noise; consequently, under mismatched training and testing conditions, the accuracy of speech recognition systems decreases. On the other hand, using MFCCs from a larger window of frames in GMMs requires more computational power. In this paper, Deep Belief Networks (DBNs) are used to extract discriminative information from a larger window of frames. Nonlinear transformations lead to high-order, low-dimensional features that are robust to variation of the input speech. Multi-speaker isolated word recognition tasks with 100 and 200 words, in clean and noisy environments, have been used to test this method. The experimental results indicate that this new method of feature encoding results in much better word recognition accuracy.
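The "larger window of frames" fed to a deep network is typically built by stacking each frame with its neighbours into one long input vector, with edge frames padded by repetition. A minimal sketch of that stacking step, using toy one-dimensional "frames" rather than real MFCC vectors:

```python
# Stack a symmetric context window of neighbouring frames into one vector
# per frame, as commonly done before feeding a deep network.

def stack_context(frames, context):
    """Return, per frame, the concatenation of 2*context+1 neighbours."""
    stacked = []
    for i in range(len(frames)):
        window = []
        for j in range(i - context, i + context + 1):
            j = min(max(j, 0), len(frames) - 1)   # clamp indices at edges
            window.extend(frames[j])
        stacked.append(window)
    return stacked

frames = [[1.0], [2.0], [3.0]]          # toy 1-D frames
stacked = stack_context(frames, 1)
# The middle frame becomes [1.0, 2.0, 3.0]; edges repeat the boundary frame.
```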


2017 ◽  
Vol 24 (2) ◽  
pp. 17-26
Author(s):  
Mustafa Yagimli ◽  
Huseyin Kursat Tezer

Abstract: The real-time voice command recognition system used for this study aims to increase situational awareness, and therefore the safety of navigation, particularly during close manoeuvres of warships and the passage of commercial vessels in narrow waters. With the developed system, the safety of navigation, which has become especially important in precision manoeuvres, becomes controllable through voice command recognition-based software. The system was observed to work with 90.6% accuracy using Mel Frequency Cepstral Coefficient (MFCC) and Dynamic Time Warping (DTW) parameters, and with 85.5% accuracy using Linear Predictive Coding (LPC) and DTW parameters.
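The DTW step used above aligns a spoken command against a stored template while tolerating differences in speaking rate. A minimal O(n·m) sketch over one-dimensional sequences with absolute difference as the local cost; a real system would compare MFCC vectors per frame instead.

```python
# Dynamic time warping: minimum-cost alignment between two sequences,
# computed by dynamic programming over a cumulative cost matrix.

def dtw(a, b):
    n, m = len(a), len(b)
    inf = float("inf")
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(a[i - 1] - b[j - 1])          # local frame distance
            cost[i][j] = d + min(cost[i - 1][j],      # insertion
                                 cost[i][j - 1],      # deletion
                                 cost[i - 1][j - 1])  # match
    return cost[n][m]

print(dtw([1, 2, 3], [1, 2, 2, 3]))  # → 0.0: the stretch costs nothing
```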


Author(s):  
DEBASHISH DEV MISHRA ◽  
UTPAL BHATTACHARJEE ◽  
SHIKHAR KUMAR SARMA

The performance of an automatic speaker recognition (ASR) system degrades drastically in the presence of noise and other distortions, especially when there is a noise level mismatch between the training and testing environments. This paper explores the problem of speaker recognition in noisy conditions, assuming that speech signals are corrupted by noise. A major problem of most speaker recognition systems is their unsatisfactory performance in noisy environments. In this experimental research, we have studied a combination of Mel Frequency Cepstral Coefficients (MFCC) for feature extraction and Cepstral Mean Normalization (CMN) techniques for speech enhancement. Our system uses a Gaussian Mixture Model (GMM) classifier and is implemented under the MATLAB®7 programming environment. The process involves the use of speaker data for both training and testing. The data used for testing is matched against a speaker model, which is trained with the training data using GMM modeling. Finally, experiments are carried out to test the new model for ASR given limited training data and with differing levels and types of realistic background noise. The results have demonstrated the robustness of the new system.
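Matching test data against a GMM speaker model means summing, over the test frames, the log-likelihood under the mixture. A sketch of that scoring step with a diagonal-covariance GMM; the weights, means, and variances below are toy values, not parameters trained with EM as a real system would use.

```python
import math

# Log-likelihood of a frame sequence under a diagonal-covariance GMM,
# using the log-sum-exp trick for numerical stability.

def gaussian_logpdf(x, mean, var):
    return sum(-0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))

def gmm_loglike(frames, weights, means, variances):
    total = 0.0
    for f in frames:
        comps = [math.log(w) + gaussian_logpdf(f, m, v)
                 for w, m, v in zip(weights, means, variances)]
        peak = max(comps)
        total += peak + math.log(sum(math.exp(c - peak) for c in comps))
    return total

model = dict(weights=[0.5, 0.5], means=[[0.0], [3.0]], variances=[[1.0], [1.0]])
frames_near = [[0.1], [2.9]]     # toy frames close to the model's means
frames_far = [[10.0], [12.0]]    # toy frames far from the model
better = gmm_loglike(frames_near, **model) > gmm_loglike(frames_far, **model)
```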

