A machine learning pipeline for classification of cetacean echolocation clicks in large underwater acoustic datasets

Machine learning algorithms, including recent advances in deep learning, are promising tools for detection and classification of broadband high-frequency signals in passive acoustic recordings. However, these methods are generally data-hungry, and progress has been limited by the lack of labeled datasets adequate for training and testing. Large quantities of known and as yet unidentified broadband signal types mingle in marine recordings, with variability introduced by acoustic propagation, source depths and orientations, and interacting signals. Manual classification of these datasets is unmanageable without in-depth knowledge of the acoustic context of each recording location. A signal classification pipeline is presented which combines unsupervised and supervised learning phases with opportunities for expert oversight to label signals of interest. The method is illustrated with a case study using unsupervised clustering to identify five toothed whale echolocation click types and two anthropogenic signal categories. These categories are used to train a deep network to classify detected signals, either in averaged time bins or as individual detections, in two independent datasets. Bin-level classification achieved higher overall precision (>99%) than click-level classification. However, click-level classification had the advantage of providing a label for every signal, and achieved higher overall recall, with overall precision from 92 to 94%. The results suggest that unsupervised learning is a viable solution for efficiently generating the large, representative training sets needed for applications of deep learning in passive acoustics.
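
The precision and recall figures quoted above can be read against a small worked example. The following is a minimal sketch, not the authors' evaluation code: the function name `precision_recall` and the click labels are hypothetical and invented purely for illustration.

```python
from collections import Counter


def precision_recall(true_labels, predicted_labels, positive):
    """Return (precision, recall) for one class, treated as the positive label."""
    counts = Counter()
    for truth, guess in zip(true_labels, predicted_labels):
        if guess == positive and truth == positive:
            counts["tp"] += 1  # correctly labeled as the class
        elif guess == positive:
            counts["fp"] += 1  # labeled as the class, but it was something else
        elif truth == positive:
            counts["fn"] += 1  # a member of the class that was missed
    tp, fp, fn = counts["tp"], counts["fp"], counts["fn"]
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    return precision, recall


# Hypothetical labels (assumption): six detections, three signal categories.
truth = ["dolphin", "dolphin", "ship", "dolphin", "ship", "sonar"]
guess = ["dolphin", "ship", "ship", "dolphin", "ship", "ship"]

dolphin_precision, dolphin_recall = precision_recall(truth, guess, "dolphin")
ship_precision, ship_recall = precision_recall(truth, guess, "ship")
```

In this toy data the "dolphin" class has perfect precision but misses one click, while the "ship" class catches every true ship signal at the cost of two false alarms, which is the same kind of precision/recall trade-off the abstract describes between bin-level and click-level classification.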

Researching the Research: Applying Machine Learning Techniques to Dissertation Classification

Journal of Computer Science Research, doi:10.30564/jcsr.v2i4.2230, 2020, Vol 2 (4)

Author(s): Suzanna Schmeelk

Keyword(s): Machine Learning, Full Text, Doctoral Program, Machine Learning Algorithms, Machine Learning Techniques, Full Time, Learning Techniques, Manual Classification, Machine Learning Tool

This research examines industry-based dissertation research in a doctoral computing program through the lens of machine learning algorithms to understand topics explored by senior and experienced full-time working professionals (EFWPs). Our research categorizes dissertations both by their abstracts and by their full text using the GraphLab Create library from Apple’s Turi. We also compare the dissertation categorizations using IBM’s Watson Discovery deep machine learning tool. Our research provides perspectives on the practicality of the manual classification of technical documents, and it provides insights into (1) the categories of academic work created by EFWPs in a Computing doctoral program, (2) the viability of automated categorization versus human abstraction, and (3) the differences in categorization algorithms.

Deep learning models for classification of gases detected by sensor arrays of artificial nose

doi:10.5753/eniac.2019.9339, 2019

Author(s): Ismael Araujo, Juan Gamboa, Adenilton Silva

Keyword(s): Machine Learning, Deep Learning, Sensor Arrays, Machine Learning Algorithms, Human Beings, Learning Models, Classification Problems, Artificial Nose, Learning Techniques

Recognizing patterns that are usually imperceptible to human beings has been one of the main advantages of using machine learning algorithms. Deep learning techniques have been promising for classification problems, especially those related to image classification. The classification of gases detected by an artificial nose is another area where deep learning techniques can be used to seek classification improvements. Succeeding in a classification task can bring many advantages for quality control as well as for accident prevention. This work presents several deep learning models created specifically for the task of gas classification.

Federated Learning: A Distributed Shared Machine Learning Method

Complexity, doi:10.1155/2021/8261663, 2021, Vol 2021, pp. 1-20

Author(s): Kai Hu, Yaogen Li, Min Xia, Jiasheng Wu, Meixia Lu, ...

Keyword(s): Machine Learning, Deep Learning, Development Process, Machine Learning Algorithms, Process Definition, Future Developments, Private Data, Multiple Clients, Distributed Machine Learning

Federated learning (FL) is a distributed machine learning (ML) framework. In FL, multiple clients collaborate to solve traditional distributed ML problems under the coordination of a central server without sharing their local private data with others. This paper surveys FL methods based on machine learning and deep learning. First, it introduces the development process, definition, architecture, and classification of FL and explains the concept of FL by comparing it with traditional distributed learning. Then, it describes typical problems of FL that need to be solved. Building on classical FL algorithms, several federated machine learning algorithms are briefly introduced, with emphasis on deep learning, and these algorithms are classified and compared. Finally, this paper discusses possible future developments of FL based on deep learning.
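
To make the server-side coordination concrete, here is a minimal sketch of one aggregation round using federated averaging (FedAvg), a commonly used aggregation rule. The abstract does not say which algorithms the survey covers, so the choice of FedAvg, the function name `federated_average`, and the example numbers are all assumptions made for illustration.

```python
def federated_average(client_updates):
    """Combine client weight vectors into one global vector.

    client_updates is a list of (weights, num_samples) pairs, where weights
    is a list of floats. Each client is weighted by its local sample count,
    so only model weights, never raw data, reach the server.
    """
    total_samples = sum(num_samples for _, num_samples in client_updates)
    if total_samples == 0:
        raise ValueError("clients reported no training samples")
    size = len(client_updates[0][0])
    averaged = [0.0] * size
    for weights, num_samples in client_updates:
        share = num_samples / total_samples  # this client's share of the data
        for i, value in enumerate(weights):
            averaged[i] += value * share
    return averaged


# Hypothetical round (assumption): two clients holding 10 and 30 samples.
updates = [([1.0, 2.0], 10), ([3.0, 4.0], 30)]
global_weights = federated_average(updates)
```

The client with three times as much data pulls the global weights three times as hard, which is why the result lands closer to the second client's vector.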

Research on Classification Method of Maize Seed Defect Based on Machine Vision

Journal of Sensors, doi:10.1155/2019/2716975, 2019, Vol 2019, pp. 1-9

Author(s): Sheng Huang, Xiaofei Fan, Lei Sun, Yanlu Shen, Xuesong Suo

Keyword(s): Machine Learning, Deep Learning, Learning Algorithm, Machine Learning Algorithms, Heat Map, Deep Learning Algorithm, Quality Classification, Visualization Technology, Better Than

Traditionally, the classification of seed defects relies mainly on the characteristics of color, shape, and texture. This method requires repeated extraction of a large amount of feature information, which makes detection inefficient. In recent years, deep learning has performed well in the field of image recognition. We introduced convolutional neural networks (CNNs) and transfer learning into the quality classification of seeds and compared them with traditional machine learning algorithms. Experiments showed that the deep learning algorithm was significantly better than the traditional machine learning algorithm, with an accuracy of 95% (GoogLeNet) vs. 79.2% (SURF+SVM). We used three classifiers in GoogLeNet to demonstrate that network accuracy increases as the depth of the network increases. We used visualization techniques to obtain the feature map of each layer of the network in CNNs and used a heat map to represent the probability distribution of the inference results. As an end-to-end network, CNNs can be easily applied for automated seed manufacturing.

Automating Visual Blockage Classification of Culverts with Deep Learning

Applied Sciences, doi:10.3390/app11167561, 2021, Vol 11 (16), pp. 7561

Author(s): Umair Iqbal, Johan Barthelemy, Wanqing Li, Pascal Perez

Keyword(s): Machine Learning, Deep Learning, False Negative, Hydraulic Modeling, Machine Learning Algorithms, Visual Features, Learning Approaches, Learning Models, Conventional Machine

Blockage of culverts by transported debris is reported as the salient contributor to urban flash floods. Conventional hydraulic modeling approaches have had no success in addressing the problem, primarily because of the unavailability of peak-flood hydraulic data and the highly non-linear behavior of debris at the culvert. This article explores a new dimension of the issue by proposing the use of intelligent video analytics (IVA) algorithms for extracting blockage-related information. The presented research aims to automate the process of manual visual blockage classification of culverts from a maintenance perspective by remotely applying deep learning models. The potential of existing convolutional neural network (CNN) algorithms (i.e., DarkNet53, DenseNet121, InceptionResNetV2, InceptionV3, MobileNet, ResNet50, VGG16, EfficientNetB3, NASNet) is investigated over a dataset from three different sources (i.e., images of culvert openings and blockage (ICOB), visual hydrology-lab dataset (VHD), synthetic images of culverts (SIC)) to predict the blockage in a given image. Models were evaluated based on their performance on the test dataset (i.e., accuracy, loss, precision, recall, F1 score, Jaccard Index, receiver operating characteristic (ROC) curve), floating point operations per second (FLOPs) and response times to process a single test instance. Furthermore, the performance of deep learning models was benchmarked against conventional machine learning algorithms (i.e., SVM, RF, xgboost). In addition, the idea of classifying deep visual features extracted by CNN models (i.e., ResNet50, MobileNet) using conventional machine learning approaches was also implemented in this article. From the results, NASNet was reported most efficient in classifying the blockage images, with a 5-fold accuracy of 85%; however, MobileNet was recommended for the hardware implementation because of its improved response time with 5-fold accuracy comparable to NASNet (i.e., 78%). Comparable performance to standard CNN models was achieved for the case where deep visual features were classified using conventional machine learning approaches. False negative (FN) instances, false positive (FP) instances and CNN layer activations suggested that background noise and oversimplified labelling criteria were two contributing factors in the degraded performance of existing CNN algorithms. A framework for partial automation of the visual blockage classification process was proposed, given that none of the existing models was able to achieve high enough accuracy to completely automate the manual process. In addition, a detection-classification pipeline with higher blockage classification accuracy (i.e., 94%) has been proposed as a potential future direction for practical implementation.

Automatic Classification of Vulnerabilities using Deep Learning and Machine Learning Algorithms

doi:10.1109/ijcnn52387.2021.9534259, 2021

Author(s): Vishnu Ramesh, Sara Abraham, P Vinod, Isham Mohamed, Corrado A. Visaggio, ...

Keyword(s): Machine Learning, Deep Learning, Learning Algorithms, Automatic Classification, Machine Learning Algorithms

Clustering activity at Mt Etna based on volcanic tremor: A case study

Earth Science Informatics, doi:10.1007/s12145-021-00606-5, 2021

Author(s): Giuseppe Nunnari

Keyword(s): Machine Learning, Volcanic Activity, Roc Curve, Volcanic Tremor, Machine Learning Algorithms, Etna Volcano, Mt Etna, Geophysical Signal

This paper deals with the classification of volcanic activity into three classes, referred to as Quiet, Strombolian and Paroxysm. The main purpose is to measure how reliably such a classification, typically carried out by experts, can be performed by machine learning algorithms using volcanic tremor as a feature. Both supervised and unsupervised methods are considered. It is experimentally shown that at least the Paroxysm activity can be reliably classified. Performance is rigorously assessed, in comparison with the classification made by expert volcanologists, in terms of popular indices such as the F1-score and the area under the ROC curve (AUC). The work is essentially a case study carried out on a dataset recorded in the area of the Mt Etna volcano. However, as volcanic tremor is a widely available geophysical signal, the methods and strategies considered can be easily applied to similar volcanic areas.
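
As a reading aid for the AUC index mentioned above, the following is a minimal sketch that computes the area under the ROC curve through its rank interpretation: the probability that a randomly chosen positive example receives a higher score than a randomly chosen negative one. The function name `roc_auc` and the scores are hypothetical; this is not the paper's evaluation code.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve for binary labels (1 = positive, 0 = negative).

    Counts, over all positive/negative pairs, how often the positive example
    is scored higher; ties count as half a win.
    """
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for pos_score in positives:
        for neg_score in negatives:
            if pos_score > neg_score:
                wins += 1.0
            elif pos_score == neg_score:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical classifier scores (assumption), e.g. 1 = Paroxysm, 0 = other.
labels = [1, 0, 1, 0]
scores = [0.9, 0.8, 0.4, 0.1]
auc = roc_auc(labels, scores)
```

Three of the four positive/negative pairs are ranked correctly here, so the AUC is 0.75; a value of 0.5 would correspond to random ranking and 1.0 to a perfect one.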

Classification of hazelnut cultivars: comparison of DL4J and ensemble learning algorithms

Notulae Botanicae Horti Agrobotanici Cluj-Napoca, doi:10.15835/nbha48412041, 2020, Vol 48 (4), pp. 2316-2327

Author(s): Caner KOC, Dilara GERDAN, Maksut B. EMİNOĞLU, Uğur YEGÜL, Bulent KOC, ...

Keyword(s): Machine Learning, Deep Learning, Random Forest, Ensemble Learning, Learning Algorithms, Machine Learning Algorithms, Performance Criteria, Gradient Boosting, Data Set

Classification of hazelnuts is one of the value-adding processes that increase the marketability and profitability of hazelnut production. While traditional classification methods are commonly used, machine learning and deep learning can be implemented to enhance hazelnut classification. This paper presents the results of a comparative study of machine learning frameworks to classify hazelnut (Corylus avellana L.) cultivars (‘Sivri’, ‘Kara’, ‘Tombul’) using DL4J and ensemble learning algorithms. For each cultivar, 50 samples were used for evaluations. Maximum length, width, compression strength, and weight of hazelnuts were measured using a caliper and a force transducer. Gradient boosting machine (Boosting), random forest (Bagging), and DL4J feedforward (Deep Learning) algorithms were applied. The data set was evaluated using 10-fold cross-validation. The classifier performance criteria of accuracy (%), error percentage (%), F-Measure, Cohen’s Kappa, recall, precision, true positive (TP), false positive (FP), true negative (TN), and false negative (FN) values are provided in the results section. The results showed classification accuracies of 94% for Gradient Boosting, 100% for Random Forest, and 94% for DL4J Feedforward algorithms.
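
The 10-fold cross-validation step can be sketched as follows. This is an illustrative partition only, assuming 150 samples in total (50 for each of the three cultivars, as stated above) and a plain shuffled split; the abstract does not say whether the folds were stratified, and the function name `k_fold_indices` and the seed are hypothetical.

```python
import random


def k_fold_indices(n_samples, k, seed=0):
    """Yield (train_indices, test_indices) pairs for k-fold cross-validation.

    Indices are shuffled once, dealt into k folds, and each fold is used
    exactly once as the test set while the rest form the training set.
    """
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)  # fixed seed for reproducibility
    folds = [indices[i::k] for i in range(k)]
    for i in range(k):
        test = folds[i]
        train = [idx for j, fold in enumerate(folds) if j != i for idx in fold]
        yield train, test


# Assumed data set size: 3 cultivars x 50 samples each.
splits = list(k_fold_indices(150, 10))
```

Each of the ten rounds trains on 135 samples and tests on the remaining 15, and every sample appears in a test fold exactly once, so the reported metrics cover the whole data set.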

Assessment of Facial Homogeneity with Regard to Genealogical Aspects Based on Deep Learning Approach

Turkish Journal of Computer and Mathematics Education (TURCOMAT), doi:10.17762/turcomat.v12i3.962, 2021, Vol 12 (3), pp. 1550-1556

Author(s): Ravi Kumar Y B et al.

Keyword(s): Machine Learning, Deep Learning, Binary Classification, Learning Algorithms, Research Work, Machine Learning Algorithms, Facial Features, Learning Approach

The current research work assesses similarity-based facial features of images with the devised method in order to determine genealogical similarity. It is based on the principle of grouping the closer features, as opposed to those that fall outside a predefined threshold, for a better ascertainment of the extracted features. The system developed is trained using a deep learning-oriented architecture incorporating these closer features for a binary classification of the subjects considered into genealogic and non-genealogic. The genealogic set of data is further used to calculate the percentage of similarity with the devised methods. The present work considered XX datasets from XXXX source for the assessment of facial similarities. The results showed an accuracy of 96.3% for genealogic data, the most salient being father-daughter (98.1%), father-son (98.3%), mother-daughter (96.6%), and mother-son (96.1%) genealogy in the case of the datasets from “kinface W-I”. Extending this work to the “kinface W-II” set of data, the results were promising, with father-daughter (98.5%), father-son (96.7%), mother-daughter (93.4%) and mother-son (98.9%) genealogy. Such an approach could be further extended to larger databases so as to assess genealogical similarity with the aid of machine learning algorithms.

A deep learning-based quality assessment model of collaboratively edited documents: A case study of Wikipedia

Journal of Information Science, doi:10.1177/0165551519877646, 2019, pp. 016555151987764

Author(s): Ping Wang, Xiaodan Li, Renli Wu

Keyword(s): Machine Learning, Deep Learning, Complete Information, Classification Performance, Machine Learning Algorithms, Assessment Model, Learning Models, Proposed Model

Wikipedia is becoming increasingly critical in helping people obtain information and knowledge. Its leading advantage is that users can not only access information but also modify it. However, this presents a challenging issue: how can we measure the quality of a Wikipedia article? The existing approaches assess Wikipedia quality by statistical models or traditional machine learning algorithms. However, their performance is not satisfactory. Moreover, most existing models fail to extract complete information from articles, which degrades the model’s performance. In this article, we first survey related works and summarise a comprehensive feature framework. Then, state-of-the-art deep learning models are introduced and applied to assess Wikipedia quality. Finally, a comparison among deep learning models and traditional machine learning models is conducted to validate the effectiveness of the proposed model. The models are compared extensively in terms of their training and classification performance. Moreover, the importance of each feature and the importance of different feature sets are analysed separately.
