What Identifies a Whale by Its Fluke? On the Benefit of Interpretable Machine Learning for Whale Identification

Author(s):  
J. Kierdorf
J. Garcke
J. Behley
T. Cheeseman
R. Roscher

Abstract. Interpretable and explainable machine learning have proven to be promising approaches for verifying the quality of a data-driven model in general, as well as for obtaining more information about the quality of specific observations in practice. In this paper, we use these approaches for an application in the marine sciences to support the monitoring of whales. Whale population monitoring is an important element of whale conservation, and the identification of individual whales plays a central role in this process, for example to trace the migration of whales over time and space. Classical approaches use photographs and manual matching, with special focus on the shape of the whale flukes and their unique pigmentation. However, this is not feasible for comprehensive monitoring. Machine learning methods, especially deep neural networks, have shown that they can efficiently automate the observation of large numbers of whales. Despite their success on many tasks such as identification, further potentials such as interpretability and its benefits have not yet been exploited. Our main contribution is an analysis of interpretation tools, especially occlusion sensitivity maps, and of the question of how the gained insights can help a whale researcher. For our analysis, we use images of humpback whale flukes provided by the Kaggle challenge "Humpback Whale Identification". By means of spectral cluster analysis of heatmaps, which indicate which parts of the image are important for a decision, we show that the heatmaps can be grouped in a meaningful way. Moreover, it appears that the characteristics automatically determined by the neural network correspond to those considered important by a whale expert.
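The occlusion sensitivity maps analysed above can be sketched in a few lines: a patch is slid over the image, and the drop in the model's class score is recorded at each position. The scoring function below is a toy stand-in for the trained whale-identification network, and the image size, patch size, and stride are illustrative assumptions, not the paper's settings.

```python
import numpy as np

def occlusion_map(image, score_fn, patch=4, stride=4):
    """Slide a gray patch over the image; record the drop in the
    model's score at each position. Larger drops mark regions the
    model relies on for its decision."""
    h, w = image.shape
    base = score_fn(image)
    heat = np.zeros((h // stride, w // stride))
    for i, y in enumerate(range(0, h - patch + 1, stride)):
        for j, x in enumerate(range(0, w - patch + 1, stride)):
            occluded = image.copy()
            occluded[y:y + patch, x:x + patch] = image.mean()  # gray patch
            heat[i, j] = base - score_fn(occluded)
    return heat

# Toy stand-in for a trained network: scores how bright the
# lower-right quadrant is (purely illustrative, not the paper's model).
def toy_score(img):
    return img[8:, 8:].mean()

img = np.zeros((16, 16))
img[8:, 8:] = 1.0                       # a "distinctive marking"
heat = occlusion_map(img, toy_score)
# occluding the marked quadrant lowers the score the most
```

In the paper's setting, many such heatmaps (one per fluke image) would then be flattened and fed to a spectral clustering step to group images by which regions drive the identification.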

Electronics
2021
Vol 10 (3)
pp. 318
Author(s):
Merima Kulin
Tarik Kazaz
Eli De Poorter
Ingrid Moerman

This paper presents a systematic and comprehensive survey of the latest research efforts on machine learning (ML) based performance improvement of wireless networks, considering all layers of the protocol stack: PHY, MAC, and network. First, the related work and the paper's contributions are discussed, followed by the necessary background on data-driven approaches and machine learning to help non-experts understand the discussed techniques. Then, a comprehensive review is presented of works employing ML-based approaches to optimize wireless communication parameter settings for improved network quality of service (QoS) and quality of experience (QoE). These works are categorized into radio analysis, MAC analysis, and network prediction approaches, with subcategories within each. Finally, open challenges and broader perspectives are discussed.


2021
Vol 14 (11)
Author(s):
Tanveer Ahmed Siddiqi
Saima Ashraf
Sadiq Ali Khan
Muhammad Jawed Iqbal

The article aims to develop a model for forecasting the characteristics of traffic flows in real time, based on the classification of applications using machine learning methods, in order to ensure quality of service. It is shown that the model can forecast the mean rate and frequency of packet arrival for the entire flow of each class separately. The prediction is based on information about previous flows of the same class and on the first 15 packets of the active flow. The Random Forest Regression method thereby reduces the prediction error by a factor of approximately 1.5 compared to the standard mean estimate for transmitted packets reported at the switch interface.
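A rough sketch of this setup: train scikit-learn's RandomForestRegressor (the method named above) on statistics of the first 15 packets of synthetic flows, then compare its error against the constant-mean baseline. The flow model, feature choice, and parameter values are illustrative assumptions, not the article's actual data or configuration.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor

rng = np.random.default_rng(0)

# Synthetic flows (illustrative only): each flow has a true mean
# packet rate; its first 15 packets carry a noisy signal of it.
n_flows = 500
true_rate = rng.uniform(10, 100, n_flows)            # pkts/s, whole flow
first15 = true_rate[:, None] + rng.normal(0, 5, (n_flows, 15))

# Features extracted from the first 15 packets of each flow.
X = np.column_stack([first15.mean(1), first15.std(1),
                     first15.min(1), first15.max(1)])
y = true_rate

X_tr, X_te, y_tr, y_te = X[:400], X[400:], y[:400], y[400:]
rf = RandomForestRegressor(n_estimators=100, random_state=0).fit(X_tr, y_tr)

mae_rf = np.abs(rf.predict(X_te) - y_te).mean()
mae_naive = np.abs(y_tr.mean() - y_te).mean()        # constant-mean baseline
# the forest should beat the naive mean estimate by a wide margin
```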


2020
Vol 12 (4)
pp. 739
Author(s):
Keiller Nogueira
Gabriel L. S. Machado
Pedro H. T. Gama
Caio C. V. da Silva
Remis Balaniuk
...  

Soil erosion is considered one of the most expensive natural hazards, with a high impact on several infrastructure assets. Among them, railway lines are one of the constructions most prone to erosion and, consequently, one of the most troublesome, owing to maintenance costs, risks of derailment, and so on. It is therefore fundamental to identify and monitor erosion along railway lines to prevent major consequences. Currently, erosion identification is performed manually by humans examining huge image sets, a slow and time-consuming task. Hence, automatic machine learning methods appear as an appealing alternative. A crucial step for automatic erosion identification is creating a good feature representation; towards this objective, deep learning can learn data-driven features and classifiers. In this paper, we propose a novel deep learning-based framework capable of performing erosion identification in railway lines. Six techniques were evaluated, and the best one, Dynamic Dilated ConvNet, was integrated into this framework, which was then encapsulated into a new ArcGIS plugin to facilitate its use by non-programmers. To analyze these techniques, we also propose a new dataset composed of almost 2000 high-resolution images.
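The dilation underlying the Dynamic Dilated ConvNet named above can be illustrated in one dimension: inserting gaps between kernel taps enlarges the receptive field without adding weights. This minimal sketch shows only the basic operation, not the paper's architecture.

```python
import numpy as np

def dilated_conv1d(x, kernel, dilation=1):
    """Valid 1-D convolution with gaps of `dilation - 1` samples
    between kernel taps; the receptive field grows from k to
    (k - 1) * dilation + 1 without adding weights."""
    k = len(kernel)
    span = (k - 1) * dilation + 1        # effective receptive field
    out = np.empty(len(x) - span + 1)
    for i in range(len(out)):
        out[i] = sum(kernel[j] * x[i + j * dilation] for j in range(k))
    return out

x = np.arange(10, dtype=float)
k = np.array([1.0, 1.0, 1.0])
print(dilated_conv1d(x, k, 1))   # sums of 3 consecutive samples
print(dilated_conv1d(x, k, 2))   # sums of samples i, i+2, i+4
```

Stacking such layers with growing dilation rates lets a network see large image regions cheaply, which is useful when erosion patterns span wide areas.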


2020
Vol 27 (3)
pp. 373-389
Author(s):
Ashesh Chattopadhyay
Pedram Hassanzadeh
Devika Subramanian

Abstract. In this paper, the performance of three machine-learning methods for predicting the short-term evolution and reproducing the long-term statistics of a multiscale spatiotemporal Lorenz 96 system is examined. The methods are an echo state network (ESN, a type of reservoir computing; hereafter RC–ESN), a deep feed-forward artificial neural network (ANN), and a recurrent neural network with long short-term memory (LSTM; hereafter RNN–LSTM). This Lorenz 96 system has three tiers of nonlinearly interacting variables representing slow/large-scale (X), intermediate (Y), and fast/small-scale (Z) processes. For training and testing, only X is available; Y and Z are never known or used. We show that RC–ESN substantially outperforms ANN and RNN–LSTM for short-term prediction, e.g., accurately forecasting the chaotic trajectories for hundreds of the numerical solver's time steps, equivalent to several Lyapunov timescales. RNN–LSTM outperforms ANN, and both methods show some prediction skill as well. Furthermore, even after losing the trajectory, data predicted by RC–ESN and RNN–LSTM have probability density functions (pdfs) that closely match the true pdf, even at the tails. The pdf of the data predicted using ANN, however, deviates from the true pdf. Implications, caveats, and applications to data-driven and data-assisted surrogate modeling of complex nonlinear dynamical systems, such as weather and climate, are discussed.
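A minimal echo state network of the kind evaluated above (RC–ESN) can be sketched as follows: a fixed random reservoir is driven by the input, and only a linear readout is trained, via ridge regression. The toy target here is a sine wave rather than the Lorenz 96 system, and the reservoir size, spectral radius, and regularization are illustrative assumptions.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy echo state network: reservoir weights W and input weights W_in
# stay fixed and random; only the linear readout W_out is trained.
n_res, rho = 200, 0.9
W = rng.normal(size=(n_res, n_res))
W *= rho / np.max(np.abs(np.linalg.eigvals(W)))    # set spectral radius
W_in = rng.uniform(-0.5, 0.5, size=n_res)

u = np.sin(0.3 * np.arange(1200))                  # scalar driving signal
states = np.zeros((len(u), n_res))
r = np.zeros(n_res)
for t in range(len(u) - 1):
    r = np.tanh(W @ r + W_in * u[t])               # reservoir update
    states[t + 1] = r                              # state after seeing u[:t+1]

# Ridge-regression readout: predict u one step ahead from the state.
X, y = states[200:1000], u[200:1000]               # drop initial transient
beta = 1e-6
W_out = np.linalg.solve(X.T @ X + beta * np.eye(n_res), X.T @ y)

pred = states[1000:] @ W_out                       # held-out one-step forecasts
err = np.abs(pred - u[1000:]).mean()
```

Because training reduces to one linear solve, RC–ESNs are far cheaper to fit than backpropagated RNNs, which is part of their appeal for chaotic-system surrogates.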


2021
Vol 42 (12)
pp. 124101
Author(s):
Thomas Hirtz
Steyn Huurman
He Tian
Yi Yang
Tian-Ling Ren

Abstract In a world where data is increasingly important for making breakthroughs, microelectronics is a field where data is sparse and hard to acquire. Only a few entities have the infrastructure required to automate the fabrication and testing of semiconductor devices, infrastructure that is crucial for generating enough data to exploit new information technologies. This situation creates a divide between most researchers and the industry. To address this issue, this paper introduces a widely applicable approach for creating custom datasets using simulation tools and parallel computing. The multi-I–V curves we obtained were processed simultaneously using convolutional neural networks, which gave us the ability to predict a full set of device characteristics with a single inference. We demonstrate the potential of this approach through two concrete examples of useful deep learning models trained on the generated data. We believe this work can act as a bridge between state-of-the-art data-driven methods and more classical semiconductor research, such as device engineering, yield engineering, or process monitoring. Moreover, this research gives anybody the opportunity to start experimenting with deep neural networks and machine learning in the field of microelectronics, without the need for expensive experimentation infrastructure.
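The dataset-generation idea described above can be sketched with a much simpler stand-in for a device simulator: sample device parameters, then compute I–V curves from the ideal-diode (Shockley) equation. The parameter ranges and bias sweep below are illustrative assumptions, not the paper's simulation setup.

```python
import numpy as np

rng = np.random.default_rng(2)

# Simulated I-V dataset: sample device parameters and compute the
# current with the ideal-diode (Shockley) equation as a lightweight
# stand-in for a full TCAD device simulation.
q_over_kT = 1 / 0.02585                        # 1/V at room temperature
V = np.linspace(0.0, 0.6, 50)                  # bias sweep

n_devices = 1000
I_s = 10 ** rng.uniform(-14, -10, n_devices)   # saturation current (A)
n_ideal = rng.uniform(1.0, 2.0, n_devices)     # ideality factor

# Each row is one device's I-V curve; the sampled (I_s, n) pair is
# the label a CNN would be trained to recover from the curve.
curves = I_s[:, None] * (np.exp(q_over_kT * V[None, :] / n_ideal[:, None]) - 1)
labels = np.column_stack([np.log10(I_s), n_ideal])
# curves.shape == (1000, 50); labels.shape == (1000, 2)
```

Because the parameter sampling runs are independent, generating such curves parallelizes trivially, which is the role parallel computing plays in the paper's pipeline.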


2019
Vol 20 (1)
Author(s):
David F. Nettleton
Dimitrios Katsantonis
Argyris Kalaitzidis
Natasa Sarafijanovic-Djukic
Pau Puigdollers
...

Abstract Background In this study, we compared four models for predicting rice blast disease: two operational process-based models (Yoshino and the Water Accounting Rice Model (WARM)) and two approaches based on machine learning algorithms (M5Rules and recurrent neural networks (RNN)), the former inducing a rule-based model and the latter building a neural network. In situ telemetry is important for obtaining quality in-field data for predictive models, and this was a key aspect of the RICE-GUARD project on which this study is based. According to the authors, this is the first time process-based and machine learning modelling approaches for supporting plant disease management have been compared. Results The results clearly showed that the models succeeded in providing a warning of rice blast onset and presence, thus representing suitable solutions for preventive remedial actions targeting the mitigation of yield losses and the reduction of fungicide use. All methods gave significant "signals" during the "early warning" period, with a similar level of performance. M5Rules and WARM gave the maximum average normalized scores of 0.80 and 0.77, respectively, whereas Yoshino gave the best score for one site (Kalochori 2015). The best average values of r, r2, and %MAE (mean absolute error) for the machine learning models were 0.70, 0.50, and 0.75, respectively; for the process-based models the corresponding values were 0.59, 0.40, and 0.82. The ML models are thus competitive with the process-based models. This result has relevant implications for the operational use of the models, since most available studies are limited to analysing the relationship between model outputs and the incidence of rice blast. The results also showed that machine learning methods approximated the performance of two process-based models that have been used for years in operational contexts.
Conclusions Process-based and data-driven models can both be used to provide early warnings that anticipate rice blast and detect its presence, thus supporting fungicide applications. Data-driven models derived from machine learning methods are a viable alternative to process-based approaches and, in cases where training datasets are available, offer potentially greater adaptability to new contexts.
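The metrics reported above (r, r2, and MAE) can be computed as in the sketch below; since the abstract does not specify how %MAE is normalized, plain MAE is shown. The observed and predicted values are toy numbers, not the study's data.

```python
import numpy as np

def eval_metrics(y_true, y_pred):
    """Pearson correlation r, coefficient of determination r2, and
    mean absolute error: the kinds of scores the study reports."""
    r = np.corrcoef(y_true, y_pred)[0, 1]
    ss_res = np.sum((y_true - y_pred) ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    r2 = 1 - ss_res / ss_tot
    mae = np.abs(y_true - y_pred).mean()
    return r, r2, mae

obs = np.array([0.1, 0.4, 0.35, 0.8, 0.6])    # observed incidence (toy)
pred = np.array([0.2, 0.3, 0.4, 0.7, 0.65])   # model output (toy)
r, r2, mae = eval_metrics(obs, pred)
```

Note that r measures linear association only, while r2 additionally penalizes bias and scale errors, which is why the study reports both alongside the absolute-error measure.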

