Investigating the Impact of Data Analysis and Classification on Parametric and Nonparametric Machine Learning Techniques: A Proof of Concept

Abstract. Rip currents and other surf hazards are an emerging public health issue globally. Lifeguards, warning flags, and signs are important, and to varying degrees they are effective strategies to minimize risk to beach users. In the United States and other jurisdictions around the world, lifeguards use coloured flags (green, yellow, and red) to indicate whether the danger posed by the surf and rip hazard is low, moderate, or high respectively. The choice of flag depends on the lifeguard(s) monitoring the changing surf conditions along the beach and over the course of the day using both regional surf forecasts and careful observation. There is a potential that the chosen flag is not consistent with the beach user perception of the risk, which may increase the potential for rescues or drownings. In this study, machine learning is used to determine the potential for error in the flags used at Pensacola Beach and the impact of that error on the number of rescues. Results of a decision tree analysis indicate that the colour flag chosen by the lifeguards was different from what the model predicted for 35 % of days between 2004 and 2008 (n=396/1125). Days when there is a difference between the predicted and posted flag colour represent only 17 % of all rescue days, but those days are associated with ∼60 % of all rescues between 2004 and 2008. Further analysis reveals that the largest number of rescue days and total number of rescues are associated with days where the flag deployed over-estimated the surf and hazard risk, such as a red or yellow flag flying when the model predicted a green flag would be more appropriate based on the wind and wave forcing alone. While it is possible that the lifeguards were overly cautious, it is argued that they most likely identified a rip forced by a transverse-bar and rip morphology common at the study site. Regardless, the results suggest that beach users may be discounting lifeguard warnings if the flag colour is not consistent with how they perceive the surf hazard or the regional forecast. Results suggest that machine learning techniques have the potential to support lifeguards and thereby reduce the number of rescues and drownings.

Download Full-text

Monitoring the Impact of Air Quality on the COVID-19 Fatalities in Delhi, India: Using Machine Learning Techniques

Disaster Medicine and Public Health Preparedness ◽

10.1017/dmp.2020.372 ◽

2020 ◽

pp. 1-8

Author(s):

Jasleen Kaur Sethi ◽

Mamta Mittal

Keyword(s):

Machine Learning ◽

Air Quality ◽

Air Pollutants ◽

Machine Learning Techniques ◽

Environmental Restoration ◽

The Novel ◽

Ozone Pollution ◽

Learning Techniques ◽

Novel Coronavirus ◽

The Impact

ABSTRACT Objective: The focus of this study is to monitor the effect of lockdown on the various air pollutants due to the coronavirus disease (COVID-19) pandemic and identify the ones that affect COVID-19 fatalities so that measures to control the pollution could be enforced. Methods: Various machine learning techniques: Decision Trees, Linear Regression, and Random Forest have been applied to correlate air pollutants and COVID-19 fatalities in Delhi. Furthermore, a comparison between the concentration of various air pollutants and the air quality index during the lockdown period and last two years, 2018 and 2019, has been presented. Results: From the experimental work, it has been observed that the pollutants ozone and toluene have increased during the lockdown period. It has also been deduced that the pollutants that may impact the mortalities due to COVID-19 are ozone, NH3, NO2, and PM10. Conclusions: The novel coronavirus has led to environmental restoration due to lockdown. However, there is a need to impose measures to control ozone pollution, as there has been a significant increase in its concentration and it also impacts the COVID-19 mortality rate.

Download Full-text

Analyzing the impact of red-edge band on land use land cover classification using multispectral RapidEye imagery and machine learning techniques

Journal of Applied Remote Sensing ◽

10.1117/1.jrs.13.044511 ◽

2019 ◽

Vol 13 (04) ◽

pp. 1

Author(s):

Rashmi Saini ◽

Sanjay K. Ghosh

Keyword(s):

Machine Learning ◽

Land Use ◽

Land Cover ◽

Land Cover Classification ◽

Machine Learning Techniques ◽

Land Use Land Cover ◽

Red Edge ◽

Learning Techniques ◽

The Impact ◽

Edge Band

Download Full-text

Genome data analysis, protein function and structure prediction by machine learning techniques

10.32469/10355/57021 ◽

2016 ◽

Author(s):

Renzhi Cao

Keyword(s):

Machine Learning ◽

Data Analysis ◽

Protein Function ◽

Structure Prediction ◽

Machine Learning Techniques ◽

Genome Data ◽

Learning Techniques ◽

Genome Data Analysis

Download Full-text

Energy Meter Data Analysis Using Machine Learning Techniques

International Journal on Recent and Innovation Trends in Computing and Communication ◽

10.17762/ijritcc.v8i6.5409 ◽

2020 ◽

Vol 8 (6) ◽

pp. 08-13

Author(s):

Anitha Kumari K ◽

Indusha M ◽

Abarna Devi D ◽

Dheva Dharshini S

Keyword(s):

Machine Learning ◽

Data Analysis ◽

Use Cases ◽

Machine Learning Techniques ◽

Energy Usage ◽

Energy Meter ◽

Learning Techniques ◽

Significant Interest ◽

F Measure

With the advancement of technology, existence of energy meters are not merely to measure energy units. The proliferation of energy meter deployments had led to significant interest in analyzing the energy usage by the machines. Energy meter data is often difficult to analyzeowing to the aggregation of many disparate and complex loads. At utility scales, analysis is further complicated by the vast quantity of data and hence industries turn towards applying machine learning techniques for monitoring and measuring loads of the machines. The energy meter data analysis aims at analyzing the behavior of the machine and providing insights on usage of the energy. This will help the industries to identify the faults in the machine and to rectify it.Two use cases with two different motor specifications is considered for evaluation and the efficiency is proved by considering accuracy, precision, F-measure and recall as metrics.

Download Full-text

Iterative machine learning applied to annotation of text datasets

10.5753/eniac.2021.18268 ◽

2021 ◽

Author(s):

Thiago Abdo ◽

Fabiano Silva

Keyword(s):

Machine Learning ◽

Deep Learning ◽

Learning Algorithms ◽

Computational Cost ◽

Machine Learning Techniques ◽

Learning Approaches ◽

Learning Techniques ◽

The Creation ◽

The Impact ◽

High Computational Cost

The purpose of this paper is to analyze the use of different machine learning approaches and algorithms to be integrated as an automated assistance on a tool to aid the creation of new annotated datasets. We evaluate how they scale in an environment without dedicated machine learning hardware. In particular, we study the impact over a dataset with few examples and one that is being constructed. We experiment using deep learning algorithms (Bert) and classical learning algorithms with a lower computational cost (W2V and Glove combined with RF and SVM). Our experiments show that deep learning algorithms have a performance advantage over classical techniques. However, deep learning algorithms have a high computational cost, making them inadequate to an environment with reduced hardware resources. Simulations using Active and Iterative machine learning techniques to assist the creation of new datasets are conducted. For these simulations, we use the classical learning algorithms because of their computational cost. The knowledge gathered with our experimental evaluation aims to support the creation of a tool for building new text datasets.

Download Full-text

Machine learning approach for simulation of heavy metal concentration in river water: the Crimean peninsula case study

E3S Web of Conferences ◽

10.1051/e3sconf/202016306009 ◽

2020 ◽

Vol 163 ◽

pp. 06009

Author(s):

Evgeniy Malygin ◽

Mikhail Lychagin

Keyword(s):

Machine Learning ◽

Heavy Metal ◽

Metal Concentration ◽

Heavy Metal Concentration ◽

Machine Learning Techniques ◽

Crimean Peninsula ◽

River Waters ◽

Learning Techniques ◽

Machine Learning Approach ◽

The Impact

This study proposes an approach for simulation of heavy metal concentration in river waters using machine learning techniques. A regression model was built and it captured the relationship between the concentration of heavy metal and metalloids (HMM) and several characteristics of studied catchment. Machine learning techniques allowed to simulate the annual concentration variability of HMM. This approach allows exploring the impact of different factors on studied processes.

Download Full-text