Development of a Probabilistic Subfreezing Road Temperature Nowcast and Forecast Using Machine Learning

ABSTRACT In this study, a machine learning algorithm for generating a gridded CONUS-wide probabilistic road temperature forecast is presented. A random forest is used to tie a combination of HRRR model surface variables and information about the geographic location and time of day and year to observed road temperatures. This approach differs from its predecessors in that the road temperature forecast is not deterministic (i.e., a forecast of a specific road temperature) but probabilistic, providing a 0%–100% probability that the road temperature is subfreezing. This approach can account for the varying controls on road temperature that are not easily known or able to be accounted for in physical models, such as amount of traffic, road composition, and differential shading by surrounding buildings and terrain. The algorithm is trained using road temperature observations from one winter season (October 2016–March 2017) and calibrated and evaluated using observations from the following winter season (October 2017–March 2018). Case-study analyses show that the algorithm performs well for various scenarios and reliably captures the temporal and spatial evolution of the probability of subfreezing roads. Statistical evaluation of the predicted probabilities shows good skill: the mean area under the receiver operating characteristics curve is 0.96 and the Brier skill score is 0.66 for a 2-h forecast, and skill degrades only slightly as lead time increases. Additionally, the algorithm produces well-calibrated probabilities and consistent discrimination between clearly above-freezing and subfreezing environments.
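The Brier skill score quoted in the abstract can be illustrated with a minimal sketch. It assumes the reference forecast is a constant probability equal to the observed base rate; the abstract does not say which reference the authors used. The function names and the sample values are hypothetical.

```python
def brier_score(probs, outcomes):
    """Mean squared difference between forecast probabilities and 0/1 outcomes."""
    return sum((p - o) ** 2 for p, o in zip(probs, outcomes)) / len(probs)


def brier_skill_score(probs, outcomes):
    """Skill relative to a constant forecast of the observed base rate (an assumption)."""
    base_rate = sum(outcomes) / len(outcomes)
    reference = brier_score([base_rate] * len(outcomes), outcomes)
    return 1.0 - brier_score(probs, outcomes) / reference


# Made-up forecasts: probability that the road is subfreezing, and what was observed.
probs = [0.9, 0.8, 0.2, 0.1]
outcomes = [1, 1, 0, 0]
bss = brier_skill_score(probs, outcomes)
```

A score of 1 is a perfect forecast, 0 matches the reference, and negative values are worse than the reference.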


Using deep learning to nowcast the spatial coverage of convection from Himawari-8 satellite data

Monthly Weather Review ◽

10.1175/mwr-d-21-0096.1 ◽

2021 ◽

Author(s):

Ryan Lagerquist ◽

Jebb Q. Stewart ◽

Imme Ebert-Uphoff ◽

Christina Kumler

Keyword(s):

Deep Learning ◽

Satellite Data ◽

Geographic Location ◽

Ground Truth ◽

Atmospheric Science ◽

Skill Score ◽

Time Of Day ◽

Lead Times ◽

Weather Radars ◽

Spatial Coverage

Abstract Predicting the timing and location of thunderstorms (“convection”) allows for preventive actions that can save both lives and property. We have applied U-nets, a deep-learning-based type of neural network, to forecast convection on a grid at lead times up to 120 minutes. The goal is to make skillful forecasts with only present and past satellite data as predictors. Specifically, predictors are multispectral brightness-temperature images from the Himawari-8 satellite, while targets (ground truth) are provided by weather radars in Taiwan. U-nets are becoming popular in atmospheric science due to their advantages for gridded prediction. Furthermore, we use three novel approaches to advance U-nets in atmospheric science. First, we compare three architectures – vanilla, temporal, and U-net++ – and find that vanilla U-nets are best for this task. Second, we train U-nets with the fractions skill score, which is spatially aware, as the loss function. Third, because we do not have adequate ground truth over the full Himawari-8 domain, we train the U-nets with small radar-centered patches, then apply trained U-nets to the full domain. Also, we find that the best predictions are given by U-nets trained with satellite data from multiple lag times, not only the present. We evaluate U-nets in detail – by time of day, month, and geographic location – and compare to persistence models. The U-nets outperform persistence at lead times ≥ 60 minutes, and at all lead times the U-nets provide a more realistic climatology than persistence. Our code is available publicly.
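To show why the fractions skill score (FSS) is spatially aware, here is a minimal sketch of the score on small 2-D grids. It is not the authors' loss implementation: the function names, the square neighborhood, and the clipping of neighborhoods at grid edges are all assumptions made for illustration.

```python
def neighborhood_fractions(grid, half_width):
    """Mean of each cell's square neighborhood, clipped at the grid edges."""
    rows, cols = len(grid), len(grid[0])
    out = [[0.0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            cells = [
                grid[a][b]
                for a in range(max(0, i - half_width), min(rows, i + half_width + 1))
                for b in range(max(0, j - half_width), min(cols, j + half_width + 1))
            ]
            out[i][j] = sum(cells) / len(cells)
    return out


def fractions_skill_score(forecast, observed, half_width):
    """FSS = 1 - sum((F - O)^2) / sum(F^2 + O^2) over neighborhood fractions."""
    f = neighborhood_fractions(forecast, half_width)
    o = neighborhood_fractions(observed, half_width)
    pairs = [(fv, ov) for frow, orow in zip(f, o) for fv, ov in zip(frow, orow)]
    mse = sum((fv - ov) ** 2 for fv, ov in pairs)
    ref = sum(fv ** 2 + ov ** 2 for fv, ov in pairs)
    return 1.0 - mse / ref if ref > 0 else float("nan")


# A forecast displaced by one grid cell from the observation.
forecast = [[1, 0], [0, 0]]
observed = [[0, 1], [0, 0]]
fss_pointwise = fractions_skill_score(forecast, observed, half_width=0)
fss_neighborhood = fractions_skill_score(forecast, observed, half_width=1)
```

With no neighborhood the displaced forecast gets no credit, while a neighborhood wide enough to cover the displacement scores it as a match. A quantity such as 1 - FSS could serve as a loss to minimize, although the abstract does not give that detail.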


A Novel Machine Learning Sepsis Prediction Algorithm for Intended ICU Use (NAVOY Sepsis): A Proof-of-Concept Study (Preprint)

10.2196/preprints.28000 ◽

2021 ◽

Author(s):

Inger Persson ◽

Andreas Östling ◽

Martin Arlbrandt ◽

Joakim Söderberg ◽

David Becedas

Keyword(s):

Machine Learning ◽

High Performance ◽

Learning Algorithm ◽

Scoring Systems ◽

High Accuracy ◽

Prediction Algorithm ◽

Massachusetts Institute Of Technology ◽

Operating Characteristics ◽

Mortality And Morbidity ◽

Institute Of Technology

BACKGROUND: Despite decades of research, sepsis remains a leading cause of mortality and morbidity in ICUs worldwide. The key to effective management and good patient outcomes is early detection, yet no prospectively validated machine learning prediction algorithm is available for clinical use in Europe today. OBJECTIVE: To develop a high-performance machine learning sepsis prediction algorithm based on routinely collected ICU data, designed to be implemented in Europe. METHODS: The machine learning algorithm is developed using a convolutional neural network, based on the Massachusetts Institute of Technology Lab for Computational Physiology MIMIC-III Clinical Database, focusing on ICU patients aged 18 years or older. Twenty variables are used for prediction, on an hourly basis. Onset of sepsis is defined in accordance with the international Sepsis-3 criteria. RESULTS: The developed algorithm, NAVOY Sepsis, uses 4 hours of input and can with high accuracy predict which patients are at high risk of developing sepsis in the coming hours. The prediction performance is superior to that of existing sepsis early warning scoring systems, and competes well with previously published prediction algorithms designed to predict sepsis onset in accordance with the Sepsis-3 criteria, as measured by the area under the receiver operating characteristics curve (AUROC) and the area under the precision-recall curve (AUPRC). NAVOY Sepsis yields AUROC = 0.90 and AUPRC = 0.62 for predictions up to 3 hours before sepsis onset. The predictive performance is externally validated on hold-out test data, where NAVOY Sepsis is confirmed to predict sepsis with high accuracy. CONCLUSIONS: An algorithm with excellent predictive properties has been developed, based on variables routinely collected at ICUs. This algorithm is to be further validated in an ongoing prospective randomized clinical trial and will be CE marked as Software as a Medical Device, designed for commercial use in European ICUs.
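The AUROC reported above can be read as the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case. The sketch below computes it that way by comparing all positive-negative pairs. The function name and the sample scores are hypothetical and unrelated to the NAVOY Sepsis data.

```python
def auroc(scores, labels):
    """Fraction of positive/negative pairs ranked correctly (ties count as half)."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Made-up risk scores and outcomes (1 = event occurred, 0 = it did not).
scores = [0.9, 0.7, 0.4, 0.2]
labels = [1, 0, 1, 0]
area = auroc(scores, labels)
```

The pairwise form is quadratic in the number of cases, so it suits small illustrations only; rank-based formulas give the same value more efficiently.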


Risk Prediction for Winter Road Accidents on Expressways

Applied Sciences ◽

10.3390/app11209534 ◽

2021 ◽

Vol 11 (20) ◽

pp. 9534

Author(s):

Daeseong Kim ◽

Sangyun Jung ◽

Sanghoo Yoon

Keyword(s):

Machine Learning ◽

Random Forest ◽

Winter Season ◽

Weather Conditions ◽

Road Accident ◽

Weather Data ◽

Road Accidents ◽

Weather Factors ◽

The Road ◽

Road Geometry

Road accidents caused by weather conditions in winter lead to higher mortality rates than in other seasons. The main causes of road accidents include human carelessness, vehicle defects, road conditions, and weather factors. If the risk of road accidents associated with changes in road weather conditions can be quantitatively evaluated, it will contribute to reducing road accident fatalities. The road accident data used in this study were obtained for the period 2017 to 2019. Spatial interpolation was used to estimate the weather information; geographic information system (GIS) and Shuttle Radar Topography Mission (SRTM) data were used to identify road geometry and accident area altitude; the synthetic minority oversampling technique (SMOTE) was used to address the data imbalance between road accidents due to weather conditions and those from other causes; and finally, machine learning was performed on the data using various models such as random forest, XGBoost, neural network, and logistic regression. The training-to-test data ratio was 7:3. The random forest model exhibited the best classification performance for road accident status according to weather risks. Thus, by applying weather data and road geometry to machine learning models, the risk of road accidents due to weather conditions in the winter season can be predicted and provided as a service.
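The idea behind SMOTE, as mentioned in the abstract, is to create synthetic minority-class samples by interpolating between a minority sample and one of its nearest minority neighbors. The sketch below is a simplified standard-library illustration of that idea, not the implementation used in the study. The function name, parameters, and sample points are hypothetical.

```python
import random


def smote_like(minority, n_new, k=2, seed=0):
    """Create n_new synthetic points by interpolating between minority neighbors."""
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbors of the base point, excluding the point itself
        neighbors = sorted(
            (p for p in minority if p is not base),
            key=lambda p: sum((a - b) ** 2 for a, b in zip(p, base)),
        )[:k]
        other = rng.choice(neighbors)
        gap = rng.random()  # position along the segment from base to other
        synthetic.append(tuple(a + gap * (b - a) for a, b in zip(base, other)))
    return synthetic


# Made-up two-feature minority samples (for example, scaled weather variables).
minority = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
new_points = smote_like(minority, n_new=6)
```

Oversampling of this kind is normally applied to the training portion only, so that the test portion of a 7:3 split still reflects the real class balance.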


Computer Network Attack Detection Using Enhanced Clustering Technologies

Asian Journal of Applied Sciences ◽

10.24203/ajas.v9i6.6839 ◽

2022 ◽

Vol 9 (6) ◽

Author(s):

Dhamyaa Salim Mutar

Keyword(s):

Machine Learning ◽

Computer Network ◽

Learning Algorithm ◽

Attack Detection ◽

Learning Technology ◽

Feed Forward Neural Network ◽

Network Attack ◽

The Road ◽

Network Intrusions ◽

Smart Machine

The need for security measures stems from the importance of data privacy, especially after the recent communication revolution. Advances in data mining and machine learning have paved the way for an efficient attack prediction paradigm to protect large-scale networks. In this project, computer network intrusions are addressed using a smart machine learning algorithm. Using a large dataset, the KDD computer intrusion dataset, which includes a large number of connections labeled with several types of attacks, a model is built to predict the type of attack by learning from these data. The feed-forward neural network model outperformed the other proposed clustering models in attack prediction accuracy.


Novel Machine Learning Approaches for Modelling the Gully Erosion Susceptibility

Remote Sensing ◽

10.3390/rs12172833 ◽

2020 ◽

Vol 12 (17) ◽

pp. 2833 ◽

Cited By ~ 3

Author(s):

Alireza Arabameri ◽

Omid Asadi Nalivan ◽

Subodh Chandra Pal ◽

Rabin Chakrabortty ◽

Asish Saha ◽

...

Keyword(s):

Machine Learning ◽

Water Conservation ◽

Learning Algorithm ◽

Gully Erosion ◽

Support Vector ◽

Learning Approaches ◽

Operating Characteristics ◽

Validation Data ◽

Data Set ◽

Jackknife Test

The extreme form of land degradation caused by the formation of gullies is a major challenge for the sustainability of land resources. The problem is more severe in arid and semi-arid environments, where it damages agriculture and allied economic activities. Appropriate modeling of such erosion with optimum accuracy is therefore needed for identifying vulnerable regions and taking appropriate initiatives. The Golestan Dam has faced an acute problem of gully erosion over the last decade, which has adversely affected society. Here, the artificial neural network (ANN), general linear model (GLM), maximum entropy (MaxEnt), and support vector machine (SVM) machine learning algorithms, with 90/10, 80/20, 70/30, 60/40, and 50/50 random partitioning of training and validation samples, were purposively selected for estimating the gully erosion susceptibility. The main objective of this work was to predict the susceptible zone with the maximum possible accuracy. To this end, random partitioning approaches were implemented, and 20 gully erosion conditioning factors were considered for predicting the susceptible areas after a multi-collinearity test. The variance inflation factor (VIF) and tolerance (TOL) limits were used for the multi-collinearity assessment to reduce the error of the models and increase the efficiency of the outcome. The ANN with 50/50 random partitioning of the sample is the optimal model in this analysis. The area under the curve (AUC) values of the receiver operating characteristics (ROC) in ANN (50/50) for the training and validation data are 0.918 and 0.868, respectively. The importance of the causative factors was estimated with the help of the Jackknife test, which reveals that the most important factor is the topographic position index (TPI). Apart from this, the prioritization of all predicted models was estimated taking into account the training and validation data sets, which should help future researchers to select models from this perspective. This type of outcome should help planners and local stakeholders to implement appropriate land and water conservation measures.
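The multi-collinearity check described above can be sketched as follows. The sketch relies on the standard identity that the VIF of predictor j equals the j-th diagonal entry of the inverse of the predictors' correlation matrix, and that TOL is the reciprocal of VIF. All function names and sample columns are hypothetical; the abstract does not state how the authors computed these values.

```python
def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sum((a - mx) ** 2 for a in x) ** 0.5
    sy = sum((b - my) ** 2 for b in y) ** 0.5
    return cov / (sx * sy)


def invert(matrix):
    """Gauss-Jordan inversion with partial pivoting (small matrices only)."""
    n = len(matrix)
    aug = [list(row) + [float(i == j) for j in range(n)] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        scale = aug[col][col]
        aug[col] = [v / scale for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [v - factor * p for v, p in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def vif_and_tolerance(columns):
    """columns: list of predictor columns. Returns (VIF list, TOL list)."""
    corr = [[pearson(a, b) for b in columns] for a in columns]
    inv = invert(corr)
    vif = [inv[j][j] for j in range(len(columns))]
    return vif, [1.0 / v for v in vif]


# Made-up conditioning factors: the first two are uncorrelated by construction.
factor_a = [1.0, 2.0, 3.0, 4.0]
factor_b = [1.0, -1.0, -1.0, 1.0]
vif, tol = vif_and_tolerance([factor_a, factor_b])
```

Uncorrelated factors give VIF close to 1, and VIF grows as a factor becomes predictable from the others. Factors above a chosen VIF limit would be candidates for removal before model fitting.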


Comparison of machine and deep learning for the classification of cervical cancer based on cervicography images

Scientific Reports ◽

10.1038/s41598-021-95748-3 ◽

2021 ◽

Vol 11 (1) ◽

Author(s):

Ye Rang Park ◽

Young Jae Kim ◽

Woong Ju ◽

Kyehyun Nam ◽

Soonyung Kim ◽

...

Keyword(s):

Machine Learning ◽

Cervical Cancer ◽

Deep Learning ◽

Learning Algorithm ◽

Vaginal Wall ◽

Learning Models ◽

Operating Characteristics ◽

Deep Learning Algorithm ◽

Machine Learning Models

Abstract Cervical cancer is the second most common cancer in women worldwide with a mortality rate of 60%. Cervical cancer begins with no overt signs and has a long latent period, making early detection through regular checkups vitally important. In this study, we compare the performance of two types of models, machine learning and deep learning, for the purpose of identifying signs of cervical cancer using cervicography images. Using the deep learning model ResNet-50 and the machine learning models XGB, SVM, and RF, we classified 4119 cervicography images as positive or negative for cervical cancer using square images in which the vaginal wall regions were removed. The machine learning models extracted 10 major features from a total of 300 features. All tests were validated by fivefold cross-validation, and receiver operating characteristics (ROC) analysis yielded the following AUCs: ResNet-50 0.97 (CI 95% 0.949–0.976), XGB 0.82 (CI 95% 0.797–0.851), SVM 0.84 (CI 95% 0.801–0.854), RF 0.79 (CI 95% 0.804–0.856). The ResNet-50 model showed a 0.15 point improvement (p < 0.05) over the average (0.82) of the three machine learning methods. Our data suggest that the ResNet-50 deep learning algorithm could offer greater performance than current machine learning models for the purpose of identifying cervical cancer using cervicography images.


Machine Learning Algorithm to Predict Early Complications after Brain Tumor Surgery

10.1055/s-0038-1660728 ◽

2018 ◽

Author(s):

C.H.B. van Niftrik ◽

F. van der Wouden ◽

V. Staartjes ◽

J. Fierstra ◽

M. Stienen ◽

...

Keyword(s):

Machine Learning ◽

Brain Tumor ◽

Learning Algorithm ◽

Machine Learning Algorithm ◽

Tumor Surgery ◽

Early Complications ◽

Brain Tumor Surgery


Design of English text-to-speech conversion algorithm based on machine learning

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189238 ◽

2020 ◽

pp. 1-12

Author(s):

Li Dongmei

Keyword(s):

Machine Learning ◽

Speech Synthesis ◽

Feature Recognition ◽

Learning Algorithm ◽

Morphological Structure ◽

English Text ◽

Text To Speech ◽

Part Of Speech ◽

Modern Computer ◽

Conversion Algorithm

English text-to-speech conversion is a key topic in modern computer technology research. Its difficulty is that feature recognition in the text-to-speech conversion process produces large errors, which makes it hard to apply English text-to-speech conversion algorithms in practical systems. To improve the efficiency of English text-to-speech conversion, this article uses machine learning: after the original voice waveform is labeled with the pitch, the rhythm is modified through PSOLA, and the C4.5 algorithm is used to train a decision tree for judging the pronunciation of polyphones. To evaluate the performance of the pronunciation discrimination method based on part-of-speech rules and of HMM-based prosody hierarchy prediction in speech synthesis systems, this study constructed a system model. In addition, the waveform stitching method and PSOLA are used to synthesize the sound. For words whose main stress cannot be discriminated by morphological structure, labels can be learned by machine learning methods. Finally, this study evaluates and analyzes the performance of the algorithm through control experiments. The results show that the proposed algorithm performs well and has a certain practical effect.


Intelligent system of English composition scoring model based on improved machine learning algorithm

Journal of Intelligent & Fuzzy Systems ◽

10.3233/jifs-189235 ◽

2020 ◽

pp. 1-11

Author(s):

Jie Liu ◽

Lin Lin ◽

Xiufang Liang

Keyword(s):

Machine Learning ◽

Evaluation System ◽

Intelligent System ◽

Learning Algorithm ◽

Learning Algorithms ◽

Machine Learning Algorithms ◽

Assessment System ◽

English Composition ◽

Region Extraction ◽

Constraint Model

The online English teaching system places certain requirements on the intelligent scoring system, and the most difficult stage of intelligent scoring in the English test is scoring the English composition with an intelligent model. To improve the intelligence of English composition scoring, this study combines machine learning algorithms with intelligent image recognition technology, and proposes an improved MSER-based character candidate region extraction algorithm and a convolutional neural network-based pseudo-character region filtering algorithm. In addition, to verify whether the proposed algorithm model meets the requirements of the group text, that is, to verify the feasibility of the algorithm, the performance of the model is analyzed through designed experiments. Moreover, the basic conditions for composition scoring are input into the model as a constraint model. The research results show that the proposed algorithm has a certain practical effect and can be applied to English assessment systems and to online homework evaluation systems.


Feature extraction and prediction of Dengue Outbreaks

International Journal of Scientific Research in Computer Science Engineering and Information Technology ◽

10.32628/cseit206544 ◽

2020 ◽

pp. 216-222

Author(s):

Kunal Parikh ◽

Tanvi Makadia ◽

Harshil Patel

Keyword(s):

Public Health ◽

Machine Learning ◽

Developing Countries ◽

Feature Extraction ◽

Predictive Analytics ◽

Learning Algorithm ◽

Machine Learning Algorithm ◽

Health Concerns ◽

The World ◽

Dengue Outbreaks

Dengue is unquestionably one of the biggest health concerns in India and in many other developing countries. Unfortunately, many people have lost their lives because of it. Every year, approximately 390 million dengue infections occur around the world, among which 500,000 people are seriously infected and 25,000 people die. Many factors can contribute to dengue, such as temperature, humidity, precipitation, and inadequate public health. In this paper, we propose a method to perform predictive analytics on a dengue dataset using k-nearest neighbors (KNN), a machine learning algorithm. This analysis would help in the prediction of future cases and could save many lives.
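As a minimal sketch of how KNN classification works, the example below labels a query point by majority vote among its k closest training points. The function name, the choice of features, and every data row are made up for illustration and do not come from the paper's dataset.

```python
from collections import Counter


def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k training points closest to query (Euclidean)."""
    by_distance = sorted(
        zip(train_x, train_y),
        key=lambda pair: sum((a - b) ** 2 for a, b in zip(pair[0], query)),
    )
    votes = Counter(label for _, label in by_distance[:k])
    return votes.most_common(1)[0][0]


# Made-up rows: (temperature in degrees C, relative humidity in %) with an outbreak label.
train_x = [(31, 85), (30, 80), (29, 90), (18, 40), (20, 45), (17, 35)]
train_y = ["outbreak", "outbreak", "outbreak", "none", "none", "none"]
prediction = knn_predict(train_x, train_y, query=(28, 82))
```

Because KNN depends on distances, features measured on different scales are usually standardized first; that step is omitted here for brevity.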
