Drivers of fog and low stratus - a satellite-based evaluation with machine learning

Author(s):  
Eva Pauli ◽  
Hendrik Andersen ◽  
Jörg Bendix ◽  
Jan Cermak ◽  
Sebastian Egli

<p>In this study the distribution of fog and low stratus (FLS) relative to land cover and meteorological conditions in continental Europe is investigated using a state-of-the-art machine learning technique  and geostationary satellite data. While analyses of the spatial and temporal patterns of FLS exist, the relationships to land cover and meteorological conditions have not been studied explicitly, quantitatively and on a continental scale.<br>The machine learning model is built using daily means of a FLS dataset from Egli et al. (2017) as the predictand, and different land surface and meteorological parameters as predictors over continental Europe from 2006-2015. The application of a machine learning approach provides the ability to explicitly, synchronously and quantitatively link suspected determinants of FLS to its occurrence.<br>The model shows good performance with R² values ranging between 0.5 and 0.9, depending on model grid size, season and further settings such as the exclusion of low pressure values and subtraction of seasonality. It is thus able to adequately represent the dynamics that drive FLS development. Based on a systematic analysis of this model, the most important features for FLS prediction are mean surface pressure, wind speed, and FLS on the previous day. High mean surface pressure, high FLS cover on the previous day, low evapotranspiration, wind speed and land surface temperature lead to higher predicted FLS values.<br>Generally the results show that it is possible to predict FLS occurrence over continental Europe using meteorological as well as land surface parameters with good performance indicating the benefits of using machine learning in the analysis of non-linear, multivariate systems such as the land-atmosphere system. Further studies will integrate the machine learning model into a land surface based model grid and implement fog and low cloud properties as predictands.</p>

Author(s):  
R. Meenal ◽  
Prawin Angel Michael ◽  
D. Pamela ◽  
E. Rajasekaran

The complex numerical climate models pose a big challenge for scientists in weather predictions, especially for tropical system. This paper is focused on presenting the importance of weather prediction using machine learning (ML) technique. Recently many researchers recommended that the machine learning models can produce sensible weather predictions in spite of having no precise knowledge of atmospheric physics. In this work, global solar radiation (GSR) in MJ/m2/day and wind speed in m/s is predicted for Tamil Nadu, India using a random forest ML model. The random forest ML model is validated with measured wind and solar radiation data collected from IMD, Pune. The prediction results based on the random forest ML model are compared with statistical regression models and SVM ML model. Overall, random forest machine learning model has minimum error values of 0.750 MSE and R2 score of 0.97. Compared to regression models and SVM ML model, the prediction results of random forest ML model are more accurate. Thus, this study neglects the need for an expensive measuring instrument in all potential locations to acquire the solar radiation and wind speed data.


2021 ◽  
Author(s):  
Mike Optis ◽  
Nicola Bodini ◽  
Mithu Debnath ◽  
Paula Doubrawa

Abstract. Accurate characterization of the offshore wind resource has been hindered by a sparsity of wind speed observations that span offshore wind turbine rotor-swept heights. Although public availability of floating lidar data is increasing, most offshore wind speed observations continue to come from buoy-based and satellite-based near-surface measurements. The aim of this study is to develop and validate novel vertical extrapolation methods that can accurately estimate wind speed time series across rotor-swept heights using these near-surface measurements. We contrast the conventional logarithmic profile against three novel approaches: a logarithmic profile with a long-term stability correction, a single-column model, and a machine-learning model. These models are developed and validated using 1 year of observations from two floating lidars deployed in U.S. Atlantic offshore wind energy areas. We find that the machine-learning model significantly outperforms all other models across all stability regimes, seasons, and times of day. Machine-learning model performance is considerably improved by including the air-sea temperature difference, which provides some accounting for offshore atmospheric stability. Finally, we find no degradation in machine-learning model performance when tested 83 km from its training location, suggesting promising future applications in extrapolating 10-m wind speeds from spatially resolved satellite-based wind atlases.


2021 ◽  
Vol 6 (3) ◽  
pp. 935-948
Author(s):  
Mike Optis ◽  
Nicola Bodini ◽  
Mithu Debnath ◽  
Paula Doubrawa

Abstract. Accurate characterization of the offshore wind resource has been hindered by a sparsity of wind speed observations that span offshore wind turbine rotor-swept heights. Although public availability of floating lidar data is increasing, most offshore wind speed observations continue to come from buoy-based and satellite-based near-surface measurements. The aim of this study is to develop and validate novel vertical extrapolation methods that can accurately estimate wind speed time series across rotor-swept heights using these near-surface measurements. We contrast the conventional logarithmic profile against three novel approaches: a logarithmic profile with a long-term stability correction, a single-column model, and a machine-learning model. These models are developed and validated using 1 year of observations from two floating lidars deployed in US Atlantic offshore wind energy areas. We find that the machine-learning model significantly outperforms all other models across all stability regimes, seasons, and times of day. Machine-learning model performance is considerably improved by including the air–sea temperature difference, which provides some accounting for offshore atmospheric stability. Finally, we find no degradation in machine-learning model performance when tested 83 km from its training location, suggesting promising future applications in extrapolating 10 m wind speeds from spatially resolved satellite-based wind atlases.


2021 ◽  
Author(s):  
Carsten Baumann ◽  
Aoife E. McCloskey

<p>GNSS positioning errors, spacecraft operations failures and power outages potentially originate from space weather in general and the solar wind interaction with the geomagnetic field in particular. Depending on the solar wind speed, information from L1 solar wind monitor spacecraft only give a lead time to take safety measures between 20 and 90 minutes.  This very short lead time requires end users to have the most reliable warnings when potential impacts will actually occur. In this study we present a machine learning algorithm that is suitable to predict the solar wind propagation delay between Lagrangian point L1 and the Earth.  This work introduces the proposed algorithm and investigates its operational applicability to a realtime scenario.</p><p>The propagation delay is measured from interplanetary shocks passing the Advanced Composition Explorer (ACE) first and their sudden commencements within the magnetosphere later, as recorded by ground-based magnetometers. Overall 380 interplanetary shocks with data ranging from 1998 to 2018 builds up the database that is used to train the machine learning model. We investigate two different feature sets. The training of one machine learning model DSCOVR real time solar wind (RTSW) like data which contains all three components of solar wind speed is used. For the other machine learning model ACE RTSW like data which only provide bulk solar wind speed will be used for training. Both feature sets also contain the position of the spacecrafts. The performance assessment of the machine learning model is examined on the basis of a 10-fold cross-validation. The major advantage of the machine learning approach is its simplicity when it comes to its application. After training, values for the different features have to be fed into the algorithms only and the evaluation of the propagation delay can be continuous.</p><p>Both machine learning models will be validated against a simple convective solar wind propagation delay model as it is also used in operational space weather centers. For this purpose time periods will be investigated where L1 spacecraft and Earth satellites just outside the magnetosphere probe the same features of the interplanetary magnetic field. This method allows a detailed validation of the solar wind propagation delay apart from the technique that relies on interplanetary shocks.</p>


2018 ◽  
Author(s):  
Steen Lysgaard ◽  
Paul C. Jennings ◽  
Jens Strabo Hummelshøj ◽  
Thomas Bligaard ◽  
Tejs Vegge

A machine learning model is used as a surrogate fitness evaluator in a genetic algorithm (GA) optimization of the atomic distribution of Pt-Au nanoparticles. The machine learning accelerated genetic algorithm (MLaGA) yields a 50-fold reduction of required energy calculations compared to a traditional GA.


Author(s):  
Dhilsath Fathima.M ◽  
S. Justin Samuel ◽  
R. Hari Haran

Aim: This proposed work is used to develop an improved and robust machine learning model for predicting Myocardial Infarction (MI) could have substantial clinical impact. Objectives: This paper explains how to build machine learning based computer-aided analysis system for an early and accurate prediction of Myocardial Infarction (MI) which utilizes framingham heart study dataset for validation and evaluation. This proposed computer-aided analysis model will support medical professionals to predict myocardial infarction proficiently. Methods: The proposed model utilize the mean imputation to remove the missing values from the data set, then applied principal component analysis to extract the optimal features from the data set to enhance the performance of the classifiers. After PCA, the reduced features are partitioned into training dataset and testing dataset where 70% of the training dataset are given as an input to the four well-liked classifiers as support vector machine, k-nearest neighbor, logistic regression and decision tree to train the classifiers and 30% of test dataset is used to evaluate an output of machine learning model using performance metrics as confusion matrix, classifier accuracy, precision, sensitivity, F1-score, AUC-ROC curve. Results: Output of the classifiers are evaluated using performance measures and we observed that logistic regression provides high accuracy than K-NN, SVM, decision tree classifiers and PCA performs sound as a good feature extraction method to enhance the performance of proposed model. From these analyses, we conclude that logistic regression having good mean accuracy level and standard deviation accuracy compared with the other three algorithms. AUC-ROC curve of the proposed classifiers is analyzed from the output figure.4, figure.5 that logistic regression exhibits good AUC-ROC score, i.e. around 70% compared to k-NN and decision tree algorithm. Conclusion: From the result analysis, we infer that this proposed machine learning model will act as an optimal decision making system to predict the acute myocardial infarction at an early stage than an existing machine learning based prediction models and it is capable to predict the presence of an acute myocardial Infarction with human using the heart disease risk factors, in order to decide when to start lifestyle modification and medical treatment to prevent the heart disease.


Sign in / Sign up

Export Citation Format

Share Document