scholarly journals Support vector machine tropical wind speed retrieval in the presence of rain for Ku-band wind scatterometry

2021 ◽  
Author(s):  
Xingou Xu ◽  
Ad Stoffelen

Abstract. Wind retrieval parameters, i.e., quality indicators and the 2DVAR analysis speeds, are explored with the aim to improve wind speed retrieval during rain for tropical regions. We apply the well-researched support vector machine (SVM) method in machine learning (ML) to solve this complex problem in a data-orientated regression. To guarantee the effectiveness of SVM, the inputs are extensively analysed to evaluate their appropriateness for this problem, before the results are produced. Subsequently, triple collocation shows that the similarity of the resolved Ku-band (OSCAT-2) wind speed in rain is better than the 2DVAR speed, with respect to the collocated C-band (ASCAT) speed, which is much less affected by rain. The comparisons between distributions and differences between data of rain-contaminated winds, corrected winds and good quality C-band winds, illustrate that the rain-distorted wind distributions become more nominal with SVM, hence eliminating rain-induced biases and error variance. Further confirmation is obtained from a case with synchronous Himawari-8 observation indicating rain (clouds) in the scene. Furthermore, the determination of simultaneous rain rate is attempted to retrieve both wind and rain. Although, additional observations or higher resolution may be required to better assess the accuracy of the wind and rain retrievals, the Machine Learning (ML) results demonstrate benefits of such methodology in geophysical retrieval and nowcasting applications.

2021 ◽  
Vol 14 (12) ◽  
pp. 7435-7451
Author(s):  
Xingou Xu ◽  
Ad Stoffelen

Abstract. Wind retrieval parameters, i.e. quality indicators and the two-dimensional variational ambiguity removal (2DVAR) analysis speeds, are explored with the aim to improve wind speed retrieval during rain for tropical regions. We apply the well-researched support vector machine (SVM) method in machine learning (ML) to solve this complex problem in a data-oriented regression. To guarantee the effectiveness of SVM, the inputs are extensively analysed to evaluate their appropriateness for this problem, before the results are produced. The comparisons between distributions and differences between data of rain-contaminated winds, corrected winds and good quality C-band winds illustrate that the rain-distorted wind distributions become more nominal with SVM, hence much reducing the rain-induced biases and error variance. Further confirmation is obtained from a case with synchronous Himawari-8 observation indicating rain (clouds) in the scene. Furthermore, the estimation of simultaneous rain rate is attempted with some success to retrieve both wind and rain. Although additional observations or higher resolution may be required to better assess the accuracy of the wind and rain retrievals, the ML results demonstrate benefits of such methodology in geophysical retrieval and nowcasting applications.


2020 ◽  
Vol 25 (1) ◽  
pp. 24-38
Author(s):  
Eka Patriya

Saham adalah instrumen pasar keuangan yang banyak dipilih oleh investor sebagai alternatif sumber keuangan, akan tetapi saham yang diperjual belikan di pasar keuangan sering mengalami fluktuasi harga (naik dan turun) yang tinggi. Para investor berpeluang tidak hanya mendapat keuntungan, tetapi juga dapat mengalami kerugian di masa mendatang. Salah satu indikator yang perlu diperhatikan oleh investor dalam berinvestasi saham adalah pergerakan Indeks Harga Saham Gabungan (IHSG). Tindakan dalam menganalisa IHSG merupakan hal yang penting dilakukan oleh investor dengan tujuan untuk menemukan suatu trend atau pola yang mungkin berulang dari pergerakan harga saham masa lalu, sehingga dapat digunakan untuk memprediksi pergerakan harga saham di masa mendatang. Salah satu metode yang dapat digunakan untuk memprediksi pergerakan harga saham secara akurat adalah machine learning. Pada penelitian ini dibuat sebuah model prediksi harga penutupan IHSG menggunakan algoritma Support Vector Regression (SVR) yang menghasilkan kemampuan prediksi dan generalisasi yang baik dengan nilai RMSE training dan testing sebesar 14.334 dan 20.281, serta MAPE training dan testing sebesar 0.211% dan 0.251%. Hasil penelitian ini diharapkan dapat membantu para investor dalam mengambil keputusan untuk menyusun strategi investasi saham.


2021 ◽  
Vol 186 (Supplement_1) ◽  
pp. 445-451
Author(s):  
Yifei Sun ◽  
Navid Rashedi ◽  
Vikrant Vaze ◽  
Parikshit Shah ◽  
Ryan Halter ◽  
...  

ABSTRACT Introduction Early prediction of the acute hypotensive episode (AHE) in critically ill patients has the potential to improve outcomes. In this study, we apply different machine learning algorithms to the MIMIC III Physionet dataset, containing more than 60,000 real-world intensive care unit records, to test commonly used machine learning technologies and compare their performances. Materials and Methods Five classification methods including K-nearest neighbor, logistic regression, support vector machine, random forest, and a deep learning method called long short-term memory are applied to predict an AHE 30 minutes in advance. An analysis comparing model performance when including versus excluding invasive features was conducted. To further study the pattern of the underlying mean arterial pressure (MAP), we apply a regression method to predict the continuous MAP values using linear regression over the next 60 minutes. Results Support vector machine yields the best performance in terms of recall (84%). Including the invasive features in the classification improves the performance significantly with both recall and precision increasing by more than 20 percentage points. We were able to predict the MAP with a root mean square error (a frequently used measure of the differences between the predicted values and the observed values) of 10 mmHg 60 minutes in the future. After converting continuous MAP predictions into AHE binary predictions, we achieve a 91% recall and 68% precision. In addition to predicting AHE, the MAP predictions provide clinically useful information regarding the timing and severity of the AHE occurrence. Conclusion We were able to predict AHE with precision and recall above 80% 30 minutes in advance with the large real-world dataset. The prediction of regression model can provide a more fine-grained, interpretable signal to practitioners. Model performance is improved by the inclusion of invasive features in predicting AHE, when compared to predicting the AHE based on only the available, restricted set of noninvasive technologies. This demonstrates the importance of exploring more noninvasive technologies for AHE prediction.


2019 ◽  
Vol 44 (3) ◽  
pp. 266-281 ◽  
Author(s):  
Zhongda Tian ◽  
Yi Ren ◽  
Gang Wang

Wind speed prediction is an important technology in the wind power field; however, because of their chaotic nature, predicting wind speed accurately is difficult. Aims at this challenge, a backtracking search optimization–based least squares support vector machine model is proposed for short-term wind speed prediction. In this article, the least squares support vector machine is chosen as the short-term wind speed prediction model and backtracking search optimization algorithm is used to optimize the important parameters which influence the least squares support vector machine regression model. Furthermore, the optimal parameters of the model are obtained, and the short-term wind speed prediction model of least squares support vector machine is established through parameter optimization. For time-varying systems similar to short-term wind speed time series, a model updating method based on prediction error accuracy combined with sliding window strategy is proposed. When the prediction model does not match the actual short-term wind model, least squares support vector machine trains and re-establishes. This model updating method avoids the mismatch problem between prediction model and actual wind speed data. The actual collected short-term wind speed time series is used as the research object. Multi-step prediction simulation of short-term wind speed is carried out. The simulation results show that backtracking search optimization algorithm–based least squares support vector machine model has higher prediction accuracy and reliability for the short-term wind speed. At the same time, the prediction performance indicators are also improved. The prediction result is that root mean square error is 0.1248, mean absolute error is 0.1374, mean absolute percentile error is 0.1589% and R2 is 0.9648. When the short-term wind speed varies from 0 to 4 m/s, the average value of absolute prediction error is 0.1113 m/s, and average value of absolute relative prediction error is 8.7111%. The proposed prediction model in this article has high engineering application value.


2021 ◽  
Vol 13 (6) ◽  
pp. 3497
Author(s):  
Hassan Adamu ◽  
Syaheerah Lebai Lutfi ◽  
Nurul Hashimah Ahamed Hassain Malim ◽  
Rohail Hassan ◽  
Assunta Di Vaio ◽  
...  

Sustainable development plays a vital role in information and communication technology. In times of pandemics such as COVID-19, vulnerable people need help to survive. This help includes the distribution of relief packages and materials by the government with the primary objective of lessening the economic and psychological effects on the citizens affected by disasters such as the COVID-19 pandemic. However, there has not been an efficient way to monitor public funds’ accountability and transparency, especially in developing countries such as Nigeria. The understanding of public emotions by the government on distributed palliatives is important as it would indicate the reach and impact of the distribution exercise. Although several studies on English emotion classification have been conducted, these studies are not portable to a wider inclusive Nigerian case. This is because Informal Nigerian English (Pidgin), which Nigerians widely speak, has quite a different vocabulary from Standard English, thus limiting the applicability of the emotion classification of Standard English machine learning models. An Informal Nigerian English (Pidgin English) emotions dataset is constructed, pre-processed, and annotated. The dataset is then used to classify five emotion classes (anger, sadness, joy, fear, and disgust) on the COVID-19 palliatives and relief aid distribution in Nigeria using standard machine learning (ML) algorithms. Six ML algorithms are used in this study, and a comparative analysis of their performance is conducted. The algorithms are Multinomial Naïve Bayes (MNB), Support Vector Machine (SVM), Random Forest (RF), Logistics Regression (LR), K-Nearest Neighbor (KNN), and Decision Tree (DT). The conducted experiments reveal that Support Vector Machine outperforms the remaining classifiers with the highest accuracy of 88%. The “disgust” emotion class surpassed other emotion classes, i.e., sadness, joy, fear, and anger, with the highest number of counts from the classification conducted on the constructed dataset. Additionally, the conducted correlation analysis shows a significant relationship between the emotion classes of “Joy” and “Fear”, which implies that the public is excited about the palliatives’ distribution but afraid of inequality and transparency in the distribution process due to reasons such as corruption. Conclusively, the results from this experiment clearly show that the public emotions on COVID-19 support and relief aid packages’ distribution in Nigeria were not satisfactory, considering that the negative emotions from the public outnumbered the public happiness.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Florent Le Borgne ◽  
Arthur Chatton ◽  
Maxime Léger ◽  
Rémi Lenain ◽  
Yohann Foucher

AbstractIn clinical research, there is a growing interest in the use of propensity score-based methods to estimate causal effects. G-computation is an alternative because of its high statistical power. Machine learning is also increasingly used because of its possible robustness to model misspecification. In this paper, we aimed to propose an approach that combines machine learning and G-computation when both the outcome and the exposure status are binary and is able to deal with small samples. We evaluated the performances of several methods, including penalized logistic regressions, a neural network, a support vector machine, boosted classification and regression trees, and a super learner through simulations. We proposed six different scenarios characterised by various sample sizes, numbers of covariates and relationships between covariates, exposure statuses, and outcomes. We have also illustrated the application of these methods, in which they were used to estimate the efficacy of barbiturates prescribed during the first 24 h of an episode of intracranial hypertension. In the context of GC, for estimating the individual outcome probabilities in two counterfactual worlds, we reported that the super learner tended to outperform the other approaches in terms of both bias and variance, especially for small sample sizes. The support vector machine performed well, but its mean bias was slightly higher than that of the super learner. In the investigated scenarios, G-computation associated with the super learner was a performant method for drawing causal inferences, even from small sample sizes.


Sign in / Sign up

Export Citation Format

Share Document