scholarly journals Tropical Overshooting Cloud-Top Height Retrieval from Himawari-8 Imagery Based on Random Forest Model

Atmosphere ◽  
2021 ◽  
Vol 12 (2) ◽  
pp. 173
Author(s):  
Gaoyun Wang ◽  
Hongqing Wang ◽  
Yizhou Zhuang ◽  
Qiong Wu ◽  
Siyue Chen ◽  
...  

Tropical overshooting convection has a strong impact on both heat budget and moisture distribution in the upper troposphere and lower stratosphere, and it can pose a great risk to aviation safety. Cloud-top height is one of the essential concerns of overshooting convection for both the climate system and the aviation weather forecast. The main purpose of our work is to verify the application of the machine learning method, taking the random forest (RF) model as an instance, in overshooting cloud-top height retrieval from Himawari-8 data. By using collocated CloudSat observations as a reference, we utilize several infrared indicators of Himawari-8 that are commonly recognized to relate to cloud-top height, along with some temporal and geographical parameters (latitude, month, satellite zenith angle, etc.), as predictors to construct and validate the model. Analysis of variable importance shows that the brightness temperature of 6.2 um acts as the dominant predictor, followed by satellite zenith angle, brightness temperature of 13.3 um, latitude, and month. In the comparison between the RF model and the traditional single-channel interpolation method, retrievals from the RF model agree well with observation with a high correlation coefficient (0.92), small RMSE (222 m), and small MAE (164 m), while these metrics from traditional single-channel interpolation method shows lower skills (0.70, 1305 m, and 1179 m). This work presents a new sight of overshooting cloud-top height retrieval based on the machine learning method.

2019 ◽  
Vol 18 (05) ◽  
pp. 1579-1603 ◽  
Author(s):  
Zhijiang Wan ◽  
Hao Zhang ◽  
Jiajin Huang ◽  
Haiyan Zhou ◽  
Jie Yang ◽  
...  

Many studies developed the machine learning method for discriminating Major Depressive Disorder (MDD) and normal control based on multi-channel electroencephalogram (EEG) data, less concerned about using single channel EEG collected from forehead scalp to discriminate the MDD. The EEG dataset is collected by the Fp1 and Fp2 electrode of a 32-channel EEG system. The result demonstrates that the classification performance based on the EEG of Fp1 location exceeds the performance based on the EEG of Fp2 location, and shows that single-channel EEG analysis can provide discrimination of MDD at the level of multi-channel EEG analysis. Furthermore, a portable EEG device collecting the signal from Fp1 location is used to collect the second dataset. The Classification and Regression Tree combining genetic algorithm (GA) achieves the highest accuracy of 86.67% based on leave-one-participant-out cross validation, which shows that the single-channel EEG-based machine learning method is promising to support MDD prescreening application.


2021 ◽  
Vol 5 (2) ◽  
pp. 415
Author(s):  
Firdausi Nuzula Zamzami ◽  
Adiwijaya Adiwijaya ◽  
Mahendra Dwifebri P

Information exchange is currently the most happening on the internet. Information exchange can be done in many ways, such as expressing expressions on social media. One of them is reviewing a film. When someone reviews a film he will use his emotions to express their feelings, it can be positive or negative. The fast growth of the internet has made information more diverse, plentiful and unstructured. Sentiment analysis can handle this, because sentiment analysis is a classification process to understand opinions, interactions, and emotions of a document or text that is carried out automatically by a computer system. One suitable machine learning method is the Modified Balanced Random Forest. To deal with the various data, the feature selection used is Mutual Information. With these two methods, the system is able to produce an accuracy value of 79% and F1-scores value of 75%.


The system identifies a duplicate record from the database using the machine learning method. We must pass unstructured data. Data are prepared using any natural language processing technique such as text similarity. This prepared data is then fed into the latest machine learning method called Random Forest. After this data collection, using these files, the target file is compared to the source file. We make input and output files. This is carried out until accurate efficiency is generated


2020 ◽  
Author(s):  
Eric S. Pahl ◽  
W. Nick Street ◽  
Hans J. Johnson ◽  
Alan I. Reed

Kidney transplantation is the best treatment for end-stage renal failure patients. The predominant method used for kidney quality assessment is the Cox regression-based, kidney donor risk index. A machine learning method may provide improved prediction of transplant outcomes and help decision-making. A popular tree-based machine learning method, random forest, was trained and evaluated with the same data originally used to develop the risk index (70,242 observations from 1995-2005). The random forest successfully predicted an additional 2,148 transplants than the risk index with equal type II error rates of 10%. Predicted results were analyzed with follow-up survival outcomes up to 240 months after transplant using Kaplan-Meier analysis and confirmed that the random forest performed significantly better than the risk index (p<0.05). The random forest predicted significantly more successful and longer-surviving transplants than the risk index. Random forests and other machine learning models may improve transplant decisions.


IEEE Access ◽  
2021 ◽  
Vol 9 ◽  
pp. 34112-34118
Author(s):  
Xiaohui Chen ◽  
Shuyang Yu ◽  
Yongfang Zhang ◽  
Fangfang Chu ◽  
Bin Sun

Author(s):  
Akihito Asakura ◽  
Toshiki Hirogaki ◽  
Eiichi Aoyama ◽  
Hiroyuki Kodama

Abstract In recent years, the needs associated with the development of new technologies in the manufacturing industry that utilize big data typified by the Internet-of-Things (IoT) and artificial intelligence (AI) have been increasing. Recent computer-aided manufacturing (CAM) systems have evolved so that unskilled technicians can create tool paths relatively easily with numerically controlled (NC) programs, but tool-cutting conditions used for machining cannot be automatically determined. Therefore, many unskilled technicians often set the cutting conditions based on the recommended conditions described in the tool catalog. However, given that the catalog contains large-scale data on machining technology, setting the proper conditions becomes a time-consuming and inefficient process. In this study, we aimed to construct a system to support unskilled technicians to determine the optimum machining conditions. To this end, we constructed a prediction model using a random forest machine learning method to predict the cutting conditions. It was confirmed that the prediction with the random forest method can be performed with high accuracy based on the cutting conditions recommended by the tool maker. Thus, the effectiveness of this method was verified.


2021 ◽  
Author(s):  
Afifuddin ◽  
Siti Arfah ◽  
Dionysius Bryan Sencaki ◽  
Zilda Dona Okta Permata ◽  
Mega Novetrishka Putri ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document