boosting algorithm
Recently Published Documents


TOTAL DOCUMENTS

299
(FIVE YEARS 117)

H-INDEX

21
(FIVE YEARS 3)

Mathematics ◽  
2022 ◽  
Vol 10 (2) ◽  
pp. 231
Author(s):  
Bubryur Kim ◽  
Dong-Eun Lee ◽  
Gang Hu ◽  
Yuvaraj Natarajan ◽  
Sri Preethaa ◽  
...  

Developments in fiber-reinforced polymer (FRP) composite materials have created a huge impact on civil engineering techniques. Bonding properties of FRP led to its wide usage with concrete structures for interfacial bonding. FRP materials show great promise for rehabilitation of existing infrastructure by strengthening concrete structures. Existing machine learning-based models for predicting the FRP–concrete bond strength have not attained maximum performance in evaluating the bond strength. This paper presents an ensemble machine learning approach capable of predicting the FRP–concrete interfacial bond strength. In this work, a dataset holding details of 855 single-lap shear tests on FRP–concrete interfacial bonds extracted from the literature is used to build a bond strength prediction model. Test results hold data of different material properties and geometrical parameters influencing the FRP–concrete interfacial bond. This study employs CatBoost algorithm, an improved ensemble machine learning approach used to accurately predict bond strength of FRP–concrete interface. The algorithm performance is compared with those of other ensemble methods (i.e., histogram gradient boosting algorithm, extreme gradient boosting algorithm, and random forest). The CatBoost algorithm outperforms other ensemble methods with various performance metrics (i.e., lower root mean square error (2.310), lower covariance (21.8%), lower integral absolute error (8.8%), and higher R-square (96.1%)). A comparative study is performed between the proposed model and best performing bond strength prediction models in the literature. The results show that FRP–concrete interfacial bonding can be effectively predicted using proposed ensemble method.


Author(s):  
R. J. L. Argamosa ◽  
A. C. Blanco ◽  
R. B. Reyes

Abstract. A large oil spill in Iloilo Straight that occurred on July 3, 2020, as well as a possible deliberate, small but frequent oil spill and surfactant contamination in Manila Bay, were mapped. The method employs the Sentinel 2-1C image, which is transformed into principal components to reveal the presence of oil spills and possibly surfactants. Additionally, a gradient boosting algorithm was trained to discriminate between pixels that were contaminated with oil and those that were not. The multi-band image with three principal components with a 99% cumulative explained variance ratio highlights the occurrence of an oil spill in Iloilo Straight. Further, the classified image produced by pixel-based classification clearly distinguishes between water and oil pixels in the said area. The methodology was applied to a Sentinel 2-1C image of Manila Bay, with pixels observed/identified as oil and classified as well. The highest density of supposedly oil-contaminated pixels (large or small but frequent) was observed on the eastern side of Manila Bay (Bataan). While there were no documented oil spills concurrent to the satellite image used, historical reports on the area indicate that the likelihood of an oil spill is extremely high due to the massive amount of shipping activity. Pixels supposedly contaminated by oil spills also occur in areas near ports where oil spills could occur as a result of ship operations. Pixels with the same properties as oil contamination are also visible in areas adjacent to fishponds and aquaculture, where phytoplankton and fish contribute to surfactant contamination.


2021 ◽  
Vol 16 ◽  
pp. 705-714
Author(s):  
Abela Chairunissa ◽  
Solimun Solimun ◽  
Adji Achmad Rinaldo Fernandes

Credit risk is the risk that has the greatest opportunity to occur in banking. The number of bad loans will also affect bank performance. The banking sector needs to know whether a prospective creditor is classified as a risky person or not. The purpose of this study is to classify creditors and compare the classification results through logistic regression with the maximum likelihood model and the Boosting algorithm, especially the AdaBoost algorithm, and to select a model with the Boosting algorithm Credit Scoring aims to classify prospective creditor into two classes, namely good prospective creditor (Performing Loan) and bad prospective creditor (Non Performing Loan) based on certain characteristics. The method often used for classifying creditor is logistic regression, but this method is less robust and less accurate than data mining. Thus, there is a need for methods that provide greater accuracy. Among the methods that have been proposed is a method called Boosting, which operates sequentially by applying a classification algorithm to the reweighted version of the training data set. This study uses 5 datasets. The first dataset is secondary data originating from data on non-subsidized homeownership creditors of Bank X Malang City. While the other datasets are simulation data with many samples of 10, 500, and 1000. The results of this study indicate that ensemble boosting logistic regression is more suitable for describing binary response problems, especially creditor classification because it provides more accurate information. For high-dimensional data, which is represented by a sample size of 10, ensemble logistic regression is proven to be able to produce fairly accurate predictions with an accuracy rate of up to 80%, whereas in the logistic regression analysis the model raises N.A because many samples < many independent variables. The use of boosting is preferred because it focuses on problems that are misclassified and have a tendency to increase to higher accuracy.


Space Weather ◽  
2021 ◽  
Author(s):  
Xiukuan Zhao ◽  
Guozhu Li ◽  
Haiyong Xie ◽  
Lianhuan Hu ◽  
Wenjie Sun ◽  
...  

2021 ◽  
Vol 15 ◽  
Author(s):  
Shuhao Shi ◽  
Kai Qiao ◽  
Shuai Yang ◽  
Linyuan Wang ◽  
Jian Chen ◽  
...  

The graph neural network (GNN) has been widely used for graph data representation. However, the existing researches only consider the ideal balanced dataset, and the imbalanced dataset is rarely considered. Traditional methods such as resampling, reweighting, and synthetic samples that deal with imbalanced datasets are no longer applicable in GNN. This study proposes an ensemble model called Boosting-GNN, which uses GNNs as the base classifiers during boosting. In Boosting-GNN, higher weights are set for the training samples that are not correctly classified by the previous classifiers, thus achieving higher classification accuracy and better reliability. Besides, transfer learning is used to reduce computational cost and increase fitting ability. Experimental results indicate that the proposed Boosting-GNN model achieves better performance than graph convolutional network (GCN), GraphSAGE, graph attention network (GAT), simplifying graph convolutional networks (SGC), multi-scale graph convolution networks (N-GCN), and most advanced reweighting and resampling methods on synthetic imbalanced datasets, with an average performance improvement of 4.5%.


Author(s):  
L. V. Sukhostat

Context. The problem of detecting anomalies from signals of cyber-physical systems based on spectrogram and scalogram images is considered. The object of the research is complex industrial equipment with heterogeneous sensory systems of different nature.  Objective. The goal of the work is the development of a method for signal anomalies detection based on transfer learning with the extreme gradient boosting algorithm. Method. An approach based on transfer learning and the extreme gradient boosting algorithm, developed for detecting anomalies in acoustic signals of cyber-physical systems, is proposed. Little research has been done in this area, and therefore various pre-trained deep neural model architectures have been studied to improve anomaly detection. Transfer learning uses weights from a deep neural model, pre-trained on a large dataset, and can be applied to a small dataset to provide convergence without overfitting. The classic approach to this problem usually involves signal processing techniques that extract valuable information from sensor data. This paper performs an anomaly detection task using a deep learning architecture to work with acoustic signals that are preprocessed to produce a spectrogram and scalogram. The SPOCU activation function was considered to improve the accuracy of the proposed approach. The extreme gradient boosting algorithm was used because it has high performance and requires little computational resources during the training phase. This algorithm can significantly improve the detection of anomalies in industrial equipment signals. Results. The developed approach is implemented in software and evaluated for the anomaly detection task in acoustic signals of cyber-physical systems on the MIMII dataset. Conclusions. The conducted experiments have confirmed the efficiency of the proposed approach and allow recommending it for practical use in diagnosing the state of industrial equipment. Prospects for further research may lie in the application of ensemble approaches based on transfer learning to various real datasets to improve the performance and fault-tolerance of cyber-physical systems.


Sign in / Sign up

Export Citation Format

Share Document