scholarly journals A Machine Learning Framework for Predicting Bridge Defect Detection Cost

2021 ◽  
Vol 6 (11) ◽  
pp. 152
Author(s):  
Chongjiao Wang ◽  
Changrong Yao ◽  
Bin Qiang ◽  
Siguang Zhao ◽  
Yadong Li

Evaluating the cost of detecting bridge defects is a difficult task, but one that is vital to the lifecycle cost analysis of bridges. In this study, a detection cost sample database was established based on practical engineering data, and a bridge defect detection cost prediction model and software were developed using machine learning. First, the random forest method was adopted to evaluate the importance of the seven main factors affecting the detection cost. The most important indicators were selected, and the recent GDP growth rate was employed to account for the impact of social and economic developments on the detection cost. Combining a genetic algorithm with a multilayer neural network, a detection cost prediction model was established. The predictions given by this model were found to have an average relative error of 3.41%. Finally, an intelligent prediction software for bridge defect detection costs was established, providing a reliable reference for bridge lifecycle cost analysis and the evaluation of defect detection costs during the operation period.

2021 ◽  
Author(s):  
Syeda Nadia Firdaus

Social network is a hot topic of interest for researchers in the field of computer science in recent years. These social networks such as Facebook, Twitter, Instagram play an important role in information diffusion. Social network data are created by its users. Users’ online activities and behavior have been studied in various past research efforts in order to get a better understanding on how information is diffused on social networks. In this study, we focus on Twitter and we explore the impact of user behavior on their retweet activity. To represent a user’s behavior for predicting their retweet decision, we introduce 10-dimentional emotion and 35-dimensional personality related features. We consider the difference of a user being an author and a retweeter in terms of their behaviors, and propose a machine learning based retweet prediction model considering this difference. We also propose two approaches for matrix factorization retweet prediction model which learns the latent relation between users and tweets to predict the user’s retweet decision. In the experiment, we have tested our proposed models. We find that models based on user behavior related features provide good improvement (3% - 6% in terms of F1- score) over baseline models. By only considering user’s behavior as a retweeter, the data processing time is reduced while the prediction accuracy is comparable to the case when both retweeting and posting behaviors are considered. In the proposed matrix factorization models, we include tweet features into the basic factorization model through newly defined regularization terms and improve the performance by 3% - 4% in terms of F1-score. Finally, we compare the performance of machine learning and matrix factorization models for retweet prediction and find that none of the models is superior to the other in all occasions. Therefore, different models should be used depending on how prediction results will be used. Machine learning model is preferable when a model’s performance quality is important such as for tweet re-ranking and tweet recommendation. Matrix factorization is a preferred option when model’s positive retweet prediction capability is more important such as for marketing campaign and finding potential retweeters.


2021 ◽  
Author(s):  
Syeda Nadia Firdaus

Social network is a hot topic of interest for researchers in the field of computer science in recent years. These social networks such as Facebook, Twitter, Instagram play an important role in information diffusion. Social network data are created by its users. Users’ online activities and behavior have been studied in various past research efforts in order to get a better understanding on how information is diffused on social networks. In this study, we focus on Twitter and we explore the impact of user behavior on their retweet activity. To represent a user’s behavior for predicting their retweet decision, we introduce 10-dimentional emotion and 35-dimensional personality related features. We consider the difference of a user being an author and a retweeter in terms of their behaviors, and propose a machine learning based retweet prediction model considering this difference. We also propose two approaches for matrix factorization retweet prediction model which learns the latent relation between users and tweets to predict the user’s retweet decision. In the experiment, we have tested our proposed models. We find that models based on user behavior related features provide good improvement (3% - 6% in terms of F1- score) over baseline models. By only considering user’s behavior as a retweeter, the data processing time is reduced while the prediction accuracy is comparable to the case when both retweeting and posting behaviors are considered. In the proposed matrix factorization models, we include tweet features into the basic factorization model through newly defined regularization terms and improve the performance by 3% - 4% in terms of F1-score. Finally, we compare the performance of machine learning and matrix factorization models for retweet prediction and find that none of the models is superior to the other in all occasions. Therefore, different models should be used depending on how prediction results will be used. Machine learning model is preferable when a model’s performance quality is important such as for tweet re-ranking and tweet recommendation. Matrix factorization is a preferred option when model’s positive retweet prediction capability is more important such as for marketing campaign and finding potential retweeters.


2019 ◽  
Vol 9 (1) ◽  
Author(s):  
Kanggeun Lee ◽  
Hyoung-oh Jeong ◽  
Semin Lee ◽  
Won-Ki Jeong

AbstractWith recent advances in DNA sequencing technologies, fast acquisition of large-scale genomic data has become commonplace. For cancer studies, in particular, there is an increasing need for the classification of cancer type based on somatic alterations detected from sequencing analyses. However, the ever-increasing size and complexity of the data make the classification task extremely challenging. In this study, we evaluate the contributions of various input features, such as mutation profiles, mutation rates, mutation spectra and signatures, and somatic copy number alterations that can be derived from genomic data, and further utilize them for accurate cancer type classification. We introduce a novel ensemble of machine learning classifiers, called CPEM (Cancer Predictor using an Ensemble Model), which is tested on 7,002 samples representing over 31 different cancer types collected from The Cancer Genome Atlas (TCGA) database. We first systematically examined the impact of the input features. Features known to be associated with specific cancers had relatively high importance in our initial prediction model. We further investigated various machine learning classifiers and feature selection methods to derive the ensemble-based cancer type prediction model achieving up to 84% classification accuracy in the nested 10-fold cross-validation. Finally, we narrowed down the target cancers to the six most common types and achieved up to 94% accuracy.


2020 ◽  
Vol 4 (4) ◽  
pp. 33
Author(s):  
Toni Pano ◽  
Rasha Kashef

During the COVID-19 pandemic, many research studies have been conducted to examine the impact of the outbreak on the financial sector, especially on cryptocurrencies. Social media, such as Twitter, plays a significant role as a meaningful indicator in forecasting the Bitcoin (BTC) prices. However, there is a research gap in determining the optimal preprocessing strategy in BTC tweets to develop an accurate machine learning prediction model for bitcoin prices. This paper develops different text preprocessing strategies for correlating the sentiment scores of Twitter text with Bitcoin prices during the COVID-19 pandemic. We explore the effect of different preprocessing functions, features, and time lengths of data on the correlation results. Out of 13 strategies, we discover that splitting sentences, removing Twitter-specific tags, or their combination generally improve the correlation of sentiment scores and volume polarity scores with Bitcoin prices. The prices only correlate well with sentiment scores over shorter timespans. Selecting the optimum preprocessing strategy would prompt machine learning prediction models to achieve better accuracy as compared to the actual prices.


Author(s):  
Nam Hai Pham ◽  
Nguyen Ngoc Tan

This study was conducted to determine the factors affecting non-performing loans of commercial banks in Vietnam for the period 2007 - 2018. The study applies the Bayesian approach and the Random-walk Metropolis-Hastings algorithm to evaluate the impact of micro and macro factors on non-performing loans of commercial banks. The dependent variable is non-performing loans, which is measured by the ratio of non-performing loans divided by total outstanding loans; the independent variables in terms of bank characteristics are non-performing loans of the previous year, profitability, bank size, bak loans, and bank capital; the macro variables are inflation and GDP growth. Research data was collected from financial statements of 30 Vietnamese commercial banks and the General Statistics Office of Vietnam from 2007 to 2018. To increase the reliability and efficiency of the model as well as reasonable Bayes inference, a convergence test of the MCMC chain was performed. The result of this study shows that non-performing loans of the previous year, bank size, bank loan, bank capital, and inflation have positive impacts on bank non-performing loans. In addition, bank profitability and GDP growth rate are factors that have the opposite effects. Based on the research results, the author proposes policy implications for the decision-makers to help banks reduce non-performing loans and promote banks to operate effectively and more efficiently.


2020 ◽  
Vol 12 (18) ◽  
pp. 7522
Author(s):  
Zhenzhen Xu ◽  
Chunfu Shao ◽  
Shengyou Wang ◽  
Chunjiao Dong

To promote the sustainable development of urban traffic and improve resident travel satisfaction, the significant factors affecting resident travel satisfaction are analyzed in this paper. An evaluation and prediction model for travel satisfaction based on support vector machine (SVM) is constructed. First, a multinomial logit (MNL) model is constructed to reveal the impact of individual attributes, family attributes and safety hazards on resident travel satisfaction and to clarify the significant factors. Then, a travel satisfaction evaluation model based on the SVM is constructed by taking significant factors as independent variables. Finally, travel optimization measures are proposed and the SVM model is used to predict the effect. Futian Street in Futian District of Shenzhen is taken as the object to carry out specific research. The results show that the following factors have a significant effect on resident travel satisfaction: age, job, level of education, number of car, income, residential area and potential safety hazards of people, vehicles, roads, environment, etc. The model fitting accuracy is 87.76%. The implementation of travel optimization measures may increase travel satisfaction rate by 14.07%.


Author(s):  
George Palumbo ◽  
Richard Shick ◽  
Mark Zaporowski

Creditworthiness, as reflected in bond ratings, is of great interest to municipalities since it directly affects the cost and ability to borrow money.  Municipalities experiencing a decline in their economic health will be especially concerned about how these developments will impact their future bond ratings.  It is well known that municipal analysts closely monitor a community’s economic health since this has an important impact on creditworthiness.  What is less well known however, are the economic variables that influence bond ratings.  The purpose of this paper is to identify these economic variables and estimate to what extent they influence the probability of a municipality’s default.  We do so by developing an econometric model of the rating process.  The model will allow municipal governments to gauge the impact of economic developments on their credit ratings.


2021 ◽  
Vol 13 (6) ◽  
pp. 1224
Author(s):  
Izar Azpiroz ◽  
Noelia Oses ◽  
Marco Quartulli ◽  
Igor G. Olaizola ◽  
Diego Guidotti ◽  
...  

Machine-learning algorithms used for modelling olive-tree phenology generally and largely rely on temperature data. In this study, we developed a prediction model on the basis of climate data and geophysical information. Remote measurements of weather conditions, terrain slope, and surface spectral reflectance were considered for this purpose. The accuracy of the temperature data worsened when replacing weather-station measurements with remote-sensing records, though the addition of more complete environmental data resulted in an efficient prediction model of olive-tree phenology. Filtering and embedded feature-selection techniques were employed to analyze the impact of variables on olive-tree phenology prediction, facilitating the inclusion of measurable information in decision support frameworks for the sustainable management of olive-tree systems.


Sign in / Sign up

Export Citation Format

Share Document