Quantifying the Impacts of Pre- and Post-Conception TSH Levels on Birth Outcomes: An Examination of Different Machine Learning Models

Frontiers in Endocrinology ◽

10.3389/fendo.2021.755364 ◽

2021 ◽

Vol 12 ◽

Author(s):

Yuantong Sun ◽

Weiwei Zheng ◽

Ling Zhang ◽

Huijuan Zhao ◽

Xun Li ◽

...

Keyword(s):

Machine Learning ◽

Birth Outcomes ◽

Loss Function ◽

Apgar Score ◽

Thyroid Stimulating Hormone ◽

Adverse Birth Outcomes ◽

Learning Models ◽

Neural Network Models ◽

Tsh Levels ◽

Machine Learning Models

BackgroundWhile previous studies identified risk factors for diverse pregnancy outcomes, traditional statistical methods had limited ability to quantify their impacts on birth outcomes precisely. We aimed to use a novel approach that applied different machine learning models to not only predict birth outcomes but systematically quantify the impacts of pre- and post-conception serum thyroid-stimulating hormone (TSH) levels and other predictive characteristics on birth outcomes.MethodsWe used data from women who gave birth in Shanghai First Maternal and Infant Hospital from 2014 to 2015. We included 14,110 women with the measurement of preconception TSH in the first analysis and 3,428 out of 14,110 women with both pre- and post-conception TSH measurement in the second analysis. Synthetic Minority Over-sampling Technique (SMOTE) was applied to adjust the imbalance of outcomes. We randomly split (7:3) the data into a training set and a test set in both analyses. We compared Area Under Curve (AUC) for dichotomous outcomes and macro F1 score for categorical outcomes among four machine learning models, including logistic model, random forest model, XGBoost model, and multilayer neural network models to assess model performance. The model with the highest AUC or macro F1 score was used to quantify the importance of predictive features for adverse birth outcomes with the loss function algorithm.ResultsThe XGBoost model provided prominent advantages in terms of improved performance and prediction of polytomous variables. Predictive models with abnormal preconception TSH or not-well-controlled TSH, a novel indicator with pre- and post-conception TSH levels combined, provided the similar robust prediction for birth outcomes. The highest AUC of 98.7% happened in XGBoost model for predicting low Apgar score with not-well-controlled TSH adjusted. By loss function algorithm, we found that not-well-controlled TSH ranked 4th, 6th, and 7th among 14 features, respectively, in predicting birthweight, induction, and preterm birth, and 3rd among 19 features in predicting low Apgar score.ConclusionsOur four machine learning models offered valid predictions of birth outcomes in women during pre- and post-conception. The predictive features panel suggested the combined TSH indicator (not-well-controlled TSH) could be a potentially competitive biomarker to predict adverse birth outcomes.

Download Full-text

First-Break Picking Classification Models Using Recurrent Neural Network

10.2118/204862-ms ◽

2021 ◽

Author(s):

Mohammed Ayub ◽

SanLinn Kaka

Keyword(s):

Neural Network ◽

Machine Learning ◽

Deep Neural Network ◽

Contextual Information ◽

Classification Model ◽

Superior Performance ◽

Learning Models ◽

Neural Network Models ◽

Minimum Number ◽

Machine Learning Models

Abstract Manual first-break picking from a large volume of seismic data is extremely tedious and costly. Deployment of machine learning models makes the process fast and cost effective. However, these machine learning models require high representative and effective features for accurate automatic picking. Therefore, First- Break (FB) picking classification model that uses effective minimum number of features and promises performance efficiency is proposed. The variants of Recurrent Neural Networks (RNNs) such as Long ShortTerm Memory (LSTM) and Gated Recurrent Unit (GRU) can retain contextual information from long previous time steps. We deploy this advantage for FB picking as seismic traces are amplitude values of vibration along the time-axis. We use behavioral fluctuation of amplitude as input features for LSTM and GRU. The models are trained on noisy data and tested for generalization on original traces not seen during the training and validation process. In order to analyze the real-time suitability, the performance is benchmarked using accuracy, F1-measure and three other established metrics. We have trained two RNN models and two deep Neural Network models for FB classification using only amplitude values as features. Both LSTM and GRU have the accuracy and F1-measure with a score of 94.20%. With the same features, Convolutional Neural Network (CNN) has an accuracy of 93.58% and F1-score of 93.63%. Again, Deep Neural Network (DNN) model has scores of 92.83% and 92.59% as accuracy and F1-measure, respectively. From the pexperiment results, we see significant superior performance of LSTM and GRU to CNN and DNN when used the same features. For robustness of LSTM and GRU models, the performance is compared with DNN model that is trained using nine features derived from seismic traces and observed that the performance superiority of RNN models. Therefore, it is safe to conclude that RNN models (LSTM and GRU) are capable of classifying the FB events efficiently even by using a minimum number of features that are not computationally expensive. The novelty of our work is the capability of automatic FB classification with the RNN models that incorporate contextual behavioral information without the need for sophisticated feature extraction or engineering techniques that in turn can help in reducing the cost and fostering classification model robust and faster.

Download Full-text

Comparison Between Traditional Machine Learning Models And Neural Network Models For Vietnamese Hate Speech Detection

2020 RIVF International Conference on Computing and Communication Technologies (RIVF) ◽

10.1109/rivf48685.2020.9140745 ◽

2020 ◽

Cited By ~ 2

Author(s):

Son T. Luu ◽

Hung P. Nguyen ◽

Kiet Van Nguyen ◽

Ngan Luu-Thuy Nguyen

Keyword(s):

Neural Network ◽

Machine Learning ◽

Hate Speech ◽

Network Models ◽

Learning Models ◽

Neural Network Models ◽

Speech Detection ◽

Machine Learning Models

Download Full-text

MODIS Fractional Snow Cover Mapping Using Machine Learning Technology in a Mountainous Area

Remote Sensing ◽

10.3390/rs12060962 ◽

2020 ◽

Vol 12 (6) ◽

pp. 962 ◽

Cited By ~ 3

Author(s):

Changyu Liu ◽

Xiaodong Huang ◽

Xubing Li ◽

Tiangang Liang

Keyword(s):

Machine Learning ◽

Random Forest ◽

Snow Cover ◽

Back Propagation ◽

Network Models ◽

Support Vector ◽

Learning Models ◽

Neural Network Models ◽

Fractional Snow Cover ◽

Machine Learning Models

To improve the poor accuracy of the MODIS (Moderate Resolution Imaging Spectroradiometer) daily fractional snow cover product over the complex terrain of the Tibetan Plateau (RMSE = 0.30), unmanned aerial vehicle and machine learning technologies are employed to map the fractional snow cover based on MODIS over this terrain. Three machine learning models, including random forest, support vector machine, and back-propagation artificial neural network models, are trained and compared in this study. The results indicate that compared with the MODIS daily fractional snow cover product, the introduction of a highly accurate snow map acquired by unmanned aerial vehicles as a reference into machine learning models can significantly improve the MODIS fractional snow cover mapping accuracy. The random forest model shows the best accuracy among the three machine learning models, with an RMSE (root-mean-square error) of 0.23, especially over forestland and shrubland, with RMSEs of 0.13 and 0.18, respectively. Although the accuracy of the support vector machine and back-propagation artificial neural network models are worse over forestland and shrubland, their average errors are still better than that of MOD10A1. Different fractional snow cover gradients also affect the accuracy of the machine learning algorithms. Nevertheless, the random forest model remains stable in different fractional snow cover gradients and is, therefore, the best machine learning algorithm for MODIS fractional snow cover mapping in Tibetan Plateau areas with complex terrain and severely fragmented snow cover.

Download Full-text

Vibration characteristic analyses of medium-and small-span girder bridge groups in highway systems based on machine learning models

Advances in Structural Engineering ◽

10.1177/1369433221997722 ◽

2021 ◽

pp. 136943322199772

Author(s):

Guanya Lu ◽

Kehai Wang ◽

Weizuo Guo

Keyword(s):

Machine Learning ◽

Random Forest ◽

Large Scale ◽

Vibration Characteristics ◽

Structural Vibration ◽

Learning Models ◽

Longitudinal Vibrations ◽

Neural Network Models ◽

Artificial Neural ◽

Machine Learning Models

There are large amounts of small-and medium-span girder bridges which bear structural similarity, while the large-scale bridge structures are generally limited in the timely applications of structural vibration characteristics. Therefore, in this study a framework based on machine learning models was proposed to analyze the vibration characteristics of specific line bridge groups. The probability distributions of structural, geometric, and material properties of bridge groups in specific lines were obtained using statistical tools and a Latin hypercube sampling method was used to generate reasonable sample sets for the bridges group, and parameterized finite element models of the bridges were established. Then, the optimal models were tuned and determined to predict fundamental mode and period by the 10-fold cross-validation method applying the numerical simulation results. This study’s results showed that the random forest models divided the vibration modes of the bridge groups into the longitudinal vibrations of the main girders and the longitudinal vibrations of the adjacent spans and side piers with a classification accuracy of greater than 90%, while the artificial neural network models exhibited the lowest normalized mean square error for the periods. The periods mainly ranged between 0.7 and 1.5 s. Furthermore, the bearing settings, ratios of the pier height to section diameters, and boundary types were determined to be the most significant properties influencing the fundamental modes and periods of the examined bridges, by respectively observing the reduced value of the random forest Gini indices and distribution of the generalized weight value of the input variables in artificial neural networks. This study provides an intelligent and efficient method for obtaining vibration characteristics of bridges group for a specific network.

Download Full-text

Automated Facial Recognition for Noonan Syndrome Using Novel Deep Convolutional Neural Network With Additive Angular Margin Loss

Frontiers in Genetics ◽

10.3389/fgene.2021.669841 ◽

2021 ◽

Vol 12 ◽

Author(s):

Hang Yang ◽

Xin-Rong Hu ◽

Ling Sun ◽

Dian Hong ◽

Ying-Yi Zheng ◽

...

Keyword(s):

Neural Network ◽

Machine Learning ◽

Convolutional Neural Network ◽

Loss Function ◽

Noonan Syndrome ◽

Facial Recognition ◽

Deep Convolutional Neural Network ◽

Healthy Children ◽

Learning Models ◽

Machine Learning Models

BackgroundNoonan syndrome (NS), a genetically heterogeneous disorder, presents with hypertelorism, ptosis, dysplastic pulmonary valve stenosis, hypertrophic cardiomyopathy, and small stature. Early detection and assessment of NS are crucial to formulating an individualized treatment protocol. However, the diagnostic rate of pediatricians and pediatric cardiologists is limited. To overcome this challenge, we propose an automated facial recognition model to identify NS using a novel deep convolutional neural network (DCNN) with a loss function called additive angular margin loss (ArcFace).MethodsThe proposed automated facial recognition models were trained on dataset that included 127 NS patients, 163 healthy children, and 130 children with several other dysmorphic syndromes. The photo dataset contained only one frontal face image from each participant. A novel DCNN framework with ArcFace loss function (DCNN-Arcface model) was constructed. Two traditional machine learning models and a DCNN model with cross-entropy loss function (DCNN-CE model) were also constructed. Transfer learning and data augmentation were applied in the training process. The identification performance of facial recognition models was assessed by five-fold cross-validation. Comparison of the DCNN-Arcface model to two traditional machine learning models, the DCNN-CE model, and six physicians were performed.ResultsAt distinguishing NS patients from healthy children, the DCNN-Arcface model achieved an accuracy of 0.9201 ± 0.0138 and an area under the receiver operator characteristic curve (AUC) of 0.9797 ± 0.0055. At distinguishing NS patients from children with several other genetic syndromes, it achieved an accuracy of 0.8171 ± 0.0074 and an AUC of 0.9274 ± 0.0062. In both cases, the DCNN-Arcface model outperformed the two traditional machine learning models, the DCNN-CE model, and six physicians.ConclusionThis study shows that the proposed DCNN-Arcface model is a promising way to screen NS patients and can improve the NS diagnosis rate.

Download Full-text

Applying Artificial Intelligence Methods for the Estimation of Disease Incidence: The Utility of Language Models

Frontiers in Digital Health ◽

10.3389/fdgth.2020.569261 ◽

2020 ◽

Vol 2 ◽

Author(s):

Yuanzhao Zhang ◽

Robert Walecki ◽

Joanne R. Winter ◽

Felix J. S. Bragman ◽

Sara Lourenco ◽

...

Keyword(s):

Machine Learning ◽

Digital Health ◽

Contextual Information ◽

Disease Incidence ◽

Language Models ◽

Mathematical Representation ◽

Learning Models ◽

Neural Network Models ◽

Universal Sentence ◽

Machine Learning Models

Background: AI-driven digital health tools often rely on estimates of disease incidence or prevalence, but obtaining these estimates is costly and time-consuming. We explored the use of machine learning models that leverage contextual information about diseases from unstructured text, to estimate disease incidence.Methods: We used a class of machine learning models, called language models, to extract contextual information relating to disease incidence. We evaluated three different language models: BioBERT, Global Vectors for Word Representation (GloVe), and the Universal Sentence Encoder (USE), as well as an approach which uses all jointly. The output of these models is a mathematical representation of the underlying data, known as “embeddings.” We used these to train neural network models to predict disease incidence. The neural networks were trained and validated using data from the Global Burden of Disease study, and tested using independent data sourced from the epidemiological literature.Findings: A variety of language models can be used to encode contextual information of diseases. We found that, on average, BioBERT embeddings were the best for disease names across multiple tasks. In particular, BioBERT was the best performing model when predicting specific disease-country pairs, whilst a fusion model combining BioBERT, GloVe, and USE performed best on average when predicting disease incidence in unseen countries. We also found that GloVe embeddings performed better than BioBERT embeddings when applied to country names. However, we also noticed that the models were limited in view of predicting previously unseen diseases. Further limitations were also observed with substantial variations across age groups and notably lower performance for diseases that are highly dependent on location and climate.Interpretation: We demonstrate that context-aware machine learning models can be used for estimating disease incidence. This method is quicker to implement than traditional epidemiological approaches. We therefore suggest it complements existing modeling efforts, where data is required more rapidly or at larger scale. This may particularly benefit AI-driven digital health products where the data will undergo further processing and a validated approximation of the disease incidence is adequate.

Download Full-text

COMPARATIVE ANALYSIS OF MACHINE LEARNING MODELS AND REGRESSIONS FOR CAR PRICE PREDICTION

Bulletin of V. N. Karazin Kharkiv National University Economic Series ◽

10.26565/2311-2379-2019-97-04 ◽

2019 ◽

Keyword(s):

Neural Network ◽

Machine Learning ◽

Comparative Analysis ◽

Random Forest ◽

Network Models ◽

Gradient Boosting ◽

Learning Models ◽

Neural Network Models ◽

Boosting Algorithms ◽

Machine Learning Models

The purpose of the research described in this article is a comparative analysis of the predictive qualities of some models of machine learning and regression. The factors for models are the consumer characteristics of a used car: brand, transmission type, drive type, engine type, mileage, body type, year of manufacture, seller's region in Ukraine, condition of the car, information about accident, average price for analogue in Ukraine, engine volume, quantity of doors, availability of extra equipment, quantity of passenger’s seats, the first registration of a car, car was driven from abroad or not. Qualitative variables has been encoded as binary variables or by mean target encoding. The information about more than 200 thousand cars have been used for modeling. All models have been evaluated in the Python Software using Sklearn, Catboost, StatModels and Keras libraries. The following regression models and machine learning models were considered in the course of the study: linear regression; polynomial regression; decision tree; neural network; models based on "k-nearest neighbors", "random forest", "gradient boosting" algorithms; ensemble of models. The article presents the best in terms of quality (according to the criteria R2, MAE, MAD, MAPE) options from each class of models. It has been found that the best way to predict the price of a passenger car is through non-linear models. The results of the modeling show that the dependence between the price of a car and its characteristics is best described by the ensemble of models, which includes a neural network, models using "random forest" and "gradient boosting" algorithms. The ensemble of models showed an average relative approximation error of 11.2% and an average relative forecast error of 14.34%. All nonlinear models for car price have approximately the same predictive qualities (the difference between the MAPE within 2%) in this research.

Download Full-text

Neural network with Particle Swarm Optimisation for Analysing Band Structure of Elastic Metamaterials

Current Mechanics and Advanced Materials ◽

10.2174/2666184501666210908120227 ◽

2021 ◽

Vol 01 ◽

Author(s):

Weiqi Chen ◽

Cheuk Yu Lee ◽

Xiuping Jia ◽

Qing-Hua Qin

Keyword(s):

Machine Learning ◽

Band Structure ◽

Loss Function ◽

Particle Swarm ◽

Physical Property ◽

Particle Swarm Optimisation ◽

Machine Learning Techniques ◽

Learning Models ◽

Particle Swarm Optimisation Algorithm ◽

Machine Learning Models

Background: Due to the development of computing resources, machine learning techniques and models integrated with evolutionary algorithms have been successfully applied to solve a vast of engineering problems. With the advance in elastic metamaterial research, the identification of band structure, which reflects the physical property of Elastic Metamaterial, holds the key to the design of wave-controlled devices. Objective: In order to conduct bandgap analysis on two specific metamaterial structures, machine learning models that are integrated with the evolutionary algorithm are proposed to predict band structure. Methods: This paper proposes two integration models with a modified loss function for predicting elastic metamaterial’s band structure. The self-defined loss function composed of mean square error and concordance correlation is designed to ensure the numerical eigenfrequency values but also the position of each band. Results: The results of the integration models indicate the MLPs-PSO and RBFs – PSO models indeed have relatively satisfying performances on such pattern recognition tasks with respect to the numerical values of the error measurements. The performances of the machine learning models could be outstandingly improved by the Particle Swarm Optimisation algorithm. Conclusion: In short, the well-trained machine models are able to predict the band structure and could be contributing to bandgap enlargement study.

Download Full-text

Machine Learning Crowdfunding

International Journal of Knowledge-Based Organizations ◽

10.4018/ijkbo.2020040101 ◽

2020 ◽

Vol 10 (2) ◽

pp. 1-11

Author(s):

Evangelos Katsamakas ◽

Hao Sun

Keyword(s):

Neural Network ◽

Machine Learning ◽

Language Processing ◽

Topic Modeling ◽

Network Models ◽

Unstructured Data ◽

Text Analytics ◽

Learning Models ◽

Neural Network Models ◽

Machine Learning Models

Crowdfunding is a novel and important economic mechanism for funding projects and promoting innovation in the digital economy. This article explores most recent structured and unstructured data from a crowdfunding platform. It provides an in-depth exploration of the data using text analytics techniques, such as sentiment analysis and topic modeling. It uses novel natural language processing to represent project descriptions, and evaluates machine learning models, including neural network models, to predict project fundraising success. It discusses the findings of the performance evaluation, and summarizes lessons for crowdfunding platforms and their users.

Download Full-text

Improving XGBoost with Imagination Sampling

Communications of the Blyth Institute ◽

10.33014/issn.2640-5652.2.1.holloway.1 ◽

2020 ◽

Vol 2 (1) ◽

pp. 3-6

Author(s):

Eric Holloway

Keyword(s):

Machine Learning ◽

General System ◽

Learning Models ◽

Starting Point ◽

Machine Learning Models

Imagination Sampling is the usage of a person as an oracle for generating or improving machine learning models. Previous work demonstrated a general system for using Imagination Sampling for obtaining multibox models. Here, the possibility of importing such models as the starting point for further automatic enhancement is explored.

Download Full-text