scholarly journals Deep Neural Networks for Genomic Prediction Do Not Estimate Marker Effects

2021 ◽  
Author(s):  
Jordan Ubbens ◽  
Isobel Parkin ◽  
Christina Eynck ◽  
Ian Stavness ◽  
Andrew Sharpe

Genomic prediction is a promising technology for advancing both plant and animal breeding, with many different prediction models evaluated in the literature. It has been suggested that the ability of powerful nonlinear models such as deep neural networks to capture complex epistatic effects between markers offers advantages for genomic prediction. However, these methods tend not to outperform classical linear methods, leaving it an open question why this capacity to model nonlinear effects does not seem to result in better predictive capability. In this work, we propose the theory that, due to a principle called shortcut learning, deep neural networks tend to base their predictions on overall genetic relatedness, rather than on the effects of particular markers, such as epistatic effects. Using several datasets of crop plants (lentil, wheat, and Brassica carinata), we demonstrate the network's indifference to the values of the markers by showing that the same network, provided with only the locations of matches between markers for two individuals, is able to perform prediction to the same level of accuracy.

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Dipendra Jha ◽  
Vishu Gupta ◽  
Logan Ward ◽  
Zijiang Yang ◽  
Christopher Wolverton ◽  
...  

AbstractThe application of machine learning (ML) techniques in materials science has attracted significant attention in recent years, due to their impressive ability to efficiently extract data-driven linkages from various input materials representations to their output properties. While the application of traditional ML techniques has become quite ubiquitous, there have been limited applications of more advanced deep learning (DL) techniques, primarily because big materials datasets are relatively rare. Given the demonstrated potential and advantages of DL and the increasing availability of big materials datasets, it is attractive to go for deeper neural networks in a bid to boost model performance, but in reality, it leads to performance degradation due to the vanishing gradient problem. In this paper, we address the question of how to enable deeper learning for cases where big materials data is available. Here, we present a general deep learning framework based on Individual Residual learning (IRNet) composed of very deep neural networks that can work with any vector-based materials representation as input to build accurate property prediction models. We find that the proposed IRNet models can not only successfully alleviate the vanishing gradient problem and enable deeper learning, but also lead to significantly (up to 47%) better model accuracy as compared to plain deep neural networks and traditional ML techniques for a given input materials representation in the presence of big data.


Author(s):  
Vishal Babu Siramshetty ◽  
Dac-Trung Nguyen ◽  
Natalia J. Martinez ◽  
Anton Simeonov ◽  
Noel T. Southall ◽  
...  

The rise of novel artificial intelligence methods necessitates a comparison of this wave of new approaches with classical machine learning for a typical drug discovery project. Inhibition of the potassium ion channel, whose alpha subunit is encoded by human Ether-à-go-go-Related Gene (hERG), leads to prolonged QT interval of the cardiac action potential and is a significant safety pharmacology target for the development of new medicines. Several computational approaches have been employed to develop prediction models for assessment of hERG liabilities of small molecules including recent work using deep learning methods. Here we perform a comprehensive comparison of prediction models based on classical (random forests and gradient boosting) and modern (deep neural networks and recurrent neural networks) artificial intelligence methods. The training set (~9000 compounds) was compiled by integrating hERG bioactivity data from ChEMBL database with experimental data generated from an in-house, high-throughput thallium flux assay. We utilized different molecular descriptors including the latent descriptors, which are real-valued continuous vectors derived from chemical autoencoders trained on a large chemical space (> 1.5 million compounds). The models were prospectively validated on ~840 in-house compounds screened in the same thallium flux assay. The deep neural networks performed significantly better than the classical methods with the latent descriptors. The recurrent neural networks that operate on SMILES provided highest model sensitivity. The best models were merged into a consensus model that offered superior performance compared to reference models from academic and commercial domains. Further, we shed light on the potential of artificial intelligence methods to exploit the chemistry big data and generate novel chemical representations useful in predictive modeling and tailoring new chemical space.<br>


Algorithms ◽  
2020 ◽  
Vol 13 (12) ◽  
pp. 342
Author(s):  
Guojing Huang ◽  
Qingliang Chen ◽  
Congjian Deng

With the development of E-commerce, online advertising began to thrive and has gradually developed into a new mode of business, of which Click-Through Rates (CTR) prediction is the essential driving technology. Given a user, commodities and scenarios, the CTR model can predict the user’s click probability of an online advertisement. Recently, great progress has been made with the introduction of Deep Neural Networks (DNN) into CTR. In order to further advance the DNN-based CTR prediction models, this paper introduces a new model of FO-FTRL-DCN, based on the prestigious model of Deep&Cross Network (DCN) augmented with the latest optimization technique of Follow The Regularized Leader (FTRL) for DNN. The extensive comparative experiments on the iPinYou datasets show that the proposed model has outperformed other state-of-the-art baselines, with better generalization across different datasets in the benchmark.


2017 ◽  
Author(s):  
Yilun Wang ◽  
Michal Kosinski

We show that faces contain much more information about sexual orientation than can be perceived and interpreted by the human brain. We used deep neural networks to extract features from 35,326 facial images. These features were entered into a logistic regression aimed at classifying sexual orientation. Given a single facial image, a classifier could correctly distinguish between gay and heterosexual men in 81% of cases, and in 74% of cases for women. Human judges achieved much lower accuracy: 61% for men and 54% for women. The accuracy of the algorithm increased to 91% and 83%, respectively, given five facial images per person. Facial features employed by the classifier included both fixed (e.g., nose shape) and transient facial features (e.g., grooming style). Consistent with the prenatal hormone theory of sexual orientation, gay men and women tended to have gender-atypical facial morphology, expression, and grooming styles. Prediction models aimed at gender alone allowed for detecting gay males with 57% accuracy and gay females with 58% accuracy. Those findings advance our understanding of the origins of sexual orientation and the limits of human perception. Additionally, given that companies and governments are increasingly using computer vision algorithms to detect people’s intimate traits, our findings expose a threat to the privacy and safety of gay men and women.


2019 ◽  
Vol 141 (8) ◽  
Author(s):  
Ali Madani ◽  
Ahmed Bakhaty ◽  
Jiwon Kim ◽  
Yara Mubarak ◽  
Mohammad R. K. Mofrad

Finite element and machine learning modeling are two predictive paradigms that have rarely been bridged. In this study, we develop a parametric model to generate arterial geometries and accumulate a database of 12,172 2D finite element simulations modeling the hyperelastic behavior and resulting stress distribution. The arterial wall composition mimics vessels in atherosclerosis–a complex cardiovascular disease and one of the leading causes of death globally. We formulate the training data to predict the maximum von Mises stress, which could indicate risk of plaque rupture. Trained deep learning models are able to accurately predict the max von Mises stress within 9.86% error on a held-out test set. The deep neural networks outperform alternative prediction models and performance scales with amount of training data. Lastly, we examine the importance of contributing features on stress value and location prediction to gain intuitions on the underlying process. Moreover, deep neural networks can capture the functional mapping described by the finite element method, which has far-reaching implications for real-time and multiscale prediction tasks in biomechanics.


2021 ◽  
Vol 13 (22) ◽  
pp. 12442
Author(s):  
Amal A. Al-Shargabi ◽  
Abdulbasit Almhafdy ◽  
Dina M. Ibrahim ◽  
Manal Alghieth ◽  
Francisco Chiclana

The dramatic growth in the number of buildings worldwide has led to an increase interest in predicting energy consumption, especially for the case of residential buildings. As the heating and cooling system highly affect the operation cost of buildings; it is worth investigating the development of models to predict the heating and cooling loads of buildings. In contrast to the majority of the existing related studies, which are based on historical energy consumption data, this study considers building characteristics, such as area and floor height, to develop prediction models of heating and cooling loads. In particular, this study proposes deep neural networks models based on several hyper-parameters: the number of hidden layers, the number of neurons in each layer, and the learning algorithm. The tuned models are constructed using a dataset generated with the Integrated Environmental Solutions Virtual Environment (IESVE) simulation software for the city of Buraydah city, the capital of the Qassim region in Saudi Arabia. The Qassim region was selected because of its harsh arid climate of extremely cold winters and hot summers, which means that lot of energy is used up for cooling and heating of residential buildings. Through model tuning, optimal parameters of deep learning models are determined using the following performance measures: Mean Square Error (MSE), Root Mean Square Error (RMSE), Regression (R) values, and coefficient of determination (R2). The results obtained with the five-layer deep neural network model, with 20 neurons in each layer and the Levenberg–Marquardt algorithm, outperformed the results of the other models with a lower number of layers. This model achieved MSE of 0.0075, RMSE 0.087, R and R2 both as high as 0.99 in predicting the heating load and MSE of 0.245, RMSE of 0.495, R and R2 both as high as 0.99 in predicting the cooling load. As the developed prediction models were based on buildings characteristics, the outcomes of the research may be relevant to architects at the pre-design stage of heating and cooling energy-efficient buildings.


2017 ◽  
Vol 25 (3) ◽  
pp. 878-891 ◽  
Author(s):  
Reda Al-Bahrani ◽  
Ankit Agrawal ◽  
Alok Choudhary

We utilize deep neural networks to develop prediction models for patient survival and conditional survival of colon cancer. Our models are trained and validated on data obtained from the Surveillance, Epidemiology, and End Results Program. We provide an online outcome calculator for 1, 2, and 5 years survival periods. We experimented with multiple neural network structures and found that a network with five hidden layers produces the best results for these data. Moreover, the online outcome calculator provides conditional survival of 1, 2, and 5 years after surviving the mentioned survival periods. In this article, we report an approximate 0.87 area under the receiver operating characteristic curve measurements, higher than the 0.85 reported by Stojadinovic et al.


2020 ◽  
Author(s):  
Vishal Babu Siramshetty ◽  
Dac-Trung Nguyen ◽  
Natalia J. Martinez ◽  
Anton Simeonov ◽  
Noel T. Southall ◽  
...  

The rise of novel artificial intelligence methods necessitates a comparison of this wave of new approaches with classical machine learning for a typical drug discovery project. Inhibition of the potassium ion channel, whose alpha subunit is encoded by human Ether-à-go-go-Related Gene (hERG), leads to prolonged QT interval of the cardiac action potential and is a significant safety pharmacology target for the development of new medicines. Several computational approaches have been employed to develop prediction models for assessment of hERG liabilities of small molecules including recent work using deep learning methods. Here we perform a comprehensive comparison of prediction models based on classical (random forests and gradient boosting) and modern (deep neural networks and recurrent neural networks) artificial intelligence methods. The training set (~9000 compounds) was compiled by integrating hERG bioactivity data from ChEMBL database with experimental data generated from an in-house, high-throughput thallium flux assay. We utilized different molecular descriptors including the latent descriptors, which are real-valued continuous vectors derived from chemical autoencoders trained on a large chemical space (> 1.5 million compounds). The models were prospectively validated on ~840 in-house compounds screened in the same thallium flux assay. The deep neural networks performed significantly better than the classical methods with the latent descriptors. The recurrent neural networks that operate on SMILES provided highest model sensitivity. The best models were merged into a consensus model that offered superior performance compared to reference models from academic and commercial domains. Further, we shed light on the potential of artificial intelligence methods to exploit the chemistry big data and generate novel chemical representations useful in predictive modeling and tailoring new chemical space.<br>


Author(s):  
Zachi I Attia ◽  
Gilad Lerman ◽  
Paul A Friedman

Abstract Aims We sought to investigate whether artificial intelligence (AI) and specifically deep neural networks for electrocardiogram (ECG) signal analysis can be explained using human selected features. We also sought to quantify such explainability and test if the AI model learns features that are similar to a human expert. Methods We used a set of 100,000 ECGs that were annotated by human explainable features. We applied both linear and nonlinear models to predict published ECG AI models output for the detection of patients' age and sex. We further used canonical correlation analysis to quantify the amount of shared information between the features found in an unsupervised way. We reconstructed single human-selected ECG features from the unexplained neural network features using a simple linear model. Results We noticed a strong correlation between the simple models and the AI output (R2 of 0.49-0.57 for the linear models and R2 of 0.69-0.70 for the nonlinear models). We found that the correlation of the human explainable features with either 13 of the strongest age AI features or 15 of the strongest sex AI features was above 0.85 (for comparison, the first 14 principal components explain 90% of the human feature variance). We linearly reconstructed single human-selected ECG features from the AI features with R2 up to 0.86. Conclusion This work shows that neural networks for ECG signals extract features in a similar manner to human experts and that they also generate additional novel features that help achieve superior performance.


2021 ◽  
Author(s):  
Hailun Liang ◽  
Dongzuo Liang ◽  
Lei Tao ◽  
Xiaoshuai Zhang ◽  
Xiao Li ◽  
...  

BACKGROUND Dyslipidemia is an important risk factor for coronary artery disease and stroke. Early detection and prevention of dyslipidemia can markedly alter cardiovascular morbidity and mortality. Cox proportional hazard model has been commonly employed for survival datasets to construct the prediction model. Recently, the data-driven learning algorithm began to be used to analyze right-censored survival data. However, there is no attempt to use deep neural networks in dyslipidemia prediction. OBJECTIVE The objective of this study is to predict the risk of dyslipidemia via deep neural networks for survival data. METHODS The study cohort was based on the routine health check-up data from 6,328 participants aged 19 to 90 years and free of dyslipidemia at baseline. A deep neural network (DNN) was used to develop risk models for predicting dyslipidemia. Cox Proportional Hazards (Cox) and Random Survival Forests (RSF) were applied in comparison with the DNN model. As our metric of performance, we use the time-dependent concordance index (Ctd-index). RESULTS The Ctd-index of the prediction models by using DNN was 0.802. The DNN model performed significantly better than Cox and RSF model (Ctd-index: 0.735 and 0.770, respectively). The improvement of DNN over the competing methods was statistically significant. Moreover, DNN provides performance gain on time intervals compared to conventional survival models. CONCLUSIONS DNN is a promising method in learning the estimated distribution of survival time and event while capturing the right-censored nature inherent in survival data. DNN achieves large and statistically significant performance improvements over previous intuitive regression model and state-of-the-art data-mining methods.


Sign in / Sign up

Export Citation Format

Share Document