Integrating ensemble systems biology feature selection and bimodal deep neural network for breast cancer prognosis prediction

AbstractBreast cancer is a heterogeneous disease. To guide proper treatment decisions for each patient, robust prognostic biomarkers, which allow reliable prognosis prediction, are necessary. Gene feature selection based on microarray data is an approach to discover potential biomarkers systematically. However, standard pure-statistical feature selection approaches often fail to incorporate prior biological knowledge and select genes that lack biological insights. Besides, due to the high dimensionality and low sample size properties of microarray data, selecting robust gene features is an intrinsically challenging problem. We hence combined systems biology feature selection with ensemble learning in this study, aiming to select genes with biological insights and robust prognostic predictive power. Moreover, to capture breast cancer's complex molecular processes, we adopted a multi-gene approach to predict the prognosis status using deep learning classifiers. We found that all ensemble approaches could improve feature selection robustness, wherein the hybrid ensemble approach led to the most robust result. Among all prognosis prediction models, the bimodal deep neural network (DNN) achieved the highest test performance, further verified by survival analysis. In summary, this study demonstrated the potential of combining ensemble learning and bimodal DNN in guiding precision medicine.

Download Full-text

Integrating ensemble systems biology feature selection and bimodal deep neural network for breast cancer prognosis prediction

10.21203/rs.3.rs-208766/v1 ◽

2021 ◽

Author(s):

Li-Hsin Cheng ◽

Te-Cheng Hsu ◽

Che Lin

Keyword(s):

Breast Cancer ◽

Neural Network ◽

Feature Selection ◽

Systems Biology ◽

Ensemble Learning ◽

Microarray Data ◽

Deep Neural Network ◽

Prediction Models ◽

Biological Knowledge ◽

Prognosis Prediction

Abstract Breast cancer is a heterogeneous disease. To guide proper treatment decisions for each patient, robust prognostic biomarkers, which allow reliable prognosis prediction, are necessary. Gene feature selection based on microarray data is an approach to discover potential biomarkers systematically. However, standard pure-statistical feature selection approaches often fail to incorporate prior biological knowledge and select genes that lack biological insights. Besides, due to the high dimensionality and low sample size properties of microarray data, selecting robust gene features is an intrinsically challenging problem. We hence combined systems biology feature selection with ensemble learning in this study, aiming to select genes with biological insights and robust prognostic, predictive power. Moreover, to capture breast cancer's complex molecular processes, we adopted a multi-gene approach to predict the prognosis status using deep learning classifiers. We found that all ensemble approaches could improve feature selection robustness, wherein the hybrid ensemble approach led to the most robust result. Among all prognosis prediction models, the bimodal deep neural network (DNN) achieved the highest test performance, further verified by survival analysis. In summary, this study demonstrated the potential of combining ensemble learning and bimodal DNN in guiding precision medicine.

Download Full-text

Integrating ensemble systems biology feature selection and bimodal deep neural network for breast cancer prognosis prediction

10.1101/810176 ◽

2019 ◽

Author(s):

Li-Hsin Cheng ◽

Che Lin

Keyword(s):

Breast Cancer ◽

Neural Network ◽

Feature Selection ◽

Systems Biology ◽

Ensemble Learning ◽

Microarray Data ◽

Deep Neural Network ◽

Biological Knowledge ◽

Prognosis Prediction ◽

Gene Feature

AbstractMotivationBreast cancer is a heterogeneous disease. In order to guide proper treatment decisions for each individual patient, there is an urgent need for robust prognostic biomarkers that allow reliable prognosis prediction. Gene feature selection on microarray data is an approach to systematically discover potential biomarkers. However, common pure-statistical feature selection approaches often fail to incorporate prior biological knowledge and thus tend to select genes that lack biological insights. In addition, due to the high dimensionality and low sample size properties of microarray data, selecting robust gene features is an intrinsically challenging problem. We therefore combined systems biology feature selection with ensemble learning in this study, aiming to address the above challenges and select genes with biological insights, as well as robust prognostic predictive power. Moreover, in order to capture the complex molecular processes of breast cancer, where multiple disease-contributing genes may exist and interact, we adopted a multi-gene approach to predict the prognosis status using machine learning classifiers.ResultsWe systematically evaluated three different ensemble approaches that all improved the original systems biology feature selector. We found that compared to the most popular data-perturbation approach, function perturbation can produce significant improvement with just a few ensembles. Among all, the hybrid ensemble approach led to the most robust feature selection result, and the identified genes were shown to be highly involved in pathways, such as ubiquitination and cell cycle. Final prognosis prediction models were constructed using the identified genes and clinical information as input features. Among all models, bimodal deep neural network (DNN) achieved the highest AUC (area under receiver operating characteristic curve) in test performance evaluation, where subsequent survival analysis also verified its ability to differentiate patients with different prognosis statuses. In summary, the study demonstrated the potential of ensemble learning to improve gene feature selection robustness, as well as the potential of bimodal DNN in providing reliable prognosis prediction and guiding precision medicine.

Download Full-text

Detection of breast cancer using the infinite feature selection with genetic algorithm and deep neural network

Distributed and Parallel Databases ◽

10.1007/s10619-021-07355-w ◽

2021 ◽

Author(s):

S. S. Ittannavar ◽

R. H. Havaldar

Keyword(s):

Breast Cancer ◽

Neural Network ◽

Genetic Algorithm ◽

Feature Selection ◽

Deep Neural Network

Download Full-text

Prognosis prediction of human breast cancer by integrating deep neural network and support vector machine: Supervised feature extraction and classification for breast cancer prognosis prediction

2017 10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI) ◽

10.1109/cisp-bmei.2017.8301908 ◽

2017 ◽

Cited By ~ 3

Author(s):

Dongdong Sun ◽

Minghui Wang ◽

Huanqing Feng ◽

Ao Li

Keyword(s):

Breast Cancer ◽

Neural Network ◽

Support Vector Machine ◽

Human Breast Cancer ◽

Deep Neural Network ◽

Cancer Prognosis ◽

Support Vector ◽

Human Breast ◽

Prognosis Prediction ◽

Supervised Feature Extraction

Download Full-text

A Multimodal Deep Neural Network for Human Breast Cancer Prognosis Prediction by Integrating Multi-Dimensional Data

IEEE/ACM Transactions on Computational Biology and Bioinformatics ◽

10.1109/tcbb.2018.2806438 ◽

2019 ◽

Vol 16 (3) ◽

pp. 841-850 ◽

Cited By ~ 25

Author(s):

Dongdong Sun ◽

Minghui Wang ◽

Ao Li

Keyword(s):

Breast Cancer ◽

Neural Network ◽

Human Breast Cancer ◽

Deep Neural Network ◽

Breast Cancer Prognosis ◽

Cancer Prognosis ◽

Human Breast ◽

Prognosis Prediction

Download Full-text

Breast cancer diagnosis using multiple activation deep neural network

Concurrent Engineering ◽

10.1177/1063293x211025105 ◽

2021 ◽

pp. 1063293X2110251

Author(s):

K Vijayakumar ◽

Vinod J Kadam ◽

Sudhir Kumar Sharma

Keyword(s):

Breast Cancer ◽

Neural Network ◽

Deep Neural Network ◽

Breast Cancer Diagnosis ◽

Activation Function ◽

Linear Functions ◽

Cancer Data ◽

Final Layer ◽

Improved Performance ◽

Hidden Layer

Deep Neural Network (DNN) stands for multilayered Neural Network (NN) that is capable of progressively learn the more abstract and composite representations of the raw features of the input data received, with no need for any feature engineering. They are advanced NNs having repetitious hidden layers between the initial input and the final layer. The working principle of such a standard deep classifier is based on a hierarchy formed by the composition of linear functions and a defined nonlinear Activation Function (AF). It remains uncertain (not clear) how the DNN classifier can function so well. But it is clear from many studies that within DNN, the AF choice has a notable impact on the kinetics of training and the success of tasks. In the past few years, different AFs have been formulated. The choice of AF is still an area of active study. Hence, in this study, a novel deep Feed forward NN model with four AFs has been proposed for breast cancer classification: hidden layer 1: Swish, hidden layer, 2:-LeakyReLU, hidden layer 3: ReLU, and final output layer: naturally Sigmoidal. The purpose of the study is twofold. Firstly, this study is a step toward a more profound understanding of DNN with layer-wise different AFs. Secondly, research is also aimed to explore better DNN-based systems to build predictive models for breast cancer data with improved accuracy. Therefore, the benchmark UCI dataset WDBC was used for the validation of the framework and evaluated using a ten-fold CV method and various performance indicators. Multiple simulations and outcomes of the experimentations have shown that the proposed solution performs in a better way than the Sigmoid, ReLU, and LeakyReLU and Swish activation DNN in terms of different parameters. This analysis contributes to producing an expert and precise clinical dataset classification method for breast cancer. Furthermore, the model also achieved improved performance compared to many established state-of-the-art algorithms/models.

Download Full-text

Multiple-Molecule Drug Design Based on Systems Biology Approaches and Deep Neural Network to Mitigate Human Skin Aging

Molecules ◽

10.3390/molecules26113178 ◽

2021 ◽

Vol 26 (11) ◽

pp. 3178

Author(s):

Shan-Ju Yeh ◽

Jin-Fu Lin ◽

Bor-Sen Chen

Keyword(s):

Neural Network ◽

Systems Biology ◽

Signaling Pathways ◽

Human Skin ◽

Deep Neural Network ◽

Middle Age ◽

Skin Aging ◽

Identification System ◽

Potential Candidate ◽

Systems Medicine

Human skin aging is affected by various biological signaling pathways, microenvironment factors and epigenetic regulations. With the increasing demand for cosmetics and pharmaceuticals to prevent or reverse skin aging year by year, designing multiple-molecule drugs for mitigating skin aging is indispensable. In this study, we developed strategies for systems medicine design based on systems biology methods and deep neural networks. We constructed the candidate genomewide genetic and epigenetic network (GWGEN) via big database mining. After doing systems modeling and applying system identification, system order detection and principle network projection methods with real time-profile microarray data, we could obtain core signaling pathways and identify essential biomarkers based on the skin aging molecular progression mechanisms. Afterwards, we trained a deep neural network of drug–target interaction in advance and applied it to predict the potential candidate drugs based on our identified biomarkers. To narrow down the candidate drugs, we designed two filters considering drug regulation ability and drug sensitivity. With the proposed systems medicine design procedure, we not only shed the light on the skin aging molecular progression mechanisms but also suggested two multiple-molecule drugs for mitigating human skin aging from young adulthood to middle age and middle age to old age, respectively.

Download Full-text