New Computational Tool Based on Machine-learning Algorithms for the Identification of Rhinovirus Infection-Related Genes

2020 ◽  
Vol 22 (10) ◽  
pp. 665-674 ◽  
Author(s):  
Yan Xu ◽  
Yu-Hang Zhang ◽  
JiaRui Li ◽  
Xiao Y. Pan ◽  
Tao Huang ◽  
...  

Background: Human rhinovirus has different identified serotypes and is the most common cause of cold in humans. To date, many genes have been discovered to be related to rhinovirus infection. However, the pathogenic mechanism of rhinovirus is difficult to elucidate through experimental approaches due to the high cost and consuming time. Method and Results: In this study, we presented a novel approach that relies on machine-learning algorithms and identified two genes OTOF and SOCS1. The expression levels of these genes in the blood samples can be used to accurately distinguish virus-infected and non-infected individuals. Conclusion: Our findings suggest the crucial roles of these two genes in rhinovirus infection and the robustness of the computational tool in dissecting pathogenic mechanisms.

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Duy Ngoc Nguyen ◽  
Tuoi Thi Phan ◽  
Phuc Do

AbstractSentiment classification, which uses deep learning algorithms, has achieved good results when tested with popular datasets. However, it will be challenging to build a corpus on new topics to train machine learning algorithms in sentiment classification with high confidence. This study proposes a method that processes embedding knowledge in the ontology of opinion datasets called knowledge processing and representation based on ontology (KPRO) to represent the significant features of the dataset into the word embedding layer of deep learning algorithms in sentiment classification. Unlike the methods that lexical encode or add information to the corpus, this method adds presentation of raw data based on the expert’s knowledge in the ontology. Once the data has a rich knowledge of the topic, the efficiency of the machine learning algorithms is significantly enhanced. Thus, this method is appliable to embed knowledge in datasets in other languages. The test results show that deep learning methods achieved considerably higher accuracy when trained with the KPRO method’s dataset than when trained with datasets not processed by this method. Therefore, this method is a novel approach to improve the accuracy of deep learning algorithms and increase the reliability of new datasets, thus making them ready for mining.


2020 ◽  
Vol 12 (35) ◽  
pp. 4303-4309
Author(s):  
Gustavo Larios ◽  
Gustavo Nicolodelli ◽  
Matheus Ribeiro ◽  
Thalita Canassa ◽  
Andre R. Reis ◽  
...  

A novel approach to distinguish soybean seed vigor based on Fourier transform infrared spectroscopy (FTIR) associated with chemometric methods is presented.


2021 ◽  
Author(s):  
Jie Cao ◽  
Jian Li ◽  
Zhen Gu ◽  
Jia-jia Niu ◽  
Guo-shuai An ◽  
...  

Abstract Background: Acute myocardial ischemia (AMI) remains the leading cause of death worldwide. In particular, when death occurs within a short time, it is hard to find post-mortem specific structural anomalies of the heart at autopsy with standard methods. Therefore, the post-mortem diagnosis of AMI represents a current challenge for both clinical and forensic pathologists. Metabolomics technology plays an important role in searching for new diagnostic biomarkers. Here, we characterize metabolic profiles of AMI and attempted to interpret the role of metabolic changes in sudden cardiac death (SCD).Methods: The untargeted metabolomics was applied to analyze serum metabolic signatures from AMI experimental group (ligation of left coronary artery at 5mm below the left atrial appendage in rats), along with the control and sham groups (n = 10 per group). The analytical strategy based on ultra performance liquid chromatography combined with high-resolution mass spectrometry. The resulting data was preprocessed to discriminant metabolites, and a set of machine learning algorithms were used to construct predictable models. Seventeen blood samples from autopsy cases were applied to validate the classification model's value in human samples.Results: A total of 28 endogenous metabolites in serum were significantly altered in AMI group relative to control and sham groups. Gradient tree boosting, support vector machines, random forests, logistic regression, and multilayer perceptron models were used to further screen the more valuable metabolites from 28 metabolites to optimize the biomarker panel. The results showed that classification accuracy and performance of multilayer perceptron (MLP) models were better than other algorithms when the metabolites consisting of L-threonic acid, N-acetyl-L-cysteine, CMPF, glycocholic acid, L-tyrosine, cholic acid, and glycoursodeoxycholic acid. In autopsy cases, the MLP model constructed based on rat dataset achieved an accuracy of 88.23, and ROC of 0.89 for predicting AMI-SCD.Conclusions: A panel of 7 molecular biomarkers was identified by assessment the accuracy and efficacy of different metabolite combinations in inferring AMI using machine learning algorithms. The constructed MLP model has a high diagnostic performance for both AMI rats and autopsies-based blood samples. Thus, the combination of metabolomics and machine learning algorithms provides a novel strategy for AMI diagnosis.


2020 ◽  
Author(s):  
Ashis Kumar Das ◽  
Shiba Mishra ◽  
Devi Kalyan Mishra ◽  
Saji Saraswathy Gopalan

AbstractBackgroundBladder cancer is the most common cancer of the urinary system among the American population and it is the fourth most common cause of cancer morbidity and the eight most common cause of cancer mortality among men. Using machine learning algorithms, we predict the five-year survival among bladder cancer patients and deploy the best performing algorithm as a web application for survival prediction.MethodsMicroscopically confirmed adult bladder cancer patients were included from the Surveillance Epidemiology and End Results (SEER) database (2000-2017) and randomly split into training and test datasets (70/30 ratio). Five machine learning algorithms (logistic regression, support vector machine, gradient boosting, random forest, and K nearest neighbor) were trained on features to predict five-year survival. The algorithms were compared with performance metrics and the best performing algorithm was deployed as a web application.ResultsA total of 52,529 patients were included in our study. The gradient boosting algorithm was the best performer in terms of predictive ability and discrimination. It was deployed as the survival prediction web application named BlaCaSurv (https://blacasurv.herokuapp.com/).ConclusionsWe tested several machine learning algorithms and developed a web application for predicting five-year survival for bladder cancer patients. This application can be used as a supplementary prognostic tool to clinical decision making.


2021 ◽  
Vol 7 ◽  
pp. e390
Author(s):  
Shafaq Abbas ◽  
Zunera Jalil ◽  
Abdul Rehman Javed ◽  
Iqra Batool ◽  
Mohammad Zubair Khan ◽  
...  

Breast cancer is one of the leading causes of death in the current age. It often results in subpar living conditions for a patient as they have to go through expensive and painful treatments to fight this cancer. One in eight women all over the world is affected by this disease. Almost half a million women annually do not survive this fight and die from this disease. Machine learning algorithms have proven to outperform all existing solutions for the prediction of breast cancer using models built on the previously available data. In this paper, a novel approach named BCD-WERT is proposed that utilizes the Extremely Randomized Tree and Whale Optimization Algorithm (WOA) for efficient feature selection and classification. WOA reduces the dimensionality of the dataset and extracts the relevant features for accurate classification. Experimental results on state-of-the-art comprehensive dataset demonstrated improved performance in comparison with eight other machine learning algorithms: Support Vector Machine (SVM), Random Forest, Kernel Support Vector Machine, Decision Tree, Logistic Regression, Stochastic Gradient Descent, Gaussian Naive Bayes and k-Nearest Neighbor. BCD-WERT outperformed all with the highest accuracy rate of 99.30% followed by SVM achieving 98.60% accuracy. Experimental results also reveal the effectiveness of feature selection techniques in improving prediction accuracy.


Sign in / Sign up

Export Citation Format

Share Document