Data Description and Treatment

2015 ◽  
pp. 299-330
Author(s):  
Bilal M. Ayyub ◽  
Richard H. Mccuen
Keyword(s):  
2020 ◽  
Vol 15 ◽  
Author(s):  
Yi Zou ◽  
Hongjie Wu ◽  
Xiaoyi Guo ◽  
Li Peng ◽  
Yijie Ding ◽  
...  

Background: Detecting DNA-binding proetins (DBPs) based on biological and chemical methods is time consuming and expensive. Objective: In recent years, the rise of computational biology methods based on Machine Learning (ML) has greatly improved the detection efficiency of DBPs. Method: In this study, Multiple Kernel-based Fuzzy SVM Model with Support Vector Data Description (MK-FSVM-SVDD) is proposed to predict DBPs. Firstly, sex features are extracted from protein sequence. Secondly, multiple kernels are constructed via these sequence feature. Than, multiple kernels are integrated by Centered Kernel Alignment-based Multiple Kernel Learning (CKA-MKL). Next, fuzzy membership scores of training samples are calculated with Support Vector Data Description (SVDD). FSVM is trained and employed to detect new DBPs. Results: Our model is test on several benchmark datasets. Compared with other methods, MK-FSVM-SVDD achieves best Matthew's Correlation Coefficient (MCC) on PDB186 (0.7250) and PDB2272 (0.5476). Conclusion: We can conclude that MK-FSVM-SVDD is more suitable than common SVM, as the classifier for DNA-binding proteins identification.


2021 ◽  
Vol 14 (1) ◽  
Author(s):  
Velma Herwanto ◽  
Benjamin Tang ◽  
Ya Wang ◽  
Maryam Shojaei ◽  
Marek Nalos ◽  
...  

Abstract Objectives Hospitalized patients who presented within the last 24 h with a bacterial infection were recruited. Participants were assigned into sepsis and uncomplicated infection groups. In addition, healthy volunteers were recruited as controls. RNA was prepared from whole blood, depleted from beta-globin mRNA and sequenced. This dataset represents a highly valuable resource to better understand the biology of sepsis and to identify biomarkers for severe sepsis in humans. Data description The data presented here consists of raw and processed transcriptome data obtained by next generation RNA sequencing from 105 peripheral blood samples from patients with uncomplicated infections, patients who developed sepsis, septic shock patients, and healthy controls. It is provided as raw sequenced reads and as normalized log2 transformed relative expression levels. This data will allow performing detailed analyses of gene expression changes between uncomplicated infections and sepsis patients, such as identification of differentially expressed genes, co-regulated modules as well as pathway activation studies.


Sign in / Sign up

Export Citation Format

Share Document