Automated cell analysis tool for a genome-wide RNAi screen with support vector machine based supervised learning

2011 ◽  
Author(s):  
Steffen Remmele ◽  
Julia Ritzerfeld ◽  
Walter Nickel ◽  
Jürgen Hesser
2013 ◽  
Vol 2013 ◽  
pp. 1-8 ◽  
Author(s):  
Jinseog Kim ◽  
Insuk Sohn ◽  
Dennis (Dong Hwan) Kim ◽  
Sin-Ho Jung

One of main objectives of a genome-wide association study (GWAS) is to develop a prediction model for a binary clinical outcome using single-nucleotide polymorphisms (SNPs) which can be used for diagnostic and prognostic purposes and for better understanding of the relationship between the disease and SNPs. Penalized support vector machine (SVM) methods have been widely used toward this end. However, since investigators often ignore the genetic models of SNPs, a final model results in a loss of efficiency in prediction of the clinical outcome. In order to overcome this problem, we propose a two-stage method such that the the genetic models of each SNP are identified using the MAX test and then a prediction model is fitted using a penalized SVM method. We apply the proposed method to various penalized SVMs and compare the performance of SVMs using various penalty functions. The results from simulations and real GWAS data analysis show that the proposed method performs better than the prediction methods ignoring the genetic models in terms of prediction power and selectivity.


2014 ◽  
Vol 226 (03) ◽  
Author(s):  
F Ponthan ◽  
D Pal ◽  
J Vormoor ◽  
O Heidenreich
Keyword(s):  

2011 ◽  
Vol 195 (6) ◽  
pp. i9-i9 ◽  
Author(s):  
Bart A. Westerman ◽  
A. Koen Braat ◽  
Nicole Taub ◽  
Marko Potman ◽  
Joseph H.A. Vissers ◽  
...  

2020 ◽  
Author(s):  
Lei Li ◽  
Yanjie Chao

ABSTRACTSmall proteins shorter than 50 amino acids have been long overlooked. A number of small proteins have been identified in several model bacteria using experimental approaches and assigned important functions in diverse cellular processes. The recent development of ribosome profiling technologies has allowed a genome-wide identification of small proteins and small ORFs (smORFs), but our incomplete understanding of small proteins hinders de novo computational prediction of smORFs in non-model bacterial species. Here, we have identified several sequence features for smORFs by a systematic analysis of all the known small proteins in E. coli, among which the translation initiation rate is the strongest determinant. By integrating these features into a support vector machine learning model, we have developed a novel sPepFinder algorithm that can predict conserved smORFs in bacterial genomes with a high accuracy of 92.8%. De novo prediction in E. coli has revealed several novel smORFs with evidence of translation supported by ribosome profiling. Further application of sPepFinder in 549 bacterial species has led to the identification of > 100,000 novel smORFs, many of which are conserved at the amino acid and nucleotide levels under purifying selection. Overall, we have established sPepFinder as a valuable tool to identify novel smORFs in both model and non-model bacterial organisms, and provided a large resource of small proteins for functional characterizations.


2011 ◽  
Vol 2 (11) ◽  
pp. 918-939 ◽  
Author(s):  
Yinyan Sun ◽  
Peiguo Yang ◽  
Yuxia Zhang ◽  
Xin Bao ◽  
Jun Li ◽  
...  
Keyword(s):  
P Bodies ◽  

Sign in / Sign up

Export Citation Format

Share Document