LncRNA functional annotation with improved false discovery rate achieved by disease associations

Abstract Functional annotation of protein sequence with high accuracy has become one of the most important issues in modern biomedical studies, and computational approaches of significantly accelerated analysis process and enhanced accuracy are greatly desired. Although a variety of methods have been developed to elevate protein annotation accuracy, their ability in controlling false annotation rates remains either limited or not systematically evaluated. In this study, a protein encoding strategy, together with a deep learning algorithm, was proposed to control the false discovery rate in protein function annotation, and its performances were systematically compared with that of the traditional similarity-based and de novo approaches. Based on a comprehensive assessment from multiple perspectives, the proposed strategy and algorithm were found to perform better in both prediction stability and annotation accuracy compared with other de novo methods. Moreover, an in-depth assessment revealed that it possessed an improved capacity of controlling the false discovery rate compared with traditional methods. All in all, this study not only provided a comprehensive analysis on the performances of the newly proposed strategy but also provided a tool for the researcher in the fields of protein function annotation.

Download Full-text

A Pleiotropy-Informed Bayesian False Discovery Rate adapted to a Shared Control Design Finds New Disease Associations From GWAS Summary Statistics

10.1101/014886 ◽

2015 ◽

Cited By ~ 2

Author(s):

James Liley ◽

Chris Wallace

Keyword(s):

False Discovery Rate ◽

Upper Bound ◽

Association Studies ◽

Genome Wide Association Studies ◽

Summary Statistics ◽

Nucleotide Polymorphisms ◽

P Values ◽

Replication Studies ◽

False Discovery ◽

Disease Associations

Genome-wide association studies (GWAS) have been successful in identifying single nucleotide polymorphisms (SNPs) associated with many traits and diseases. However, at existing sample sizes, these variants explain only part of the estimated heritability. Leverage of GWAS results from related phenotypes may improve detection without the need for larger datasets. The Bayesian conditional false discovery rate (cFDR) constitutes an upper bound on the expected false discovery rate (FDR) across a set of SNPs whose p values for two diseases are both less than two disease-specific thresholds. Calculation of the cFDR requires only summary statistics and has several advantages over traditional GWAS analysis. However, existing methods require distinct control samples between studies. Here, we extend the technique to allow for some or all controls to be shared, increasing applicability. Several different SNP sets can be defined with the same cFDR value, and we show that the expected FDR across the union of these sets may exceed expected FDR in any single set. We describe a procedure to establish an upper bound for the expected FDR among the union of such sets of SNPs. We apply our technique to pairwise analysis of p values from ten autoimmune diseases with variable sharing of controls, enabling discovery of 59 SNP-disease associations which do not reach GWAS significance after genomic control in individual datasets. Most of the SNPs we highlight have previously been confirmed using replication studies or larger GWAS, a useful validation of our technique; we report eight SNP-disease associations across five diseases not previously declared. Our technique extends and strengthens the previous algorithm, and establishes robust limits on the expected FDR. This approach can improve SNP detection in GWAS, and give insight into shared aetiology between phenotypically related conditions.

Download Full-text

Faculty Opinions recommendation of An investigation of the false discovery rate and the misinterpretation of p-values.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.725432010.793514527 ◽

2016 ◽

Author(s):

Geoffrey Goodhill

Keyword(s):

False Discovery Rate ◽

P Values ◽

False Discovery

Download Full-text

A simple yet efficient method of local false discovery rate estimation designed for genome-wide association data analysis

Statistical Methods & Applications ◽

10.1007/s10260-021-00560-y ◽

2021 ◽

Author(s):

Ali Karimnezhad

Keyword(s):

Data Analysis ◽

False Discovery Rate ◽

Efficient Method ◽

Genome Wide Association ◽

Local False Discovery Rate ◽

Rate Estimation ◽

False Discovery ◽

Genome Wide ◽

False Discovery Rate Estimation ◽

Association Data

Download Full-text

False Discovery Rate Control Under General Dependence By Symmetrized Data Aggregation

Journal of the American Statistical Association ◽

10.1080/01621459.2021.1945459 ◽

2021 ◽

pp. 1-34

Author(s):

Lilun Du ◽

Xu Guo ◽

Wenguang Sun ◽

Changliang Zou

Keyword(s):

False Discovery Rate ◽

Rate Control ◽

Data Aggregation ◽

False Discovery Rate Control ◽

False Discovery

Download Full-text

False Discovery Rate in Linkage and Association Genome Screens for Complex Disorders

Genetics ◽

10.1093/genetics/164.2.829 ◽

2003 ◽

Vol 164 (2) ◽

pp. 829-833

Author(s):

Chiara Sabatti ◽

Susan Service ◽

Nelson Freimer

Keyword(s):

Gene Mapping ◽

False Discovery Rate ◽

Disease Gene ◽

Susceptibility Genes ◽

Complex Disorders ◽

Disease Gene Mapping ◽

False Discovery ◽

Simple Step ◽

Multiple Comparison Procedure ◽

Step Down

Abstract We explore the implications of the false discovery rate (FDR) controlling procedure in disease gene mapping. With the aid of simulations, we show how, under models commonly used, the simple step-down procedure introduced by Benjamini and Hochberg controls the FDR for the dependent tests on which linkage and association genome screens are based. This adaptive multiple comparison procedure may offer an important tool for mapping susceptibility genes for complex diseases.

Download Full-text

P14.21 Tehila Kaisman-Elbaz MD/PhD

Neuro-Oncology ◽

10.1093/neuonc/noz126.256 ◽

2019 ◽

Vol 21 (Supplement_3) ◽

pp. iii71-iii71

Author(s):

T Kaisman-Elbaz ◽

Y Elbaz ◽

V Merkin ◽

L Dym ◽

A Noy ◽

...

Keyword(s):

Overall Survival ◽

Multivariate Analysis ◽

False Discovery Rate ◽

Medical Center ◽

Tumor Resection ◽

Multiple Hypothesis Testing ◽

Hemoglobin Level ◽

Distribution Width ◽

False Discovery ◽

Dismal Prognosis

Abstract BACKGROUND Glioblastoma is known for its dismal prognosis though its dependency on patients’ readily available RBCs parameters defining the patient’s anemic status such as hemoglobin level and Red blood cells distribution Width (RDW) is not fully established. Several works demonstrated a connection between low hemoglobin level or high RDW values to overall glioblastoma patient’s survival, but in other works, a clear connection was not found. This study addresses this unclarity. MATERIAL AND METHODS In this work, 170 glioblastoma patients, diagnosed and treated in Soroka University Medical Center (SUMC) in the last 12 years were retrospectively inspected for their survival dependency on pre-operative RBCs parameters using multivariate analysis followed by false discovery rate procedure due to the multiple hypothesis testing. A survival stratification tree and Kaplan-Meier survival curves that indicate the patient’s prognosis according to these parameters were prepared. RESULTS Beside KPS>70 and tumor resection supplemented by oncological treatment, age<70 (HR=0.4, 95% CI 0.24–0.65), low hemoglobin level (HR=1.79, 95% CI 1.06–2.99) and RDW<14% (HR=0.57, 95% CI 0.37–0.88) were found to be prognostic to patients’ overall survival in multivariate analysis, accounting for false discovery rate of less than 5%. CONCLUSION A survival stratification highlighted a non-anemic subgroup of nearly 30% of the cohort’s patients whose median overall survival was 21.1 months (95% CI 16.2–27.2) - higher than the average Stupp protocol overall median survival of about 15 months. A discussion on the beneficial or detrimental effect of RBCs parameters on glioblastoma prognosis and its possible causes is given.

Download Full-text

Associations Between Genetically Predicted Protein Levels and COVID-19 Severity

The Journal of Infectious Diseases ◽

10.1093/infdis/jiaa660 ◽

2020 ◽

Vol 223 (1) ◽

pp. 19-22

Author(s):

Jingjing Zhu ◽

Chong Wu ◽

Lang Wu

Keyword(s):

False Discovery Rate ◽

Drug Repurposing ◽

Bonferroni Correction ◽

Host Genetics ◽

Protein Levels ◽

False Discovery

Abstract It is critical to identify potential causal targets for SARS-CoV-2, which may guide drug repurposing options. We assessed the associations between genetically predicted protein levels and COVID-19 severity. Leveraging data from the COVID-19 Host Genetics Initiative comparing 6492 hospitalized COVID-19 patients and 1 012 809 controls, we identified 18 proteins with genetically predicted levels to be associated with COVID-19 severity at a false discovery rate of <0.05, including 12 that showed an association even after Bonferroni correction. Of the 18 proteins, 6 showed positive associations and 12 showed inverse associations. In conclusion, we identified 18 candidate proteins for COVID-19 severity.

Download Full-text

Resampling-based false discovery rate controlling multiple test procedures for correlated test statistics

Journal of Statistical Planning and Inference ◽

10.1016/s0378-3758(99)00041-5 ◽

1999 ◽

Vol 82 (1-2) ◽

pp. 171-196 ◽

Cited By ~ 325

Author(s):

Daniel Yekutieli ◽

Yoav Benjamini

Keyword(s):

False Discovery Rate ◽

Test Statistics ◽

Test Procedures ◽

Multiple Test ◽

False Discovery

Download Full-text