Human Genomics | ScienceGate

Contribution of 3D genome topological domains to genetic risk of cancers: a genome-wide computational study

Human Genomics ◽

10.1186/s40246-022-00375-2 ◽

2022 ◽

Vol 16 (1) ◽

Author(s):

Kim Philipp Jablonski ◽

Leopold Carron ◽

Julien Mozziconacci ◽

Thierry Forné ◽

Marc-Thorsten Hütt ◽

...

Keyword(s):

Genetic Risk ◽

Genome Organization ◽

Computational Study ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Topological Domains ◽

3D Genome ◽

Genome Wide ◽

A Genome

Abstract Background Genome-wide association studies have identified statistical associations between various diseases, including cancers, and a large number of single-nucleotide polymorphisms (SNPs). However, they provide no direct explanation of the mechanisms underlying the association. Based on the recent discovery that changes in three-dimensional genome organization may have functional consequences on gene regulation favoring diseases, we investigated systematically the genome-wide distribution of disease-associated SNPs with respect to a specific feature of 3D genome organization: topologically associating domains (TADs) and their borders. Results For each of 449 diseases, we tested whether the associated SNPs are present in TAD borders more often than observed by chance, where chance (i.e., the null model in statistical terms) corresponds to the same number of pointwise loci drawn at random either in the entire genome, or in the entire set of disease-associated SNPs listed in the GWAS catalog. Our analysis shows that a fraction of diseases displays such a preferential localization of their risk loci. Moreover, cancers are relatively more frequent among these diseases, and this predominance is generally enhanced when considering only intergenic SNPs. The structure of SNP-based diseasome networks confirms that localization of risk loci in TAD borders differs between cancers and non-cancer diseases. Furthermore, different TAD border enrichments are observed in embryonic stem cells and differentiated cells, consistent with changes in topological domains along embryogenesis and delineating their contribution to disease risk. Conclusions Our results suggest that, for certain diseases, part of the genetic risk lies in a local genetic variation affecting the genome partitioning in topologically insulated domains. Investigating this possible contribution to genetic risk is particularly relevant in cancers. This study thus opens a way of interpreting genome-wide association studies, by distinguishing two types of disease-associated SNPs: one with an effect on an individual gene, the other acting in interplay with 3D genome organization.

Update of the keratin gene family: evolution, tissue-specific expression patterns, and relevance to clinical disorders

Human Genomics ◽

10.1186/s40246-021-00374-9 ◽

2022 ◽

Vol 16 (1) ◽

Author(s):

Minh Ho ◽

Brian Thompson ◽

Jeffrey Nicholas Fisk ◽

Daniel W. Nebert ◽

Elspeth A. Bruford ◽

...

Keyword(s):

Plasma Membranes ◽

Expression Patterns ◽

Tissue Expression ◽

Gene Family Evolution ◽

Type I ◽

Type Ii ◽

Specific Expression ◽

Keratin Gene ◽

Tissue Specific ◽

Tissue Specific Expression

AbstractIntermediate filament (IntFil) genes arose during early metazoan evolution, to provide mechanical support for plasma membranes contacting/interacting with other cells and the extracellular matrix. Keratin genes comprise the largest subset of IntFil genes. Whereas the first keratin gene appeared in sponge, and three genes in arthropods, more rapid increases in keratin genes occurred in lungfish and amphibian genomes, concomitant with land animal-sea animal divergence (~ 440 to 410 million years ago). Human, mouse and zebrafish genomes contain 18, 17 and 24 non-keratin IntFil genes, respectively. Human has 27 of 28 type I “acidic” keratin genes clustered at chromosome (Chr) 17q21.2, and all 26 type II “basic” keratin genes clustered at Chr 12q13.13. Mouse has 27 of 28 type I keratin genes clustered on Chr 11, and all 26 type II clustered on Chr 15. Zebrafish has 18 type I keratin genes scattered on five chromosomes, and 3 type II keratin genes on two chromosomes. Types I and II keratin clusters—reflecting evolutionary blooms of keratin genes along one chromosomal segment—are found in all land animal genomes examined, but not fishes; such rapid gene expansions likely reflect sudden requirements for many novel paralogous proteins having divergent functions to enhance species survival following sea-to-land transition. Using data from the Genotype-Tissue Expression (GTEx) project, tissue-specific keratin expression throughout the human body was reconstructed. Clustering of gene expression patterns revealed similarities in tissue-specific expression patterns for previously described “keratin pairs” (i.e., KRT1/KRT10, KRT8/KRT18, KRT5/KRT14, KRT6/KRT16 and KRT6/KRT17 proteins). The ClinVar database currently lists 26 human disease-causing variants within the various domains of keratin proteins.

Identification of a novel signature based on unfolded protein response-related gene for predicting prognosis in bladder cancer

Human Genomics ◽

10.1186/s40246-021-00372-x ◽

2021 ◽

Vol 15 (1) ◽

Author(s):

Ke Zhu ◽

Liu Xiaoqiang ◽

Wen Deng ◽

Gongxian Wang ◽

Bin Fu

Keyword(s):

Bladder Cancer ◽

High Risk ◽

Unfolded Protein Response ◽

Immune Cells ◽

Risk Groups ◽

Immune Checkpoints ◽

Related Gene ◽

Prognostic Signature ◽

Unfolded Protein ◽

Protein Response

Abstract Background The unfolded protein response (UPR) served as a vital role in the progression of tumors, but the molecule mechanisms of UPR in bladder cancer (BLCA) have been not fully investigated. Methods We identified differentially expressed unfolded protein response-related genes (UPRRGs) between BLCA samples and normal bladder samples in the Cancer Genome Atlas (TCGA) database. Univariate Cox analysis and the least absolute shrinkage and selection operator penalized Cox regression analysis were used to construct a prognostic signature in the TCGA set. We implemented the validation of the prognostic signature in GSE13507 from the Gene Expression Omnibus database. The ESTIMATE, CIBERSORT, and ssGSEA algorithms were used to explore the correlation between the prognostic signature and immune cells infiltration as well as key immune checkpoints (PD-1, PD-L1, CTLA-4, and HAVCR2). GDSC database analyses were conducted to investigate the chemotherapy sensitivity among different groups. GSEA analysis was used to explore the potential mechanisms of UPR-based signature. Results A prognostic signature comprising of seven genes (CALR, CRYAB, DNAJB4, KDELR3, CREB3L3, HSPB6, and FBXO6) was constructed to predict the outcome of BLCA. Based on the UPRRGs signature, the patients with BLCA could be classified into low-risk groups and high-risk groups. Patients with BLCA in the low-risk groups showed the more favorable outcomes than those in the high-risk groups, which was verified in GSE13507 set. This signature could serve as an autocephalous prognostic factor in BLCA. A nomogram based on risk score and clinical characteristics was established to predict the over survival of BLCA patients. Furthermore, the signature was closely related to immune checkpoints (PD-L1, CTLA-4, and HAVCR2) and immune cells infiltration including CD8+ T cells, follicular helper T cells, activated dendritic cells, and M2 macrophages. GSEA analysis indicated that immune and carcinogenic pathways were enriched in high-risk group. Conclusions We identified a novel unfolded protein response-related gene signature which could predict the over survival, immune microenvironment, and chemotherapy response of patients with bladder cancer.

Estimating prevalence of human traits among populations from polygenic risk scores

Human Genomics ◽

10.1186/s40246-021-00370-z ◽

2021 ◽

Vol 15 (1) ◽

Author(s):

Britney E. Graham ◽

Brian Plotkin ◽

Louis Muglia ◽

Jason H. Moore ◽

Scott M. Williams

Keyword(s):

Phenotypic Variation ◽

Significant Proportion ◽

Population Level ◽

Risk Scores ◽

Polygenic Risk Score ◽

Diverse Populations ◽

Polygenic Risk ◽

Risk Variants ◽

Complex Architecture ◽

Variance Explained

AbstractThe genetic basis of phenotypic variation across populations has not been well explained for most traits. Several factors may cause disparities, from variation in environments to divergent population genetic structure. We hypothesized that a population-level polygenic risk score (PRS) can explain phenotypic variation among geographic populations based solely on risk allele frequencies. We applied a population-specific PRS (psPRS) to 26 populations from the 1000 Genomes to four phenotypes: lactase persistence (LP), melanoma, multiple sclerosis (MS) and height. Our models assumed additive genetic architecture among the polymorphisms in the psPRSs, as is convention. Linear psPRSs explained a significant proportion of trait variance ranging from 0.32 for height in men to 0.88 for melanoma. The best models for LP and height were linear, while those for melanoma and MS were nonlinear. As not all variants in a PRS may confer similar, or even any, risk among diverse populations, we also filtered out SNPs to assess whether variance explained was improved using psPRSs with fewer SNPs. Variance explained usually improved with fewer SNPs in the psPRS and was as high as 0.99 for height in men using only 548 of the initial 4208 SNPs. That reducing SNPs improves psPRSs performance may indicate that missing heritability is partially due to complex architecture that does not mandate additivity, undiscovered variants or spurious associations in the databases. We demonstrated that PRS-based analyses can be used across diverse populations and phenotypes for population prediction and that these comparisons can identify the universal risk variants.

A regulatory miRNA–mRNA network is associated with transplantation response in acute kidney injury

Human Genomics ◽

10.1186/s40246-021-00363-y ◽

2021 ◽

Vol 15 (1) ◽

Author(s):

Duan Guo ◽

Yu Fan ◽

Ji-Rong Yue ◽

Tao Lin

Keyword(s):

Acute Kidney Injury ◽

Molecular Mechanisms ◽

Kidney Injury ◽

Mapk Signaling ◽

Rapid Decline ◽

Therapeutic Drugs ◽

Transplantation Surgery ◽

Life Threatening ◽

New Biomarkers ◽

Mapk Signaling Pathways

Abstract Background Acute kidney injury (AKI) is a life-threatening complication characterized by rapid decline in renal function, which frequently occurs after transplantation surgery. However, the molecular mechanism underlying the development of post-transplant (post-Tx) AKI still remains unknown. An increasing number of studies have demonstrated that certain microRNAs (miRNAs) exert crucial functions in AKI. The present study sought to elucidate the molecular mechanisms in post-Tx AKI by constructing a regulatory miRNA–mRNA network. Results Based on two datasets (GSE53771 and GSE53769), three key modules, which contained 55 mRNAs, 76 mRNAs, and 151 miRNAs, were identified by performing weighted gene co-expression network analysis (WGCNA). The miRDIP v4.1 was applied to predict the interactions of key module mRNAs and miRNAs, and the miRNA–mRNA pairs with confidence of more than 0.2 were selected to construct a regulatory miRNA–mRNA network by Cytoscape. The miRNA–mRNA network consisted of 82 nodes (48 mRNAs and 34 miRNAs) and 125 edges. Two miRNAs (miR-203a-3p and miR-205-5p) and ERBB4 with higher node degrees compared with other nodes might play a central role in post-Tx AKI. Additionally, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway analysis indicated that this network was mainly involved in kidney-/renal-related functions and PI3K–Akt/HIF-1/Ras/MAPK signaling pathways. Conclusion We constructed a regulatory miRNA–mRNA network to provide novel insights into post-Tx AKI development, which might help discover new biomarkers or therapeutic drugs for enhancing the ability for early prediction and intervention and decreasing mortality rate of AKI after transplantation.

Driving mosaicism: somatic variants in reference population databases and effect on variant interpretation in rare genetic disease

Human Genomics ◽

10.1186/s40246-021-00371-y ◽

2021 ◽

Vol 15 (1) ◽

Author(s):

Vladimir Avramović ◽

Simona Denise Frederiksen ◽

Marjana Brkić ◽

Maja Tarailo-Graovac

Keyword(s):

Cell Proliferation ◽

Genetic Variants ◽

Allelic Imbalance ◽

Autosomal Dominant ◽

Reference Population ◽

Mendelian Disease ◽

Single Nucleotide Variants ◽

Single Nucleotide ◽

The Impact ◽

Population Databases

Abstract Background Genetic variation databases provide invaluable information on the presence and frequency of genetic variants in the ‘untargeted’ human population, aggregated with the primary goal to facilitate the interpretation of clinically important variants. The presence of somatic variants in such databases can affect variant assessment in undiagnosed rare disease (RD) patients. Previously, the impact of somatic mosaicism was only considered in relation to two Mendelian disease-associated genes. Here, we expand the analyses to identify additional mosaicism-prone genes in blood-derived reference population databases. Results To identify additional mosaicism-prone genes relevant to RDs, we focused on known/previously established ClinVar pathogenic and likely pathogenic single-nucleotide variants, residing in genes associated with early onset, severe autosomal dominant diseases. We asked whether any of these variants are present in a higher-than-expected frequency in the reference population databases and whether there is evidence of somatic origin (i.e., allelic imbalance) rather than germline heterozygosity (~ half of the reads supporting alternative allele). The mosaicism-prone genes identified were further categorized according to the processes they are involved in. Beyond the previously reported ASXL1 and DNMT3A, we identified 7 additional autosomal dominant RD-associated genes with known pathogenic single-nucleotide variants present in the reference population databases and good evidence of allelic imbalance: BRAF, CBL, FGFR3, IDH2, KRAS, PTPN11 and SETBP1. From this group of 9 genes, the majority (n = 7) was important for hematopoiesis. In addition, 4 of these genes were involved in cell proliferation. Further assessment of the known 156 hematopoietic genes led to identification of 48 genes (21 not yet associated with RDs) with at least some evidence of mosaicism detectable in reference population databases. Conclusions These results stress the importance of considering genes involved in hematopoiesis and cell proliferation when interpreting the presence and frequency of genetic variants in blood-derived reference population databases, both public and private. This is especially important when considering new variants of uncertain significance in known hematopoietic/cell proliferation RD genes and future novel gene–disease associations involving this class of genes.

Detection of low-level parental somatic mosaicism for clinically relevant SNVs and indels identified in a large exome sequencing dataset

Human Genomics ◽

10.1186/s40246-021-00369-6 ◽

2021 ◽

Vol 15 (1) ◽

Author(s):

Daniel D. Domogala ◽

Tomasz Gambin ◽

Roni Zemet ◽

Chung Wah Wu ◽

Katharina V. Schulze ◽

...

Keyword(s):

Exome Sequencing ◽

Peripheral Blood ◽

De Novo ◽

Somatic Mosaicism ◽

Hair Follicles ◽

Clinical Diagnostics ◽

Diagnostic Methods ◽

Single Nucleotide Variants ◽

Bioinformatics Pipeline ◽

Low Level

Abstract Background Due to the limitations of the current routine diagnostic methods, low-level somatic mosaicism with variant allele fraction (VAF) < 10% is often undetected in clinical settings. To date, only a few studies have attempted to analyze tissue distribution of low-level parental mosaicism in a large clinical exome sequencing (ES) cohort. Methods Using a customized bioinformatics pipeline, we analyzed apparent de novo single-nucleotide variants or indels identified in the affected probands in ES trio data at Baylor Genetics clinical laboratories. Clinically relevant variants with VAFs between 30 and 70% in probands and lower than 10% in one parent were studied. DNA samples extracted from saliva, buccal cells, redrawn peripheral blood, urine, hair follicles, and nail, representing all three germ layers, were tested using PCR amplicon next-generation sequencing (amplicon NGS) and droplet digital PCR (ddPCR). Results In a cohort of 592 clinical ES trios, we found 61 trios, each with one parent suspected of low-level mosaicism. In 21 parents, the variants were validated using amplicon NGS and seven of them by ddPCR in peripheral blood DNA samples. The parental VAFs in blood samples varied between 0.08 and 9%. The distribution of VAFs in additional tissues ranged from 0.03% in hair follicles to 9% in re-drawn peripheral blood. Conclusions Our study illustrates the importance of analyzing ES data using sensitive computational and molecular methods for low-level parental somatic mosaicism for clinically relevant variants previously diagnosed in routine clinical diagnostics as apparent de novo.

Correction to: Whole exome sequencing identifies novel candidate genes that modify chronic obstructive pulmonary disease susceptibility

Human Genomics ◽

10.1186/s40246-021-00373-w ◽

2021 ◽

Vol 15 (1) ◽

Author(s):

Shannon Bruse ◽

Michael Moreau ◽

Yana Bromberg ◽

Jun-Ho Jang ◽

Nan Wang ◽

...

Keyword(s):

Chronic Obstructive Pulmonary Disease ◽

Exome Sequencing ◽

Candidate Genes ◽

Pulmonary Disease ◽

Whole Exome Sequencing ◽

Disease Susceptibility ◽

Chronic Obstructive ◽

Obstructive Pulmonary Disease ◽

Whole Exome

Single-cell transcriptome identifies molecular subtype of autism spectrum disorder impacted by de novo loss-of-function variants regulating glial cells

Human Genomics ◽

10.1186/s40246-021-00368-7 ◽

2021 ◽

Vol 15 (1) ◽

Author(s):

Nasna Nassir ◽

Asma Bankapur ◽

Bisan Samara ◽

Abdulrahman Ali ◽

Awab Ahmed ◽

...

Keyword(s):

Single Cell ◽

Large Scale ◽

De Novo ◽

Cell Types ◽

Brain Regions ◽

Autism Spectrum ◽

Loss Of Function ◽

Cell Transcriptome ◽

Single Cell Transcriptome ◽

Variant Genes

Abstract Background In recent years, several hundred autism spectrum disorder (ASD) implicated genes have been discovered impacting a wide range of molecular pathways. However, the molecular underpinning of ASD, particularly from the point of view of ‘brain to behaviour’ pathogenic mechanisms, remains largely unknown. Methods We undertook a study to investigate patterns of spatiotemporal and cell type expression of ASD-implicated genes by integrating large-scale brain single-cell transcriptomes (> million cells) and de novo loss-of-function (LOF) ASD variants (impacting 852 genes from 40,122 cases). Results We identified multiple single-cell clusters from three distinct developmental human brain regions (anterior cingulate cortex, middle temporal gyrus and primary visual cortex) that evidenced high evolutionary constraint through enrichment for brain critical exons and high pLI genes. These clusters also showed significant enrichment with ASD loss-of-function variant genes (p < 5.23 × 10–11) that are transcriptionally highly active in prenatal brain regions (visual cortex and dorsolateral prefrontal cortex). Mapping ASD de novo LOF variant genes into large-scale human and mouse brain single-cell transcriptome analysis demonstrate enrichment of such genes into neuronal subtypes and are also enriched for subtype of non-neuronal glial cell types (astrocyte, p < 6.40 × 10–11, oligodendrocyte, p < 1.31 × 10–09). Conclusion Among the ASD genes enriched with pathogenic de novo LOF variants (i.e. KANK1, PLXNB1), a subgroup has restricted transcriptional regulation in non-neuronal cell types that are evolutionarily conserved. This association strongly suggests the involvement of subtype of non-neuronal glial cells in the pathogenesis of ASD and the need to explore other biological pathways for this disorder.

RNA-seq driven expression and enrichment analysis to investigate CVD genes with associated phenotypes among high-risk heart failure patients

Human Genomics ◽

10.1186/s40246-021-00367-8 ◽

2021 ◽

Vol 15 (1) ◽

Author(s):

Zeeshan Ahmed ◽

Saman Zeeshan ◽

Bruce T. Liang

Keyword(s):

Heart Failure ◽

High Risk ◽

Differentially Expressed Genes ◽

Enrichment Analysis ◽

Differentially Expressed ◽

Rna Seq ◽

Genetic Associations ◽

Specific Analysis ◽

Gender Specific ◽

Unique Genes

Abstract Background Heart failure (HF) is one of the most common complications of cardiovascular diseases (CVDs) and among the leading causes of death in the US. Many other CVDs can lead to increased mortality as well. Investigating the genetic epidemiology and susceptibility to CVDs is a central focus of cardiology and biomedical life sciences. Several studies have explored expression of key CVD genes specially in HF, yet new targets and biomarkers for early diagnosis are still missing to support personalized treatment. Lack of gender-specific cardiac biomarker thresholds in men and women may be the reason for CVD underdiagnosis in women, and potentially increased morbidity and mortality as a result, or conversely, an overdiagnosis in men. In this context, it is important to analyze the expression and enrichment of genes with associated phenotypes and disease-causing variants among high-risk CVD populations. Methods We performed RNA sequencing focusing on key CVD genes with a great number of genetic associations to HF. Peripheral blood samples were collected from a broad age range of adult male and female CVD patients. These patients were clinically diagnosed with CVDs and CMS/HCC HF, as well as including cardiomyopathy, hypertension, obesity, diabetes, asthma, high cholesterol, hernia, chronic kidney, joint pain, dizziness and giddiness, osteopenia of multiple sites, chest pain, osteoarthritis, and other diseases. Results We report RNA-seq driven case–control study to analyze patterns of expression in genes and differentiating the pathways, which differ between healthy and diseased patients. Our in-depth gene expression and enrichment analysis of RNA-seq data from patients with mostly HF and other CVDs on differentially expressed genes and CVD annotated genes revealed 4,885 differentially expressed genes (DEGs) and regulation of 41 genes known for HF and 23 genes related to other CVDs, with 15 DEGs as significantly expressed including four genes already known (FLNA, CST3, LGALS3, and HBA1) for HF and CVDs with the enrichment of many pathways. Furthermore, gender and ethnic group specific analysis showed shared and unique genes between the genders, and among different races. Broadening the scope of the results in clinical settings, we have linked the CVD genes with ICD codes. Conclusions Many pathways were found to be enriched, and gender-specific analysis showed shared and unique genes between the genders. Additional testing of these genes may lead to the development of new clinical tools to improve diagnosis and prognosis of CVD patients.

Human Genomics
Latest Publications

TOTAL DOCUMENTS

H-INDEX

Published By Springer (Biomed Central Ltd.)

Contribution of 3D genome topological domains to genetic risk of cancers: a genome-wide computational study

Update of the keratin gene family: evolution, tissue-specific expression patterns, and relevance to clinical disorders

Identification of a novel signature based on unfolded protein response-related gene for predicting prognosis in bladder cancer

Estimating prevalence of human traits among populations from polygenic risk scores

A regulatory miRNA–mRNA network is associated with transplantation response in acute kidney injury

Driving mosaicism: somatic variants in reference population databases and effect on variant interpretation in rare genetic disease

Detection of low-level parental somatic mosaicism for clinically relevant SNVs and indels identified in a large exome sequencing dataset

Correction to: Whole exome sequencing identifies novel candidate genes that modify chronic obstructive pulmonary disease susceptibility

Single-cell transcriptome identifies molecular subtype of autism spectrum disorder impacted by de novo loss-of-function variants regulating glial cells

RNA-seq driven expression and enrichment analysis to investigate CVD genes with associated phenotypes among high-risk heart failure patients

Export Citation Format

Human GenomicsLatest Publications

TOTAL DOCUMENTS

H-INDEX

Published By Springer (Biomed Central Ltd.)

Contribution of 3D genome topological domains to genetic risk of cancers: a genome-wide computational study

Update of the keratin gene family: evolution, tissue-specific expression patterns, and relevance to clinical disorders

Identification of a novel signature based on unfolded protein response-related gene for predicting prognosis in bladder cancer

Estimating prevalence of human traits among populations from polygenic risk scores

A regulatory miRNA–mRNA network is associated with transplantation response in acute kidney injury

Driving mosaicism: somatic variants in reference population databases and effect on variant interpretation in rare genetic disease

Detection of low-level parental somatic mosaicism for clinically relevant SNVs and indels identified in a large exome sequencing dataset

Correction to: Whole exome sequencing identifies novel candidate genes that modify chronic obstructive pulmonary disease susceptibility

Single-cell transcriptome identifies molecular subtype of autism spectrum disorder impacted by de novo loss-of-function variants regulating glial cells

RNA-seq driven expression and enrichment analysis to investigate CVD genes with associated phenotypes among high-risk heart failure patients

Human Genomics
Latest Publications