scholarly journals A 127 kb truncating deletion of PGRMC1 is a novel cause of X-linked isolated paediatric cataract

Author(s):  
Johanna L. Jones ◽  
Mark A. Corbett ◽  
Elise Yeaman ◽  
Duran Zhao ◽  
Jozef Gecz ◽  
...  

AbstractInherited paediatric cataract is a rare Mendelian disease that results in visual impairment or blindness due to a clouding of the eye’s crystalline lens. Here we report an Australian family with isolated paediatric cataract, which we had previously mapped to Xq24. Linkage at Xq24–25 (LOD = 2.53) was confirmed, and the region refined with a denser marker map. In addition, two autosomal regions with suggestive evidence of linkage were observed. A segregating 127 kb deletion (chrX:g.118373226_118500408del) in the Xq24–25 linkage region was identified from whole-genome sequencing data. This deletion completely removed a commonly deleted long non-coding RNA gene LOC101928336 and truncated the protein coding progesterone receptor membrane component 1 (PGRMC1) gene following exon 1. A literature search revealed a report of two unrelated males with non-syndromic intellectual disability, as well as congenital cataract, who had contiguous gene deletions that accounted for their intellectual disability but also disrupted the PGRMC1 gene. A morpholino-induced pgrmc1 knockdown in a zebrafish model produced significant cataract formation, supporting a role for PGRMC1 in lens development and cataract formation. We hypothesise that the loss of PGRMC1 causes cataract through disrupted PGRMC1-CYP51A1 protein–protein interactions and altered cholesterol biosynthesis. The cause of paediatric cataract in this family is the truncating deletion of PGRMC1, which we report as a novel cataract gene.


2021 ◽  
Vol 11 (2) ◽  
pp. 131
Author(s):  
Laura B. Scheinfeldt ◽  
Andrew Brangan ◽  
Dara M. Kusic ◽  
Sudhir Kumar ◽  
Neda Gharani

Pharmacogenomics holds the promise of personalized drug efficacy optimization and drug toxicity minimization. Much of the research conducted to date, however, suffers from an ascertainment bias towards European participants. Here, we leverage publicly available, whole genome sequencing data collected from global populations, evolutionary characteristics, and annotated protein features to construct a new in silico machine learning pharmacogenetic identification method called XGB-PGX. When applied to pharmacogenetic data, XGB-PGX outperformed all existing prediction methods and identified over 2000 new pharmacogenetic variants. While there are modest pharmacogenetic allele frequency distribution differences across global population samples, the most striking distinction is between the relatively rare putatively neutral pharmacogene variants and the relatively common established and newly predicted functional pharamacogenetic variants. Our findings therefore support a focus on individual patient pharmacogenetic testing rather than on clinical presumptions about patient race, ethnicity, or ancestral geographic residence. We further encourage more attention be given to the impact of common variation on drug response and propose a new ‘common treatment, common variant’ perspective for pharmacogenetic prediction that is distinct from the types of variation that underlie complex and Mendelian disease. XGB-PGX has identified many new pharmacovariants that are present across all global communities; however, communities that have been underrepresented in genomic research are likely to benefit the most from XGB-PGX’s in silico predictions.



Author(s):  
Christoph Schaniel ◽  
Priyanka Dhanan ◽  
Bin Hu ◽  
Yuguang Xiong ◽  
Teeya Raghunandan ◽  
...  

AbstractA library of well-characterized human induced pluripotent stem cell (hiPSC) lines from clinically healthy human subjects could serve as a powerful resource of normal controls for in vitro human development, disease modeling, genotype-phenotype association studies, and drug response evaluation. We report generation and extensive characterization of a gender-balanced, racially/ethnically diverse library of hiPSC lines from forty clinically healthy human individuals who range in age from 22-61. The hiPSCs match the karyotype and short tandem repeat identity of their parental fibroblasts, and have a transcription profile characteristic of pluripotent stem cells. We provide whole genome sequencing data for one hiPSC clone from each individual, ancestry determination, and analysis of Mendelian disease genes and risks. We document similar physiology of cardiomyocytes differentiated from multiple independent hiPSC clones derived from two individuals. This extensive characterization makes this hiPSC library a unique and valuable resource for many studies on human biology.



2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Dimitrios Vitsios ◽  
Ryan S. Dhindsa ◽  
Lawrence Middleton ◽  
Ayal B. Gussow ◽  
Slavé Petrovski

AbstractElucidating functionality in non-coding regions is a key challenge in human genomics. It has been shown that intolerance to variation of coding and proximal non-coding sequence is a strong predictor of human disease relevance. Here, we integrate intolerance to variation, functional genomic annotations and primary genomic sequence to build JARVIS: a comprehensive deep learning model to prioritize non-coding regions, outperforming other human lineage-specific scores. Despite being agnostic to evolutionary conservation, JARVIS performs comparably or outperforms conservation-based scores in classifying pathogenic single-nucleotide and structural variants. In constructing JARVIS, we introduce the genome-wide residual variation intolerance score (gwRVIS), applying a sliding-window approach to whole genome sequencing data from 62,784 individuals. gwRVIS distinguishes Mendelian disease genes from more tolerant CCDS regions and highlights ultra-conserved non-coding elements as the most intolerant regions in the human genome. Both JARVIS and gwRVIS capture previously inaccessible human-lineage constraint information and will enhance our understanding of the non-coding genome.



2021 ◽  
Vol 6 (1) ◽  
Author(s):  
Chun-Yu Wei ◽  
Jenn-Hwai Yang ◽  
Erh-Chan Yeh ◽  
Ming-Fang Tsai ◽  
Hsiao-Jung Kao ◽  
...  

AbstractPersonalized medical care focuses on prediction of disease risk and response to medications. To build the risk models, access to both large-scale genomic resources and human genetic studies is required. The Taiwan Biobank (TWB) has generated high-coverage, whole-genome sequencing data from 1492 individuals and genome-wide SNP data from 103,106 individuals of Han Chinese ancestry using custom SNP arrays. Principal components analysis of the genotyping data showed that the full range of Han Chinese genetic variation was found in the cohort. The arrays also include thousands of known functional variants, allowing for simultaneous ascertainment of Mendelian disease-causing mutations and variants that affect drug metabolism. We found that 21.2% of the population are mutation carriers of autosomal recessive diseases, 3.1% have mutations in cancer-predisposing genes, and 87.3% carry variants that affect drug response. We highlight how TWB data provide insight into both population history and disease burden, while showing how widespread genetic testing can be used to improve clinical care.



2021 ◽  
Vol 12 ◽  
Author(s):  
Christopher M. Grochowski ◽  
Ana C. V. Krepischi ◽  
Jesper Eisfeldt ◽  
Haowei Du ◽  
Debora R. Bertola ◽  
...  

Chromoanagenesis is a descriptive term that encompasses classes of catastrophic mutagenic processes that generate localized and complex chromosome rearrangements in both somatic and germline genomes. Herein, we describe a 5-year-old female presenting with a constellation of clinical features consistent with a clinical diagnosis of Coffin–Siris syndrome 1 (CSS1). Initial G-banded karyotyping detected a 90-Mb pericentric and a 47-Mb paracentric inversion on a single chromosome. Subsequent analysis of short-read whole-genome sequencing data and genomic optical mapping revealed additional inversions, all clustered on chromosome 6, one of them disrupting ARID1B for which haploinsufficiency leads to the CSS1 disease trait (MIM:135900). The aggregate structural variant data show that the resolved, the resolved derivative chromosome architecture presents four de novo inversions, one pericentric and three paracentric, involving six breakpoint junctions in what appears to be a shuffling of genomic material on this chromosome. Each junction was resolved to nucleotide-level resolution with mutational signatures suggestive of non-homologous end joining. The disruption of the gene ARID1B is shown to occur between the fourth and fifth exon of the canonical transcript with subsequent qPCR studies confirming a decrease in ARID1B expression in the patient versus healthy controls. Deciphering the underlying genomic architecture of chromosomal rearrangements and complex structural variants may require multiple technologies and can be critical to elucidating the molecular etiology of a patient’s clinical phenotype or resolving unsolved Mendelian disease cases.



Author(s):  
Eric S Tvedte ◽  
Mark Gasser ◽  
Benjamin C Sparklin ◽  
Jane Michalski ◽  
Carl E Hjelmen ◽  
...  

Abstract The newest generation of DNA sequencing technology is highlighted by the ability to generate sequence reads hundreds of kilobases in length. Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) have pioneered competitive long read platforms, with more recent work focused on improving sequencing throughput and per-base accuracy. We used whole-genome sequencing data produced by three PacBio protocols (Sequel II CLR, Sequel II HiFi, RS II) and two ONT protocols (Rapid Sequencing and Ligation Sequencing) to compare assemblies of the bacteria Escherichia coli and the fruit fly Drosophila ananassae. In both organisms tested, Sequel II assemblies had the highest consensus accuracy, even after accounting for differences in sequencing throughput. ONT and PacBio CLR had the longest reads sequenced compared to PacBio RS II and HiFi, and genome contiguity was highest when assembling these datasets. ONT Rapid Sequencing libraries had the fewest chimeric reads in addition to superior quantification of E. coli plasmids versus ligation-based libraries. The quality of assemblies can be enhanced by adopting hybrid approaches using Illumina libraries for bacterial genome assembly or polishing eukaryotic genome assemblies, and an ONT-Illumina hybrid approach would be more cost-effective for many users. Genome-wide DNA methylation could be detected using both technologies, however ONT libraries enabled the identification of a broader range of known E. coli methyltransferase recognition motifs in addition to undocumented D. ananassae motifs. The ideal choice of long read technology may depend on several factors including the question or hypothesis under examination. No single technology outperformed others in all metrics examined.



2021 ◽  
Vol 12 (1) ◽  
Author(s):  
Zhongbo Chen ◽  
◽  
David Zhang ◽  
Regina H. Reynolds ◽  
Emil K. Gustavsson ◽  
...  

AbstractKnowledge of genomic features specific to the human lineage may provide insights into brain-related diseases. We leverage high-depth whole genome sequencing data to generate a combined annotation identifying regions simultaneously depleted for genetic variation (constrained regions) and poorly conserved across primates. We propose that these constrained, non-conserved regions (CNCRs) have been subject to human-specific purifying selection and are enriched for brain-specific elements. We find that CNCRs are depleted from protein-coding genes but enriched within lncRNAs. We demonstrate that per-SNP heritability of a range of brain-relevant phenotypes are enriched within CNCRs. We find that genes implicated in neurological diseases have high CNCR density, including APOE, highlighting an unannotated intron-3 retention event. Using human brain RNA-sequencing data, we show the intron-3-retaining transcript to be more abundant in Alzheimer’s disease with more severe tau and amyloid pathological burden. Thus, we demonstrate potential association of human-lineage-specific sequences in brain development and neurological disease.



Sign in / Sign up

Export Citation Format

Share Document