small indels
Recently Published Documents


TOTAL DOCUMENTS

81
(FIVE YEARS 51)

H-INDEX

11
(FIVE YEARS 4)

2022 ◽  
Vol 23 (2) ◽  
pp. 654
Author(s):  
Yan Du ◽  
Zhuo Feng ◽  
Jie Wang ◽  
Wenjie Jin ◽  
Zhuanzi Wang ◽  
...  

Genetic variations are an important source of germplasm diversity, as it provides an allele resource that contributes to the development of new traits for plant breeding. Gamma rays have been widely used as a physical agent for mutation creation in plants, and their mutagenic effect has attracted extensive attention. However, few studies are available on the comprehensive mutation profile at both the large-scale phenotype mutation screening and whole-genome mutation scanning. In this study, biological effects on M1 generation, large-scale phenotype screening in M2 generation, as well as whole-genome re-sequencing of seven M3 phenotype-visible lines were carried out to comprehensively evaluate the mutagenic effects of gamma rays on Arabidopsis thaliana. A total of 417 plants with visible mutated phenotypes were isolated from 20,502 M2 plants, and the phenotypic mutation frequency of gamma rays was 2.03% in Arabidopsis thaliana. On average, there were 21.57 single-base substitutions (SBSs) and 11.57 small insertions and deletions (InDels) in each line. Single-base InDels accounts for 66.7% of the small InDels. The genomic mutation frequency was 2.78 × 10−10/bp/Gy. The ratio of transition/transversion was 1.60, and 64.28% of the C > T events exhibited the pyrimidine dinucleotide sequence; 69.14% of the small InDels were located in the sequence with 1 to 4 bp terminal microhomology that was used for DNA end rejoining, while SBSs were less dependent on terminal microhomology. Nine genes, on average, were predicted to suffer from functional alteration in each re-sequenced line. This indicated that a suitable mutation gene density was an advantage of gamma rays when trying to improve elite materials for one certain or a few traits. These results will aid the full understanding of the mutagenic effects and mechanisms of gamma rays and provide a basis for suitable mutagen selection and parameter design, which can further facilitate the development of more controlled mutagenesis methods for plant mutation breeding.


2022 ◽  
Author(s):  
Justin Wilcox ◽  
Barbara Arca-Ruibal ◽  
Jaime Samour ◽  
Victor Mateuta ◽  
Youssef Idaghdour ◽  
...  

Falcons are diverse birds of cultural and economic importance. They have undergone major lineage specific chromosomal rearrangements, resulting in greatly-reduced chromosome counts relative to other birds. Here, we use 10X Genomics linked reads to provide new high-contiguity genomes for two gyrfalcons, a saker falcon, a lanner falcon, three subspecies of peregrine falcons, and the common kestrel. Assisted by a transcriptome sequenced from 22 gyrfalcon tissues, we annotate these genomes for a variety of genomic features, estimate historical demography, and then investigate genomic equilibrium in the context of falcon-specific chromosomal rearrangements. We find that falcon genomes are not in AT-GC equilibrium with a bias in mutations towards higher AT content; this bias is predominantly driven by, but not dependent on, hypermutability of CpG sites. Small indels and large structural variants were also biased towards insertions rather than deletions. Patterns of disequilibrium were linked to chromosomal rearrangements: falcons have lost GC content in regions that have fused to larger chromosomes from microchromosomes and gained GC content in regions of macrochromosomes that have translocated to microchromosomes. Inserted bases have accumulated on regions ancestrally belonging to microchromosomes, consistent with insertion-biased gene conversion. We also find an excess of interspersed repeats on regions of microchromosomes that have fused to macrochromosomes. Our results reveal that falcon genomes are in a state of flux. They further suggest that many of the key differences between microchromosomes and macrochromosomes are driven by differences in chromosome size, and indicate a clear role for recombination and biased gene conversion in determining genomic equilibrium.


Author(s):  
Skylar R Wyant ◽  
M Fernanda Rodriguez ◽  
Corey K Carter ◽  
Wayne A Parrott ◽  
Scott A Jackson ◽  
...  

Abstract The mutagenic effects of ionizing radiation have been used for decades to create novel variants in experimental populations. Fast neutron (FN) bombardment as a mutagen has been especially widespread in plants, with extensive reports describing the induction of large structural variants, i.e., deletions, insertions, inversions, and translocations. However, the full spectrum of FN-induced mutations is poorly understood. We contrast small insertions and deletions (indels) observed in 27 soybean lines subject to FN irradiation with the standing indels identified in 107 diverse soybean lines. We use the same populations to contrast the nature and context (bases flanking a nucleotide change) of single nucleotide variants. The accumulation of new single nucleotide changes in FN lines is marginally higher than expected based on spontaneous mutation. In FN treated lines and in standing variation, C→T transitions and the corresponding reverse complement G→A transitions are the most abundant and occur most frequently in a CpG local context. These data indicate that most SNPs identified in FN lines are likely derived from spontaneous de novo processes in generations following mutagenesis rather than from the FN irradiation mutagen. However, small indels in FN lines differ from standing variants. Short insertions, from 1–6 base pairs, are less abundant than in standing variation. Short deletions are more abundant and prone to induce frameshift mutations that should disrupt the structure and function of encoded proteins. These findings indicate that FN irradiation generates numerous small indels, increasing the abundance of loss of function mutations that impact single genes.


2021 ◽  
Author(s):  
Ajay Gupta ◽  
Wanlong Li

As a versatile tool for genome engineering, CRISPR-Cas9 has been revolutionizing the field of molecular biology, biotechnology, and crop improvement. By precisely targeting pre-selected genomic sites, CRISPR-Cas9 primarily induces insertions or deletions (indels) of variable size. Despite the significant advance in the technology per se, detecting these indels is the major and difficult part of the CRISPR program in polyploid species, like wheat, with relatively low mutation rates. A plethora of methods are available for detecting mutations, but no method is perfect for all mutation types. In this case study, we demonstrated a new, protocol for capturing length polymorphism from small indels using a nested PCR approach. This new method is tractable, efficient, and cost-effective in detecting and genotyping indels >3-bp. We also discussed the major genotyping platforms used in our wheat CRISPR projects, such as mismatch cleavage assay, restriction enzyme assay, ribonucleoprotein assay, and Sanger sequencing, for their advantages and pitfalls in wheat CRISPR mutation detection.


2021 ◽  
Vol 16 (1) ◽  
Author(s):  
Hans-Ulrich Klein ◽  
Caroline Trumpff ◽  
Hyun-Sik Yang ◽  
Annie J. Lee ◽  
Martin Picard ◽  
...  

Abstract Background Mitochondrial dysfunction is a feature of neurodegenerative diseases, including Alzheimer’s disease (AD). Changes in the mitochondrial DNA copy number (mtDNAcn) and increased mitochondrial DNA mutation burden have both been associated with neurodegenerative diseases and cognitive decline. This study aims to systematically identify which common brain pathologies in the aged human brain are associated with mitochondrial recalibrations and to disentangle the relationship between these pathologies, mtDNAcn, mtDNA heteroplasmy, aging, neuronal loss, and cognitive function. Methods Whole-genome sequencing data from n = 1361 human brain samples from 5 different regions were used to quantify mtDNAcn as well as heteroplasmic mtDNA point mutations and small indels. Brain samples were assessed for 10 common pathologies. Annual cognitive test results were used to assess cognitive function proximal to death. For a subset of samples, neuronal proportions were estimated from RNA-seq profiles, and mass spectrometry was used to quantify the mitochondrial protein content of the tissue. Results mtDNAcn was 7–14% lower in AD relative to control participants. When accounting for all 10 common neuropathologies, only tau was significantly associated with lower mtDNAcn in the dorsolateral prefrontal cortex. In the posterior cingulate cortex, TDP-43 pathology demonstrated a distinct association with mtDNAcn. No changes were observed in the cerebellum, which is affected late by pathologies. Neither age nor gender was associated with mtDNAcn in the studied brain regions when adjusting for pathologies. Mitochondrial content and mtDNAcn independently explained variance in cognitive function unaccounted by pathologies, implicating complex mitochondrial recalibrations in cognitive decline. In contrast, mtDNA heteroplasmy levels increased by 1.5% per year of life in the cortical regions, but displayed no association with any of the pathologies or cognitive function. Conclusions We studied mtDNA quantity and quality in relation to mixed pathologies of aging and showed that tau and not amyloid-β is primarily associated with reduced mtDNAcn. In the posterior cingulate cortex, the association of TDP-43 with low mtDNAcn points to a vulnerability of this region in limbic-predominant age-related TDP-43 encephalopathy. While we found low mtDNAcn in brain regions affected by pathologies, the absence of associations with mtDNA heteroplasmy burden indicates that mtDNA point mutations and small indels are unlikely to be involved in the pathogenesis of late-onset neurodegenerative diseases.


Blood ◽  
2021 ◽  
Vol 138 (Supplement 1) ◽  
pp. 2192-2192
Author(s):  
Burak Altintas ◽  
Neelam Giri ◽  
Lisa J. McReynolds ◽  
Blanche P. Alter

Abstract Fanconi anemia (FA) is predominantly an autosomal recessive inherited bone marrow failure syndrome (IBMFS) characterized by congenital anomalies, bone marrow failure (BMF) and an increased cancer risk. It is caused by pathogenic variants in more than 22 genes in the FA/BRCA DNA repair pathway. Approximately 60% of patients have biallelic variants in FANCA. We sought to identify phenotypic differences and clinical outcomes of patients with diverse FANCA variants. We analyzed data from 86 patients with variants in FANCA enrolled in the National Cancer Institute (NCI) IBMFS study (ClinicalTrials.gov identifier 00027274). FANCA variants were determined through review of genetic test reports or whole exome sequencing done as part of the study. The variants were plotted using the online tool ProteinPaint (https://proteinpaint.stjude.org, Figure 1A). Clinical data were extracted from review of medical records and/or evaluations at the NIH. We compared patients with: 1) hypomorphic (hypomorphic variant on one or both alleles) versus null genotype (null variants on both alleles), 2) single nucleotide variant (SNV)/small insertions/deletions (indels) on both alleles versus SNV/small indels on one allele plus large multi-exon deletion on the other allele versus large multi-exon deletions on both alleles. We further compared patients with one or biallelic variants involving the BRCA1 interaction region in the N-terminal domain, FAAP20-binding domain, and variants in exons 27 to 30 where we saw an accumulation of variants, with patients who did not have variants in these regions. We evaluated physical abnormalities that are part of VACTERL-H (Vertebral, Anal, Cardiac, Tracheo-esophageal fistula, Esophageal or duodenal atresia, Renal, upper Limb abnormalities, Hydrocephalus) association and PHENOS (skin Pigmentation abnormalities, small Head, small Eyes, central Nervous system, Otologic abnormalities, Short stature). We focused on the presence, severity and age at BMF, and development of cancers and age at first solid tumor. Frequencies were compared by Fisher's exact test, and a multiple Cox regression model was used to estimate hazard ratio of solid tumors in patients with variants in different regions of FANCA, adjusting for age and hematopoietic cell transplant status of patients with cancer. Analyses were performed using Excel and RStudio, p-value <0.05 was considered significant. The phenotypes, BMF and cancer outcomes were similar between patients with hypomorphic and null genotypes. Similarly, comparison between patients with SNV/small indels, SNV/small indel plus large deletion, and biallelic large deletions did not reveal significant associations. Comparison according the location of the variants on FANCA protein showed that VACTERL-H and VACTERL-H plus PHENOS were less common in patients with at least one SNV/small indel in the BRCA1 interaction region of FANCA compared with patients without variants in this region (2/33 vs 12/51, p= 0.04; 1/33 vs 11/51, p= 0.024, respectively). These associations were not observed when we included patients with large deletions encompassing the BRCA1 interaction region. Eighteen of the 86 patients developed solid tumors; 15/45 patients with an SNV/small indel and/or large deletion within or including exons 27-30 region developed solid tumors compared with 3/41 patients without variants in this region (p= 0.003). Cox regression analysis showed that patients with variants within or involving exons 27-30 were at higher risk of developing solid tumors compared with those without variants in this region (HR: 6.2, 95% CI: 1.36-28.2, Figure 1B). There was no difference between the age at first cancer or type of solid tumors in patients with and without the variant involving this region. The frequency, severity, and age of BMF were also similar between the groups. Our data highlight the possibility that variants involving exons 27-30 within the C-terminal domain of FANCA may be associated with solid tumor development. FANCA forms a homodimer through the interaction between C-terminal domains; variants in this region may affect dimerization and further protein function. Functional analysis and in vivo studies of individual variants in this region and effects of the variants in trans might provide new insights into oncogenesis in FA and may have implications in personalized cancer screening. Figure 1 Figure 1. Disclosures No relevant conflicts of interest to declare.


2021 ◽  
Vol 23 (Supplement_6) ◽  
pp. vi103-vi103
Author(s):  
Changlin Yang ◽  
Vrunda Trivedi ◽  
Kyle Dyson ◽  
Oleg Yegorov ◽  
Duane Mitchell

Abstract BACKGROUND Cancer immunogenomics represents a complementary approach to the application of genomics in developing novel immunotherapies. We performed a multi-faceted computer algorithm, the Open Reading Frame Antigen Network (O.R.A.N.), on medulloblastoma transcription profiles and predicted antigens across a broad array of antigen classes. METHODS Patient-specific HLA haplotypes were called via customized Optitype and Phlat algorithms. Preclinical models- sonic hedgehog driven (Ptch1) and Group 3 MYC-driven (NSC) medulloblastoma were derived from C57BL6 murine strain with known MHC haplotypes. Only expressed mutations such as single nucleotide variations, small indels, gene fusions, and personalized TAAs were used for antigenic epitope predictions. Patient-specific or murine tumor associated antigens (TAA) were selected only if expressed >1 transcript per million (TPM) in tumor and the standardized expression across a human tissue database (29 organs or sub-regions, n=9,141) or a mouse normal tissue database (ENCODE, n=99) was below 1 TPM, respectively. TAA sequences were passed through eight MHC class I and four MHC class II affinity algorithms. All epitopes were screened against a customized human or murine proteomic library to guarantee that epitopes were not shared by other expressed isoforms or genes. Immune deconvolution with single cell RNASeq integration was leveraged for teasing out medulloblastoma immunologic landscape. RESULTS MB patients harbor MHC-I restricted 1.9 SNV, 0.1 Indel, 0.5 gene fusions and MHC-II restricted 2.5 SNV, 0.1 Indel and 0.5 gene fusion. 79.4% patients have at least 1 neoantigen. 88.2% patients have at least one immunogenic TAA. Importantly, cancer testis antigens and previously unappreciated neurodevelopmental antigens were found expressed across all medulloblastoma subgroups. We predicted 6 neoantigens and 14 TAAs for murine NSC tumor and 19 neoantigens and 13 TAAs for Ptch1 tumor. CONCULSION: Using a custom antigen prediction pipeline, we identified potential human and murine tumor rejection antigens with important implications for development of medulloblastoma cellular therapies.


2021 ◽  
Author(s):  
Karolin Heinze ◽  
Tayyebeh M. Nazeran ◽  
Sandra Lee ◽  
Pauline Krämer ◽  
Evan S. Cairns ◽  
...  

AbstractARID1A (BAF250a) is a component of the SWI/SNF chromatin modifying complex, plays an important tumor suppressor role, and is considered prognostic in several malignancies. However, in ovarian carcinomas there are contradictory reports on its relationship to outcome, immune response, and correlation with clinicopathological features. We assembled a series of 1623 endometriosis-associated ovarian carcinomas, including 1078 endometrioid (ENOC) and 545 clear cell (CCOC) ovarian carcinomas through combining resources of the Ovarian Tumor Tissue Analysis (OTTA) Consortium, the Canadian Ovarian Unified Experimental Resource (COEUR), local, and collaborative networks. Validated immunohistochemical surrogate assays for ARID1A mutations were applied to all samples. We investigated associations between ARID1A loss/mutation, clinical features, outcome, CD8+ tumor-infiltrating lymphocytes (CD8+ TIL), and DNA mismatch repair deficiency (MMRd). ARID1A loss was observed in 42% of CCOC and 25% of ENOC. We found no associations between ARID1A loss and outcomes, stage, age, or CD8+ TIL status in CCOC. Similarly, we found no association with outcome or stage in endometrioid cases. In ENOC, ARID1A loss was more prevalent in younger patients (p=0.012), and associated with MMRd (p<0.001), and presence of CD8+ TIL (p=0.008). Consistent with MMRd being causative of ARID1A mutations, in a subset of ENOC we also observed an association between ARID1A loss-of-function mutation as a result of small indels (p=0.011, vs. single nucleotide variants). In ENOC, the association between ARID1A loss, CD8+ TIL, and age, appears confounded by MMRd status. Although this observation does not explicitly rule out a role for ARID1A influence on CD8+ TIL infiltration in ENOC, given current knowledge regarding MMRd, it seems more likely that effects are dominated by the hypermutation phenotype. This large dataset with consistently applied biomarker assessment now provides a benchmark for the prevalence of ARID1A loss-of-function mutations in endometriosis-associated ovarian cancers and brings clarity to the prognostic significance.


2021 ◽  
Author(s):  
Patricia L Foster ◽  
Brittany A Niccum ◽  
Heewook Lee

Encounters between DNA replication and transcription can cause genomic disruption, particularly when the two meet head-on. Whether these conflicts produce point mutations is debated. This paper presents detailed analyses of a large collection of mutations generated during mutation accumulation experiments with mismatch-repair (MMR) defective Escherichia coli. With MMR absent, mutations are primarily due to DNA replication errors. Overall, there were no differences in the frequencies of base-pair substitutions or small indels (insertion and deletions ≤ 4 bp) in the coding sequences or promoters of genes oriented codirectionally versus head-on to replication. Among a subset of highly expressed genes there was a 2- to 3-fold bias for indels in genes oriented head-on to replication, but this difference was almost entirely due to the asymmetrical genomic locations of tRNA genes containing mononucleotide runs, which are hotspots for indels.No additional orientation bias in mutation frequencies occurred when MMR-strains were also defective for transcription-coupled repair (TCR). However, in contrast to other reports, loss of TCR slightly increased the overall mutation rate, meaning that TCR is antimutagenic. There was no orientation bias in mutation frequencies among the stress-response genes that are regulated by RpoS or induced by DNA damage. Thus, biases in the locations of mutational targets can account for most, if not all, apparent biases in mutation frequencies between genes oriented head-on versus co-directional to replication. In addition, the data revealed a strong correlation of the frequency of base-pair substitutions with gene length, but no correlation with gene expression levels.


2021 ◽  
Author(s):  
Thomas Hackl ◽  
Florian Trigodet ◽  
A Murat Eren ◽  
Steven J Biller ◽  
John M Eppley ◽  
...  

Long-read sequencing technologies hold big promises for the genomic analysis of complex samples such as microbial communities. Yet, despite improving accuracy, basic gene prediction on long-read data is still often impaired by frameshifts resulting from small indels. Consensus polishing using either complementary short reads or to a lesser extent the long reads themselves can mitigate this effect but requires universally high sequencing depth, which is difficult to achieve in complex samples where the majority of community members are rare. Here we present proovframe, a software implementing an alternative approach to overcome frameshift errors in long-read assemblies and raw long reads. We utilize protein-to-nucleotide alignments against reference databases to pinpoint indels in contigs or reads and correct them by deleting or inserting 1-2 bases, thereby conservatively restoring reading-frame fidelity in aligned regions. Using simulated and real-world benchmark data we show that proovframe performs comparably to short-read-based polishing on assembled data, works well with remote protein homologs, and can even be applied to raw reads directly. Together, our results demonstrate that protein-guided frameshift correction significantly improves the analyzability of long-read data both in combination with and as an alternative to common polishing strategies. Proovframe is available from https://github.com/thackl/proovframe.


Sign in / Sign up

Export Citation Format

Share Document