scholarly journals A framework for mutational signature analysis based on DNA shape parameters

PLoS ONE ◽  
2022 ◽  
Vol 17 (1) ◽  
pp. e0262495
Author(s):  
Aleksandra Karolak ◽  
Jurica Levatić ◽  
Fran Supek

The mutation risk of a DNA locus depends on its oligonucleotide context. In turn, mutability of oligonucleotides varies across individuals, due to exposure to mutagenic agents or due to variable efficiency and/or accuracy of DNA repair. Such variability is captured by mutational signatures, a mathematical construct obtained by a deconvolution of mutation frequency spectra across individuals. There is a need to enhance methods for inferring mutational signatures to make better use of sparse mutation data (e.g., resulting from exome sequencing of cancers), to facilitate insight into underlying biological mechanisms, and to provide more accurate mutation rate baselines for inferring positive and negative selection. We propose a conceptualization of mutational signatures that represents oligonucleotides via descriptors of DNA conformation: base pair, base pair step, and minor groove width parameters. We demonstrate how such DNA structural parameters can accurately predict mutation occurrence due to DNA repair failures or due to exposure to diverse mutagens such as radiation, chemical exposure, and the APOBEC cytosine deaminase enzymes. Furthermore, the mutation frequency of DNA oligomers classed by structural features can accurately capture systematic variability in mutagenesis of >1,000 tumors originating from diverse human tissues. A nonnegative matrix factorization was applied to mutation spectra stratified by DNA structural features, thereby extracting novel mutational signatures. Moreover, many of the known trinucleotide signatures were associated with an additional spectrum in the DNA structural descriptor space, which may aid interpretation and provide mechanistic insight. Overall, we suggest that the power of DNA sequence motif-based mutational signature analysis can be enhanced by drawing on DNA shape features.

2020 ◽  
Author(s):  
Aleksandra Karolak ◽  
Fran Supek

AbstractThe propensity to acquire mutations depends on the oligonucleotide context of a DNA locus. In turn, this differential mutability of oligonucleotides varies across individuals due to exposure to mutagenic agents or due to variable efficiency of DNA repair pathways. Such variability is captured by mutational signatures, mathematical constructs resulting from a deconvolution of mutation frequency spectra across individuals. There is a need to enhance methods for inferring mutational signatures to make better use of sparse mutation frequency data that results from genome sequencing, and additionally to facilitate insight into underlying biological mechanisms. In cancer genomics, novel approaches to analyze somatic mutation patterns may help explain the etiology of various tumor types, as well as provide a more accurate baseline to infer positive and negative selection on somatic changes that drive tumor evolution. We propose a conceptualization of mutational signatures that represents oligonucleotides via descriptors of DNA conformation: base pair, base pair step, and minor groove width parameters. We demonstrate how such DNA structural parameters can accurately predict mutation occurrence due to DNA repair failures or due to exposure to diverse mutagens, including radiation, chemical exposure and the APOBEC cytosine deaminase enzymes. Furthermore, the mutation frequency of DNA oligomers classed by structural features can accurately capture systematic variability in mutational spectra of >1,000 tumors originating from diverse human tissues. Overall, we suggest that the power of DNA sequence-based mutational signature analysis can be enhanced by drawing on DNA shape features.


2019 ◽  
Author(s):  
Harald Vöhringer ◽  
Arne van Hoeck ◽  
Edwin Cuppen ◽  
Moritz Gerstung

AbstractMutational signature analysis is an essential part of the cancer genome analysis toolkit. Conventionally, mutational signature analysis extracts patterns of different mutation types across many cancer genomes. Here we present TensorSignatures, an algorithm to learn mutational signatures jointly across all variant categories and their genomic context. The analysis of 2,778 primary and 3,824 metastatic cancer genomes of the PCAWG consortium and the HMF cohort shows that practically all signatures operate dynamically in response to various genomic and epigenomic states. The analysis pins differential spectra of UV mutagenesis found in active and inactive chromatin to global genome nucleotide excision repair. TensorSignatures accurately characterises transcription-associated mutagenesis, which is detected in 7 different cancer types. The analysis also unmasks replication- and double strand break repair-driven APOBEC mutagenesis, which manifests with differential numbers and length of mutation clusters indicating a differential processivity of the two triggers. As a fourth example, TensorSignatures detects a signature of somatic hypermutation generating highly clustered variants around the transcription start sites of active genes in lymphoid leukaemia, distinct from a more general and less clustered signature of Polη-driven translesion synthesis found in a broad range of cancer types.Key findingsSimultaneous inference of mutational signatures across mutation types and genomic features refines signature spectra and defines their genomic determinants.Analysis of 6,602 cancer genomes reveals pervasive intra-genomic variation of mutational processes.Distinct mutational signatures found in quiescent and active regions of the genome reveal differential repair and mutagenicity of UV- and tobacco-induced DNA damage.APOBEC mutagenesis produces two signatures reflecting highly clustered, double strand break repair-initiated and lowly clustered replication-driven mutagenesis, respectively.Somatic hypermutation in lymphoid cancers produces a strongly clustered mutational signature localised to transcription start sites, which is distinct from a weakly clustered translesion synthesis signature found in multiple tumour types.


2018 ◽  
Author(s):  
Kevin Gori ◽  
Adrian Baez-Ortega

Mutational signature analysis aims to infer the mutational spectra and relative exposures of processes that contribute mutations to genomes. Different models for signature analysis have been developed, mostly based on non-negative matrix factorisation or non-linear optimisation. Here we present sigfit, an R package for mutational signature analysis that applies Bayesian inference to perform fitting and extraction of signatures from mutation data. We compare the performance of sigfit to prominent existing software, and find that it compares favourably. Moreover, sigfit introduces novel probabilistic models that enable more robust, powerful and versatile fitting and extraction of mutational signatures and broader biological patterns. The package also provides user-friendly visualisation routines and is easily integrable with other bioinformatic packages.


2020 ◽  
Vol 3 (1) ◽  
Author(s):  
Vinod Kumar Singh ◽  
Arnav Rastogi ◽  
Xiaoju Hu ◽  
Yaqun Wang ◽  
Subhajyoti De

AbstractAlthough a majority of somatic mutations in cancer are passengers, their mutational signatures provide mechanistic insights into mutagenesis and DNA repair processes. Mutational signature SBS8 is common in most cancers, but its etiology is debated. Incorporating genomic, epigenomic, and cellular process features for multiple cell-types we develop genome-wide composite epigenomic context-maps relevant for mutagenesis and DNA repair. Analyzing somatic mutation data from multiple cancer types in their epigenomic contexts, we show that SBS8 preferentially occurs in gene-poor, lamina-proximal, late replicating heterochromatin domains. While SBS8 is uncommon among mutations in non-malignant tissues, in tumor genomes its proportions increase with replication timing and speed, and checkpoint defects further promote this signature - suggesting that SBS8 probably arises due to uncorrected late replication errors during cancer progression. Our observations offer a potential reconciliation among different perspectives in the debate about the etiology of SBS8 and its relationship with other mutational signatures.


Author(s):  
Lauren Lawrence ◽  
Christian A. Kunder ◽  
Eula Fung ◽  
Henning Stehr ◽  
James Zehnder

Context.— Mutational signatures have been described in the literature and a few centers have implemented pipelines for clinical reporting. Objective.— To describe the performance of a mutational signature caller with clinical samples sequenced on a targeted next-generation sequencing panel with a small genomic footprint. Design.— One thousand six hundred eighty-two (n = 1682) clinical samples were analyzed for the presence of mutational signatures using deconstructSigs on variant calls with at least 20 variant reads. Results.— Signature 10 (associated with POLe mutation) achieved separation of cases and controls in hypermutated samples. Signatures 4 (associated with tobacco smoking) and 7 (associated with ultraviolet radiation) as an indicator of pulmonary or cutaneous primary sites showed moderate sensitivity and high specificity at optimal cutpoints. Mutational signatures in malignancies with unknown primaries were somewhat consistent with the clinically suspected primary site, with an apparent dose-response relationship between the number of variants analyzed and the ability of mutational signature analysis to correctly suggest a primary site. Conclusions.— Mutational signatures represent an opportunity for orthogonal testing of primary site, which may be particularly useful in supporting cutaneous or pulmonary sites in poorly differentiated neoplasms. Tobacco smoking, ultraviolet radiation, and POLe mutational signatures are the most appropriate signatures for implementation. Even relatively small numbers of variants appear capable of supporting a clinically suspected primary.


2020 ◽  
Author(s):  
Julián Candia

AbstractSummarymutSigMapper aims to resolve a critical shortcoming of existing software for mutational signature analysis, namely that of finding parsimonious and biologically plausible exposures. By implementing a shot-noise-based model to generate spectral ensembles, this package addresses this gap and provides a quantitative, non-parametric assessment of statistical significance for the association between mutational signatures and observed spectra.Availability and implementationThe mutSigMapper R package is available under GPLv3 license at https://github.com/juliancandia/mutSigMapper. Its documentation provides additional details and demonstrates applications to biological datasets.


2021 ◽  
Author(s):  
Palash Pandey ◽  
Sanjeevani Arora ◽  
Gail Rosen

The analysis of mutational signatures is becoming increasingly common in cancer genetics, with emerging implications in cancer evolution, classification, treatment decision and prognosis. Recently, several packages have been developed for mutational signature analysis, with each using different methodology and yielding significantly different results. Because of the nontrivial differences in tools' refitting results, researchers may desire to survey and compare the available tools, in order to objectively evaluate the results for their specific research question, such as which mutational signatures are prevalent in different cancer types. There is a need for a software that can aggregate results from different refitting packages and present them in a user-friendly way to facilitate effective comparison of mutational signatures.


2018 ◽  
Author(s):  
Judith E. Grolleman ◽  
Richarda M. de Voer ◽  
Fadwa A. Elsayed ◽  
Maartje Nielsen ◽  
Robbert D. A. Weren ◽  
...  

2018 ◽  
Vol 20 (suppl_6) ◽  
pp. vi106-vi106
Author(s):  
Sylvia Kurz ◽  
Benjamin Liechty ◽  
Stephen Kelly ◽  
Varshini Vasudevaraja ◽  
Ramona Bledea ◽  
...  

2019 ◽  
Vol 152 (1) ◽  
pp. 11-19 ◽  
Author(s):  
Charles W. Ashley ◽  
Arnaud Da Cruz Paula ◽  
Rahul Kumar ◽  
Diana Mandelker ◽  
Xin Pei ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document