Abstract
Somatic mutations drive cancer development and may contribute to ageing and other diseases. Yet, the difficulty of detecting mutations present only in single cells or small clones has limited our knowledge of somatic mutagenesis to a minority of tissues. To overcome these limitations, we introduce nanorate sequencing (NanoSeq), a new duplex sequencing protocol with error rates <5 errors per billion base pairs in single DNA molecules from cell populations. The version of the protocol described here uses clean genome fragmentation with a restriction enzyme to prevent end-repair-associated errors and ddBTPs/dATPs during A-tailing to prevent nick extension. Both changes reduce the error rate of standard duplex sequencing protocols by preventing the fixation of DNA damage into both strands of DNA molecules during library preparation. We also use qPCR quantification of the library prior to amplification to optimise the complexity of the sequencing library given the desired sequencing coverage, maximising duplex coverage. The sample preparation protocol takes between 1 and 2 days, depending on the number of samples processed. The bioinformatic protocol is described in:https://github.com/cancerit/NanoSeqhttps://github.com/fa8sanger/NanoSeq_Paper_Code