Self-organizing Clustering: Non-hierarchical Clustering for Large Scale DNA Sequence Data

Sequence variation aware genome references and read mapping with the variation graph toolkit

10.1101/234856 ◽

2017 ◽

Cited By ~ 12

Author(s):

Erik Garrison ◽

Jouni Sirén ◽

Adam M. Novak ◽

Glenn Hickey ◽

Jordan M. Eizenga ◽

...

Keyword(s):

Dna Sequence ◽

Large Scale ◽

De Novo ◽

Sequence Data ◽

Variant Calling ◽

Read Mapping ◽

Dna Sequence Data ◽

Suffix Arrays ◽

Improved Accuracy ◽

Reference Genomes

Abstract. Reference genomes guide our interpretation of DNA sequence data. However, conventional linear references are fundamentally limited in that they represent only one version of each locus, whereas the population may contain multiple variants. When the reference represents an individual’s genome poorly, it can impact read mapping and introduce bias. Variation graphs are bidirected DNA sequence graphs that compactly represent genetic variation, including large scale structural variation such as inversions and duplications [1]. Equivalent structures are produced by de novo genome assemblers [2,3]. Here we present vg, a toolkit of computational methods for creating, manipulating, and utilizing these structures as references at the scale of the human genome. vg provides an efficient approach to mapping reads onto arbitrary variation graphs using generalized compressed suffix arrays [4], with improved accuracy over alignment to a linear reference, creating data structures to support downstream variant calling and genotyping. These capabilities make using variation graphs as reference structures for DNA sequencing practical at the scale of vertebrate genomes, or at the topological complexity of new species assemblies.
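
To make the idea of a graph reference concrete, here is a toy sketch in Python. It is not vg's data model or mapping algorithm: the graph is a directed simplification (strand orientation is ignored), the node sequences and the names `nodes`, `edges`, `spell_paths`, and `read_matches` are hypothetical, and the naive substring search merely stands in for the compressed suffix array index the abstract mentions.

```python
# Toy variation graph (illustrative only): node id -> DNA sequence.
# Nodes 2 and 3 are the two alleles of a single SNP, forming a "bubble".
nodes = {1: "GATT", 2: "A", 3: "G", 4: "CA"}
# Directed edges; a real variation graph is bidirected, which is omitted here.
edges = {1: [2, 3], 2: [4], 3: [4], 4: []}


def spell_paths(node, prefix=""):
    """Yield every sequence spelled by a walk from `node` to a sink node."""
    seq = prefix + nodes[node]
    if not edges[node]:
        yield seq
        return
    for nxt in edges[node]:
        yield from spell_paths(nxt, seq)


def read_matches(read):
    """Naive exact matching: return the graph paths that contain `read`."""
    return [hap for hap in spell_paths(1) if read in hap]


if __name__ == "__main__":
    print(list(spell_paths(1)))
    print(read_matches("TGC"))
```

A read that carries the alternate allele matches a path through the graph even though a single linear reference holding only the other allele would not contain it exactly. Enumerating all paths is exponential in the number of bubbles, which is one reason an indexed approach is needed at genome scale.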

A Show of Character—a further response to Wiley et al.

Zootaxa ◽

10.11646/zootaxa.2946.1.6 ◽

2011 ◽

Vol 2946 (1) ◽

pp. 29 ◽

Cited By ~ 1

Author(s):

ANTHONY C. GILL ◽

RANDALL D. MOOI

Keyword(s):

Dna Sequence ◽

Large Scale ◽

Sequence Data ◽

Molecular Data ◽

Scientific Rigor ◽

Dna Sequence Data ◽

Phylogenetic Hypotheses ◽

Alternative Relationships

Wiley et al. (2011) begin their critique of our paper (Mooi & Gill, 2010) with an assertion: “we need to make it clear that the foundation of their arguments rests not on scientific rigor, but rather on opinions about the re-classification of fishes using molecular data. This bias is the reason that they only targeted researchers who proposed changes in the higher-level taxonomy of fishes using phylogenetic hypotheses based on DNA sequence data (Miya et al. 2007, Smith & Craig 2007, Thacker 2009). In criticizing these studies, they do not suggest any alternative relationships or provide any counter evidence to the proposed relationships.” And on page 8, they apparently read our thoughts (aside from the title, none of the words in quotations was written by us in that context) and concluded: “Mooi & Gill entitled their paper “A crisis in fish systematics” because they long for the days when “real” ichthyologists found “meaningful” characters and “true” relationships.” Finally (p. 9), they contend that “Mooi & Gill’s various studies are usually focused on Johnson & Patterson’s (1993: 555) “disparate twigs of the [percomorph] tree,” whereas the explicit studies they criticize are large-scale and taxon rich datasets that have not otherwise been analyzed in Percomorpha.”

Using DNA sequence data to investigate the invasive spotted lanternfly's origin, parasitoids, and microbial associates

10.1603/ice.2016.109017 ◽

2016 ◽

Author(s):

Julie M. Urban

Keyword(s):

Dna Sequence ◽

Sequence Data ◽

Dna Sequence Data

DNA sonification for public engagement in bioinformatics

BMC Research Notes ◽

10.1186/s13104-021-05685-7 ◽

2021 ◽

Vol 14 (1) ◽

Author(s):

Heleen Plaisier ◽

Thomas R. Meagher ◽

Daniel Barker

Keyword(s):

Dna Sequence ◽

Public Engagement ◽

Sequence Data ◽

Sensory Perception ◽

Data Representation ◽

Sequence Information ◽

Dna Sequence Data ◽

Public Events ◽

Dna Base ◽

Alternative Means

Abstract. Objective: Visualisation methods, primarily color-coded representation of sequence data, have been a predominant means of representation of DNA data. Algorithmic conversion of DNA sequence data to sound (sonification) represents an alternative means of representation that uses a different range of human sensory perception. We propose that sonification has value for public engagement with DNA sequence information because it has potential to be entertaining as well as informative. We conduct preliminary work to explore the potential of DNA sequence sonification in public engagement with bioinformatics. We apply a simple sonification technique for DNA, in which each DNA base is represented by a specific note. Additionally, a beat may be added to indicate codon boundaries or for musical effect. We report a brief analysis from public engagement events we conducted that featured this method of sonification. Results: We report on use of DNA sequence sonification at two public events. Sonification has potential in public engagement with bioinformatics, both as a means of data representation and as a means to attract audience to a drop-in stand. We also discuss further directions for research on integration of sonification into bioinformatics public engagement and education.
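
The base-to-note technique described in this abstract can be sketched in a few lines of Python. This is a minimal illustration, not the authors' implementation: the abstract does not say which notes were assigned to which bases, so the `BASE_TO_NOTE` mapping, the `"rest"` fallback for unrecognised symbols, and the `"beat"` marker are all assumptions made for the example.

```python
# Hypothetical mapping; the abstract does not specify the actual notes used.
BASE_TO_NOTE = {"A": "A4", "C": "C4", "G": "G4", "T": "E4"}


def sonify(sequence, mark_codons=False):
    """Return one note name per base.

    If mark_codons is True, a "beat" marker is placed before every third
    base (positions 0, 3, 6, ...) to indicate codon boundaries.
    Symbols outside A/C/G/T (for example N) become a "rest".
    """
    events = []
    for i, base in enumerate(sequence.upper()):
        if mark_codons and i % 3 == 0:
            events.append("beat")
        events.append(BASE_TO_NOTE.get(base, "rest"))
    return events


if __name__ == "__main__":
    print(sonify("ATGC", mark_codons=True))
```

The function returns symbolic note names rather than audio, so the resulting list could be handed to whatever synthesiser or MIDI library is available.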

When DNA sequence data and morphological results fit together: Phylogenetic position of Crenubiotus within Macrobiotoidea (Eutardigrada) with description of Crenubiotus ruhesteini sp. nov

Journal of Zoological Systematics & Evolutionary Research ◽

10.1111/jzs.12449 ◽

2021 ◽

Author(s):

Roberto Guidetti ◽

Ralph O. Schill ◽

Ilaria Giovannini ◽

Edoardo Massa ◽

Sara Elena Goldoni ◽

...

Keyword(s):

Dna Sequence ◽

Sequence Data ◽

Phylogenetic Position ◽

Dna Sequence Data

A new Liopeltis Fitzinger, 1843 (Squamata: Colubridae) from Pulau Tioman, Peninsular Malaysia

Zootaxa ◽

10.11646/zootaxa.4766.3.6 ◽

2020 ◽

Vol 4766 (3) ◽

pp. 472-484

Author(s):

HANNAH E. SOM ◽

L. LEE GRISMER ◽

PERRY L. JR. WOOD ◽

EVAN S. H. QUAH ◽

RAFE M. BROWN ◽

...

Keyword(s):

Mitochondrial Dna ◽

New Species ◽

Dna Sequence ◽

Rare Species ◽

Sequence Data ◽

Peninsular Malaysia ◽

Tropical Asia ◽

Dna Sequence Data ◽

A New Species ◽

Lines Of Evidence

Liopeltis is a genus of poorly known, infrequently sampled species of colubrid snakes in tropical Asia. We collected a specimen of Liopeltis from Pulau Tioman, Peninsular Malaysia, that superficially resembled L. philippina, a rare species that is endemic to the Palawan Pleistocene Aggregate Island Complex, western Philippines. We analyzed morphological and mitochondrial DNA sequence data from the Pulau Tioman specimen and found distinct differences to L. philippina and all other congeners. On the basis of these corroborated lines of evidence, the Pulau Tioman specimen is described as a new species, L. tiomanica sp. nov. The new species occurs in sympatry with L. tricolor on Pulau Tioman, and our description of L. tiomanica sp. nov. brings the number of endemic amphibians and reptiles on Pulau Tioman to 12.

Insights into the Biogeography and Polyploid Evolution of New Zealand Asplenium from Chloroplast DNA Sequence Data

American Fern Journal ◽

10.1640/0002-8444(2005)095[0001:iitbap]2.0.co;2 ◽

2005 ◽

Vol 95 (1) ◽

pp. 1-21 ◽

Cited By ~ 47

Author(s):

Leon R. Perrie ◽

Patrick J. Brownsey

Keyword(s):

New Zealand ◽

Chloroplast Dna ◽

Dna Sequence ◽

Sequence Data ◽

Polyploid Evolution ◽

Dna Sequence Data

Accuracy and efficiency of algorithms for the demarcation of bacterial ecotypes from DNA sequence data

International Journal of Bioinformatics Research and Applications ◽

10.1504/ijbra.2014.062992 ◽

2014 ◽

Vol 10 (4/5) ◽

pp. 409 ◽

Cited By ~ 5

Author(s):

Juan Carlos Francisco ◽

Frederick M. Cohan ◽

Danny Krizanc

Keyword(s):

Dna Sequence ◽

Sequence Data ◽

Dna Sequence Data ◽

Efficiency Of Algorithms

Natural variation and conservation of Lepidium sisymbrioides Hook.f. and L. solandri Kirk (Brassicaceae) in South Island, New Zealand, based on morphological and DNA sequence data

New Zealand Journal of Botany ◽

10.1080/00288250709509712 ◽

2007 ◽

Vol 45 (1) ◽

pp. 237-264 ◽

Cited By ~ 14

Author(s):

P. B. Heenan ◽

A. D. Mitchell ◽

P. A. McLenachan ◽

P. J. Lockhart ◽

P. J. de Lange

Keyword(s):

New Zealand ◽

Dna Sequence ◽

Natural Variation ◽

Sequence Data ◽

Dna Sequence Data

Genetic diversity within scorpions of the genus Buthus from the Iberian Peninsula: mitochondrial DNA sequence data indicate additional distinct cryptic lineages

Journal of Arachnology ◽

10.1636/h08-98.1 ◽

2010 ◽

Vol 38 (2) ◽

pp. 206-211 ◽

Cited By ~ 19

Author(s):

Pedro Sousa ◽

Elsa Froufe ◽

Paulo Célio Alves ◽

D. James Harris

Keyword(s):

Genetic Diversity ◽

Mitochondrial Dna ◽

Iberian Peninsula ◽

Dna Sequence ◽

Sequence Data ◽

Dna Sequence Data ◽

Mitochondrial Dna Sequence
