Approximate Pattern Matching for DNA Sequence Data

Author(s):  
Nagamma Patil ◽  
Durga Toshniwal ◽  
Kumkum Garg
2019 ◽  
Vol 2019 ◽  
pp. 1-9 ◽  
Author(s):  
Maleeha Najam ◽  
Raihan Ur Rasool ◽  
Hafiz Farooq Ahmad ◽  
Usman Ashraf ◽  
Asad Waqar Malik

Storing and processing large DNA sequences has long been a major problem because of the increasing volume of DNA sequence data. A number of solutions have been proposed, but they require significant computation and memory. An efficient storage and pattern matching solution is therefore required for DNA sequencing data. Bloom filters (BFs) are an efficient data structure used in bioinformatics mostly for classifying DNA sequences. In this paper, we explore uses of BFs beyond classification. The proposed solution is based on Multiple Bloom Filters (MBFs) and finds all the locations and the number of repetitions of a specified pattern inside a DNA sequence. Both of these factors are extremely important in determining the type and intensity of a disease. This paper is a first effort towards optimizing the search for the location and frequency of substrings in DNA sequences using MBFs. It presents a proof-of-concept implementation for a given set of data using the proposed MBF technique, and we expect that further optimizations can bring remarkable results. Performance evaluation shows improved accuracy and time efficiency of the proposed approach.
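The abstract above does not spell out how the MBF technique works, so the following is only a minimal sketch of one way several Bloom filters could be used to report the locations and count of a pattern. It is not the authors' implementation. The names `BloomFilter` and `find_pattern`, the `block_size` parameter, and the SHA-256-based hashing are all assumptions made for illustration: one filter is built per block of start positions, and any block whose filter reports a possible hit is verified by direct comparison to rule out false positives.

```python
import hashlib


class BloomFilter:
    """Tiny Bloom filter (illustrative; sizes are arbitrary assumptions)."""

    def __init__(self, size_bits=1024, num_hashes=3):
        self.size = size_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(size_bits)  # one byte per bit, for simplicity

    def _indexes(self, item):
        # Derive num_hashes indexes by salting SHA-256 with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.size

    def add(self, item):
        for i in self._indexes(item):
            self.bits[i] = 1

    def __contains__(self, item):
        return all(self.bits[i] for i in self._indexes(item))


def find_pattern(sequence, pattern, block_size=8):
    """Return (positions, count) of exact occurrences of pattern.

    Hypothetical scheme: one Bloom filter per block of start positions.
    """
    k = len(pattern)
    if k == 0 or k > len(sequence):
        return [], 0
    last_start = len(sequence) - k  # last valid start position

    # Build phase: index every k-length substring into its block's filter.
    blocks = []
    for start in range(0, last_start + 1, block_size):
        end = min(start + block_size, last_start + 1)
        bf = BloomFilter()
        for i in range(start, end):
            bf.add(sequence[i:i + k])
        blocks.append((start, end, bf))

    # Query phase: scan only blocks whose filter may contain the pattern.
    positions = []
    for start, end, bf in blocks:
        if pattern in bf:  # possible false positive, so verify directly
            positions.extend(
                i for i in range(start, end) if sequence[i:i + k] == pattern
            )
    return positions, len(positions)
```

For example, `find_pattern("ACGTACGTTACG", "ACG")` returns the start positions of each occurrence together with their count. In this sketch the filters are built for one pattern length at query time; a practical design would build them once per supported length and reuse them across queries.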


2021 ◽  
Vol 14 (1) ◽  
Author(s):  
Heleen Plaisier ◽  
Thomas R. Meagher ◽  
Daniel Barker

Objective: Visualisation methods, primarily color-coded representation of sequence data, have been a predominant means of representation of DNA data. Algorithmic conversion of DNA sequence data to sound (sonification) represents an alternative means of representation that uses a different range of human sensory perception. We propose that sonification has value for public engagement with DNA sequence information because it has potential to be entertaining as well as informative. We conduct preliminary work to explore the potential of DNA sequence sonification in public engagement with bioinformatics. We apply a simple sonification technique for DNA, in which each DNA base is represented by a specific note. Additionally, a beat may be added to indicate codon boundaries or for musical effect. We report a brief analysis from public engagement events we conducted that featured this method of sonification.

Results: We report on use of DNA sequence sonification at two public events. Sonification has potential in public engagement with bioinformatics, both as a means of data representation and as a means to attract an audience to a drop-in stand. We also discuss further directions for research on integration of sonification into bioinformatics public engagement and education.
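The base-to-note technique described in this abstract can be sketched in a few lines. The abstract does not say which note each base maps to, so the mapping in `BASE_TO_NOTE` is a hypothetical choice, as are the function name `sonify` and the decision to flag the first base of every codon with a beat. The sketch produces a list of note events and does not synthesise audio.

```python
# Hypothetical base-to-note mapping; the source does not specify the notes.
BASE_TO_NOTE = {"A": "C4", "C": "D4", "G": "E4", "T": "G4"}


def sonify(sequence, mark_codons=True):
    """Convert a DNA string into a list of (note, beat) events.

    beat is True on the first base of each codon (every third position,
    counted from the start of the sequence) when mark_codons is set.
    """
    events = []
    for i, base in enumerate(sequence.upper()):
        note = BASE_TO_NOTE.get(base)
        if note is None:
            continue  # skip ambiguous or non-DNA characters such as N
        beat = mark_codons and i % 3 == 0
        events.append((note, beat))
    return events
```

Calling `sonify("ATGC")` yields one event per base, with the beat flag set on the first and fourth bases. The events could then be passed to any MIDI or audio library for playback.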


Zootaxa ◽  
2020 ◽  
Vol 4766 (3) ◽  
pp. 472-484
Author(s):  
HANNAH E. SOM ◽  
L. LEE GRISMER ◽  
PERRY L. JR. WOOD ◽  
EVAN S. H. QUAH ◽  
RAFE M. BROWN ◽  
...  

Liopeltis is a genus of poorly known, infrequently sampled species of colubrid snakes in tropical Asia. We collected a specimen of Liopeltis from Pulau Tioman, Peninsular Malaysia, that superficially resembled L. philippina, a rare species that is endemic to the Palawan Pleistocene Aggregate Island Complex, western Philippines. We analyzed morphological and mitochondrial DNA sequence data from the Pulau Tioman specimen and found distinct differences to L. philippina and all other congeners. On the basis of these corroborated lines of evidence, the Pulau Tioman specimen is described as a new species, L. tiomanica sp. nov. The new species occurs in sympatry with L. tricolor on Pulau Tioman, and our description of L. tiomanica sp. nov. brings the number of endemic amphibians and reptiles on Pulau Tioman to 12. 


2007 ◽  
Vol 3 ◽  
pp. 193-197 ◽  
Author(s):  
Kou Amano ◽  
Hiroaki Ichikawa ◽  
Hidemitsu Nakamura ◽  
Hisataka Numa ◽  
Kaoru Fukami-Kobayashi ◽  
...  
