scholarly journals MitoSuite: a graphical tool for human mitochondrial genome profiling in massive parallel sequencing

PeerJ ◽  
2017 ◽  
Vol 5 ◽  
pp. e3406 ◽  
Author(s):  
Koji Ishiya ◽  
Shintaroh Ueda

Recent rapid advances in high-throughput, next-generation sequencing (NGS) technologies have promoted mitochondrial genome studies in the fields of human evolution, medical genetics, and forensic casework. However, scientists unfamiliar with computer programming often find it difficult to handle the massive volumes of data that are generated by NGS. To address this limitation, we developed MitoSuite, a user-friendly graphical tool for analysis of data from high-throughput sequencing of the human mitochondrial genome. MitoSuite generates a visual report on NGS data with simple mouse operations. Moreover, it analyzes high-coverage sequencing data but runs on a stand-alone computer, without the need for file upload. Therefore, MitoSuite offers outstanding usability for handling massive NGS data, and is ideal for evolutionary, clinical, and forensic studies on the human mitochondrial genome variations. It is freely available for download from the website https://mitosuite.com.

2020 ◽  
Author(s):  
Charles Hadley S. King ◽  
Jonathon Keeney ◽  
Nuria Guimera ◽  
Souvik Das ◽  
Brian Fochtman ◽  
...  

AbstractFor regulatory submissions of next generation sequencing (NGS) data it is vital for the analysis workflow to be robust, reproducible, and understandable. This project demonstrates that the use of the IEEE 2791-2020 Standard, (BioCompute objects [BCO]) enables complete and concise communication of NGS data analysis results. One arm of a clinical trial was replicated using synthetically generated data made to resemble real biological data. Two separate, independent analyses were then carried out using BCOs as the tool for communication of analysis: one to simulate a pharmaceutical regulatory submission to the FDA, and another to simulate the FDA review. The two results were compared and tabulated for concordance analysis: of the 118 simulated patient samples generated, the final results of 117 (99.15%) were in agreement. This high concordance rate demonstrates the ability of a BCO, when a verification kit is included, to effectively capture and clearly communicate NGS analyses within regulatory submissions. BCO promotes transparency and induces reproducibility, thereby reinforcing trust in the regulatory submission process.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Gwenna Breton ◽  
Anna C. V. Johansson ◽  
Per Sjödin ◽  
Carina M. Schlebusch ◽  
Mattias Jakobsson

Abstract Background Population genetic studies of humans make increasing use of high-throughput sequencing in order to capture diversity in an unbiased way. There is an abundance of sequencing technologies, bioinformatic tools and the available genomes are increasing in number. Studies have evaluated and compared some of these technologies and tools, such as the Genome Analysis Toolkit (GATK) and its “Best Practices” bioinformatic pipelines. However, studies often focus on a few genomes of Eurasian origin in order to detect technical issues. We instead surveyed the use of the GATK tools and established a pipeline for processing high coverage full genomes from a diverse set of populations, including Sub-Saharan African groups, in order to reveal challenges from human diversity and stratification. Results We surveyed 29 studies using high-throughput sequencing data, and compared their strategies for data pre-processing and variant calling. We found that processing of data is very variable across studies and that the GATK “Best Practices” are seldom followed strictly. We then compared three versions of a GATK pipeline, differing in the inclusion of an indel realignment step and with a modification of the base quality score recalibration step. We applied the pipelines on a diverse set of 28 individuals. We compared the pipelines in terms of count of called variants and overlap of the callsets. We found that the pipelines resulted in similar callsets, in particular after callset filtering. We also ran one of the pipelines on a larger dataset of 179 individuals. We noted that including more individuals at the joint genotyping step resulted in different counts of variants. At the individual level, we observed that the average genome coverage was correlated to the number of variants called. Conclusions We conclude that applying the GATK “Best Practices” pipeline, including their recommended reference datasets, to underrepresented populations does not lead to a decrease in the number of called variants compared to alternative pipelines. We recommend to aim for coverage of > 30X if identifying most variants is important, and to work with large sample sizes at the variant calling stage, also for underrepresented individuals and populations.


Author(s):  
Romesh Kumar Salgotra ◽  
Rafiq Ahmad Bhat ◽  
Deyue Yu ◽  
Javaid Akhter Bhat

Abstract: Over the past two decades, the advances in the next generation sequencing (NGS) platforms have led to the identification of numerous genes/QTLs at high-resolution for their potential use in crop improvement. The genomic resources generated through these high-throughput sequencing techniques have been efficiently used in screening of particular gene of interest particularly for numerous types of plant stresses and quality traits. Subsequently, the identified-markers linked to a particular trait have been used in marker-assisted backcross breeding (MABB) activities. Besides, these markers are also being used to catalogue the food crops for detection of adulteration to improve the quality of food. With the advancement of technologies, the genomic resources are originating with new markers; however, to use these markers efficiently in crop breeding, high-throughput techniques (HTT) such as multiplex PCR and capillary electrophoresis (CE) can be exploited. Robustness, ease of operation, good reproducibility and low cost are the main advantages of multiplex PCR and CE. The CE is capable of separating and characterizing proteins with simplicity, speed and small sample requirements. Keeping in view the availability of vast data generated through NGS techniques and development of numerous markers, there is a need to use these resources efficiently in crop improvement programmes. In summary, this review describes the use of molecular markers in the screening of resistance genes in breeding programmes and detection of adulterations in food crops using high-throughput techniques.


MycoKeys ◽  
2018 ◽  
Vol 39 ◽  
pp. 29-40 ◽  
Author(s):  
Sten Anslan ◽  
R. Henrik Nilsson ◽  
Christian Wurzbacher ◽  
Petr Baldrian ◽  
Leho Tedersoo ◽  
...  

Along with recent developments in high-throughput sequencing (HTS) technologies and thus fast accumulation of HTS data, there has been a growing need and interest for developing tools for HTS data processing and communication. In particular, a number of bioinformatics tools have been designed for analysing metabarcoding data, each with specific features, assumptions and outputs. To evaluate the potential effect of the application of different bioinformatics workflow on the results, we compared the performance of different analysis platforms on two contrasting high-throughput sequencing data sets. Our analysis revealed that the computation time, quality of error filtering and hence output of specific bioinformatics process largely depends on the platform used. Our results show that none of the bioinformatics workflows appears to perfectly filter out the accumulated errors and generate Operational Taxonomic Units, although PipeCraft, LotuS and PIPITS perform better than QIIME2 and Galaxy for the tested fungal amplicon dataset. We conclude that the output of each platform requires manual validation of the OTUs by examining the taxonomy assignment values.


Genomics ◽  
2017 ◽  
Vol 109 (2) ◽  
pp. 83-90 ◽  
Author(s):  
Yan Guo ◽  
Yulin Dai ◽  
Hui Yu ◽  
Shilin Zhao ◽  
David C. Samuels ◽  
...  

2018 ◽  
Vol 24 (9_suppl) ◽  
pp. 94S-103S ◽  
Author(s):  
Qi Wang ◽  
Lijuan Cao ◽  
Guangying Sheng ◽  
Hongjie Shen ◽  
Jing Ling ◽  
...  

Inherited thrombocytopenia is a group of hereditary diseases with a reduction in platelet count as the main clinical manifestation. Clinically, there is an urgent need for a convenient and rapid diagnosis method. We introduced a high-throughput, next-generation sequencing (NGS) platform into the routine diagnosis of patients with unexplained thrombocytopenia and analyzed the gene sequencing results to evaluate the value of NGS technology in the screening and diagnosis of inherited thrombocytopenia. From a cohort of 112 patients with thrombocytopenia, we screened 43 patients with hereditary features. For the blood samples of these 43 patients, a gene sequencing platform for hemorrhagic and thrombotic diseases comprising 89 genes was used to perform gene detection using NGS technology. When we combined the screening results with clinical features and other findings, 15 (34.9%) of 43patients were diagnosed with inherited thrombocytopenia. In addition, 19 pathogenic variants, including 8 previously unreported variants, were identified in these patients. Through the use of this detection platform, we expect to establish a more effective diagnostic approach to such disorders.


Sign in / Sign up

Export Citation Format

Share Document