scholarly journals Dub-seq: dual-barcoded shotgun expression library sequencing for high-throughput characterization of functional traits

2018 ◽  
Author(s):  
Vivek K. Mutalik ◽  
Pavel S. Novichkov ◽  
Morgan N. Price ◽  
Trenton K. Owens ◽  
Mark Callaghan ◽  
...  

AbstractA major challenge in genomics is the knowledge gap between sequence and its encoded function. Gain-of-function methods based on gene overexpression are attractive avenues for phenotype-based functional screens, but are not easily applied in high-throughput across many experimental conditions. Here, we present Dual Barcoded Shotgun Expression Library Sequencing (Dub-seq), a method that greatly increases the throughput of genome-wide overexpression assays. In Dub-seq, a shotgun expression library is cloned between dual random DNA barcodes and the precise breakpoints of DNA fragments are associated to the barcode sequences prior to performing assays. To assess the fitness of individual strains carrying these plasmids, we use DNA barcode sequencing (BarSeq), which is amenable to large-scale sample multiplexing. As a demonstration of this approach, we constructed a Dub-seq library with total Escherichia coli genomic DNA, performed 155 genome-wide fitness assays in 52 experimental conditions, and identified 813 genes with high-confidence overexpression phenotypes across 4,151 genes assayed. We show that Dub-seq data is reproducible, accurately recapitulates known biology, and identifies hundreds of novel gain-of-function phenotypes for E. coli genes, a subset of which we verified with assays of individual strains. Dub-seq provides complementary information to loss-of-function approaches such as transposon site sequencing or CRISPRi and will facilitate rapid and systematic functional characterization of microbial genomes.ImportanceMeasuring the phenotypic consequences of overexpressing genes is a classic genetic approach for understanding protein function; for identifying drug targets, antibiotic and metal resistance mechanisms; and for optimizing strains for metabolic engineering. In microorganisms, these gain-of-function assays are typically done using laborious protocols with individually archived strains or in low-throughput following qualitative selection for a phenotype of interest, such as antibiotic resistance. However, many microbial genes are poorly characterized and the importance of a given gene may only be apparent under certain conditions. Therefore, more scalable approaches for gain-of-function assays are needed. Here, we present Dual Barcoded Shotgun Expression Library Sequencing (Dub-seq), a strategy that couples systematic gene overexpression with DNA barcode sequencing for large-scale interrogation of gene fitness under many experimental conditions at low cost. Dub-seq can be applied to many microorganisms and is a valuable new tool for large-scale gene function characterization.

Genes ◽  
2019 ◽  
Vol 10 (7) ◽  
pp. 536 ◽  
Author(s):  
Xiaobo Zhao ◽  
Liming Gan ◽  
Caixia Yan ◽  
Chunjuan Li ◽  
Quanxi Sun ◽  
...  

Long non-coding RNAs (lncRNAs) are involved in various regulatory processes although they do not encode protein. Presently, there is little information regarding the identification of lncRNAs in peanut (Arachis hypogaea Linn.). In this study, 50,873 lncRNAs of peanut were identified from large-scale published RNA sequencing data that belonged to 124 samples involving 15 different tissues. The average lengths of lncRNA and mRNA were 4335 bp and 954 bp, respectively. Compared to the mRNAs, the lncRNAs were shorter, with fewer exons and lower expression levels. The 4713 co-expression lncRNAs (expressed in all samples) were used to construct co-expression networks by using the weighted correlation network analysis (WGCNA). LncRNAs correlating with the growth and development of different peanut tissues were obtained, and target genes for 386 hub lncRNAs of all lncRNAs co-expressions were predicted. Taken together, these findings can provide a comprehensive identification of lncRNAs in peanut.


2010 ◽  
Vol 5 ◽  
pp. BMI.S5062 ◽  
Author(s):  
Stephanie J. Loomis ◽  
Lana M. Olson ◽  
Louis R. Pasquale ◽  
Janey Wiggs ◽  
Daniel Mirel ◽  
...  

It is unclear if buccal cell samples contain sufficient human DNA with adequately sized fragments for high throughput genetic bioassays. Yet buccal cell sample collection is an attractive alternative to gathering blood samples for genetic epidemiologists engaged in large-scale genetic biomarker studies. We assessed the genotyping efficiency (GE) and genotyping concordance (GC) of buccal cell DNA samples compared to corresponding blood DNA samples, from 32 Nurses' Health Study (NHS) participants using the Illumina Infinium 660W-Quad platform. We also assessed how GE and GC accuracy varied as a function of DNA concentration using serial dilutions of buccal DNA samples. Finally we determined the nature and genomic distribution of discordant genotypes in buccal DNA samples. The mean GE of undiluted buccal cell DNA samples was high (99.32%), as was the GC between the paired buccal and blood samples (99.29%). GC between the dilutions versus the undiluted buccal DNA was also very high (>97%), though both GE and GC notably declined at DNA concentrations less than 5 ng/μl. Most (>95%) genotype determinations in buccal cell samples were of the “missing call” variety (as opposed to the “alternative genotype call” variety) across the spectrum of buccal DNA concentrations studied. Finally, for buccal DNA concentration above 1.7 ng/ul, discordant genotyping calls did not cluster in any particular chromosome. Buccal cell-derived DNA represents a viable alternative to blood DNA for genotyping on a high-density platform.


Author(s):  
Johan O. L. Andreasson ◽  
Michael R. Gotrik ◽  
Michelle J. Wu ◽  
Hannah K. Wayment-Steele ◽  
Wipapat Kladwang ◽  
...  

AbstractInternet-based scientific communities promise a means to apply distributed, diverse human intelligence towards previously intractable scientific problems. However, current implementations have not allowed communities to propose experiments to test all emerging hypotheses at scale or to modify hypotheses in response to experiments. We report high-throughput methods for molecular characterization of nucleic acids that enable the large-scale videogame-based crowdsourcing of functional RNA sensor design, followed by high-throughput functional characterization. Iterative design testing of thousands of crowdsourced RNA sensor designs produced near-thermodynamically optimal and reversible RNA switches that act as self-contained molecular sensors and couple five distinct small molecule inputs to three distinct protein binding and fluorogenic outputs—results that surpass computational and expert-based design. This work represents a new paradigm for widely distributed experimental bioscience.One Sentence SummaryOnline community discovers standalone RNA sensors.


2018 ◽  
Author(s):  
Adrià Fernández-Torras ◽  
Miquel Duran-Frigola ◽  
Patrick Aloy

AbstractBackgroundThe integration of large-scale drug sensitivity screens and genome-wide experiments is changing the field of pharmacogenomics, revealing molecular determinants of drug response without the need for previous knowledge about drug action. In particular, transcriptional signatures of drug sensitivity may guide drug repositioning, prioritize drug combinations and point to new therapeutic biomarkers. However, the inherent complexity of transcriptional signatures, with thousands of differentially expressed genes, makes them hard to interpret, thus giving poor mechanistic insights and hampering translation to clinics.MethodsTo simplify drug signatures, we have developed a network-based methodology to identify functionally coherent gene modules. Our strategy starts with the calculation of drug-gene correlations and is followed by a pathway-oriented filtering and a network-diffusion analysis across the interactome.ResultsWe apply our approach to 189 drugs tested in 671 cancer cell lines and observe a connection between gene expression levels of the modules and mechanisms of action of the drugs. Further, we characterize multiple aspects of the modules, including their functional categories, tissue-specificity and prevalence in clinics. Finally, we prove the predictive capability of the modules and demonstrate how they can be used as gene sets in conventional enrichment analyses.ConclusionsNetwork biology strategies like module detection are able to digest the outcome of large-scale pharmacogenomic initiatives, thereby contributing to their interpretability and improving the characterization of the drugs screened.


BMC Biology ◽  
2021 ◽  
Vol 19 (1) ◽  
Author(s):  
Nicole Gruenheit ◽  
Amy Baldwin ◽  
Balint Stewart ◽  
Sarah Jaques ◽  
Thomas Keller ◽  
...  

Abstract Background Genomes can be sequenced with relative ease, but ascribing gene function remains a major challenge. Genetically tractable model systems are crucial to meet this challenge. One powerful model is the social amoeba Dictyostelium discoideum, a eukaryotic microbe widely used to study diverse questions in the cell, developmental and evolutionary biology. Results We describe REMI-seq, an adaptation of Tn-seq, which allows high throughput, en masse, and quantitative identification of the genomic site of insertion of a drug resistance marker after restriction enzyme-mediated integration. We use REMI-seq to develop tools which greatly enhance the efficiency with which the sequence, transcriptome or proteome variation can be linked to phenotype in D. discoideum. These comprise (1) a near genome-wide resource of individual mutants and (2) a defined pool of ‘barcoded’ mutants to allow large-scale parallel phenotypic analyses. These resources are freely available and easily accessible through the REMI-seq website that also provides comprehensive guidance and pipelines for data analysis. We demonstrate that integrating these resources allows novel regulators of cell migration, phagocytosis and macropinocytosis to be rapidly identified. Conclusions We present methods and resources, generated using REMI-seq, for high throughput gene function analysis in a key model system.


2017 ◽  
Author(s):  
Raamesh Deshpande ◽  
Justin Nelson ◽  
Scott W. Simpkins ◽  
Michael Costanzo ◽  
Jeff S. Piotrowski ◽  
...  

Large-scale genetic interaction screening is a powerful approach for unbiased characterization of gene function and understanding systems-level cellular organization. While genome-wide screens are desirable as they provide the most comprehensive interaction profiles, they are resource and time-intensive and sometimes infeasible, depending on the species and experimental platform. For these scenarios, optimal methods for more efficient screening while still producing the maximal amount of information from the resulting profiles are of interest.To address this problem, we developed an optimal algorithm, called COMPRESS-GI, which selects a small but informative set of genes that captures most of the functional information contained within genome-wide genetic interaction profiles. The utility of this algorithm is demonstrated through an application of the approach to define a diagnostic mutant set for large-scale chemical genetic screens, where more than 13,000 compound screens were achieved through the increased throughput enabled by the approach. COMPRESS-GI can be broadly applied for directing genetic interaction screens in other contexts, including in species with little or no prior genetic-interaction data.


ExRNA ◽  
2019 ◽  
Vol 1 (1) ◽  
Author(s):  
Fangfang Jin ◽  
Yuling Sun

Abstract Procambarus clarkii is one of the important economic species in China and has been served as tasty food in recent years after being introduced to Nanjing. Significant problems of environment factors, such as salinity, pH and temperature, especially salinity, has the potential to result in significant economic losses in many crayfish-producing farms in China. miRNAs are a kind of ~ 22 nucleotide small non coding RNAs which were encoded by plants, animals and some viruses with functions in RNA silencing or post-transcription regulation. We constructed four sRNA library of P. clarkia from different tissues and treatments by using high-throughput sequencing technology. A total of 101 conserved miRNAs and two novel pre-miRNAs were identified and RT-qPCR were further performed to confirm existence of part of identified miRNAs. A genome wide expression profile of salt-tolerance miRNAs was proved and three miRNAs were further validated by RT-qPCR with dynamic response to different salinity stages. The study of miRNAs in P. clarkia can help us better understanding the role of miRNAs in salt-tolerance in P. clarkia.


Sign in / Sign up

Export Citation Format

Share Document