scholarly journals Identification and Functional Annotation of Long Intergenic Non-coding RNAs in the Brassicaceae

2021 ◽  
Author(s):  
Kyle Palos ◽  
Anna C. Nelson Dittrich ◽  
Li’ang Yu ◽  
Jordan R. Brock ◽  
Larry Wu ◽  
...  

AbstractLong intergenic noncoding RNAs (lincRNAs) are a large yet enigmatic class of eukaryotic transcripts with critical biological functions. Despite the wealth of RNA-seq data available, lincRNA identification lags in the plant lineage. In addition, there is a need for a harmonized identification and annotation effort to enable cross-species functional and genomic comparisons. In this study we processed >24 Tbp of RNA-seq data from >16,000 experiments to identify ~130,000 lincRNAs in four Brassicaceae: Arabidopsis thaliana, Camelina sativa, Brassica rapa, and Eutrema salsugineum. We used Nanopore RNA-seq, transcriptome-wide structural information, peptide data, and epigenomic data to characterize these lincRNAs and identify functional motifs. We then used comparative genomic and transcriptomic approaches to highlight lincRNAs in our dataset with sequence or transcriptional evolutionary conservation, including lincRNAs transcribed adjacent to orthologous genes that display little sequence similarity and likely function as transcriptional regulators. Finally, we used guilt-by-association techniques to further classify these lincRNAs according to putative function. LincRNAs with Brassicaceae-conserved putative miRNA binding motifs, short ORFs, and whose expression is modulated by abiotic stress are a few of the annotations that will prioritize and guide future functional analyses.

2020 ◽  
Author(s):  
Qi Lu ◽  
Fuhua Yan ◽  
Yuanyuan Liu ◽  
Qiaohong Li ◽  
Meng Yang ◽  
...  

The Gram-negative (G-) bacterial species Pectobacterium actinidiae causes summer canker in kiwifruit plants. However, little is known about its virulence factors and the mechanisms of genetic adaptation. We aimed to identify the key determinants that control the virulence of P. actinidiae to kiwifruit by genomic and functional analyses. Within four P. actinidiae isolates, low genetic variability displayed and shared an average 98.7% and 82% in genome-level sequence similarity and coding genes. Phylogenetic analysis, based on both the bulk of single nucleotide polymorphisms (SNPs) and genomes single-copy genes, revealed that P. actinidiae strains cluster into a sole clade, whichclosly related to the clades of P. odoriferum, the pathogen of vegetable soft rot from a completely different host range. By comparison with these two clades of genomes, 746 unique core orthologs/genes were enriched in the clades of P. actinidiae, especially for the key virulence determinants involved in the biosynthesis of secretion systems (type III, III and IV), iron, flagellar and the quorum-sensing system. Our results provide insights into the pathogenomics basis underlying the genetic diversification and evolution of pathogenicity in the species of P. actinidiae.


2017 ◽  
Vol 5 (32) ◽  
Author(s):  
Rose Waldron ◽  
Jamie McGowan ◽  
Natasha Gordon ◽  
Charley McCarthy ◽  
E. Bruce Mitchell ◽  
...  

ABSTRACT Dermatophagoides pteronyssinus is the European dust mite and a major source of human allergens. Here, we present the first draft genome sequence of the mite, as well as the ab initio gene prediction and functional analyses that will facilitate comparative genomic analyses with other mite species.


2011 ◽  
Vol 39 (5) ◽  
pp. 1353-1358 ◽  
Author(s):  
Simon Newstead

One of the principal aims of modern drug design is the targeted delivery of drugs within the body, such as to the central nervous system, combined with their exclusion from the liver and kidneys, which break down foreign molecules and subsequently eliminate them. Many of the commonly prescribed drugs are transported into cells and across the plasma membrane via endogenous membrane transporters, whose principal roles are the uptake of essential nutrients for metabolism. In many cases, such drug transport is serendipitous as they are simply mistaken as ‘natural’ compounds. Many of these transporters could, however, be targeted more efficiently, improving drug absorption, distribution and retention. The molecular details of these drug–transporter interactions, however, are at best poorly understood, in large part through the absence of any high-resolution structural information. To address this issue, we recently determined the structure of a prokaryotic peptide transporter, PepTSo from Shewanella oneidensis, which shares a high degree of sequence similarity and functional characteristics with the human PepT1 and PepT2 proteins. PepT1 and PepT2 contribute significantly to the oral bioavailability and pharmacokinetic properties of a number of important drug families, including antibiotics, antivirals and anticancer agents. The crystal structure of PepTSo provides the first high-resolution model of a drug importer and provides the starting point for understanding drug and peptide transport within the human body.


Author(s):  
Sanjeev Kumar ◽  
Victoria Hedrick ◽  
Seema Mattoo

Pasteurella multocida causes respiratory-tract infections in a broad range of animals, as well as opportunistic infections in humans. P. multocida secretes a multidomain toxin called PfhB2, which contains a YopT-like cysteine protease domain at its C-terminus. The YopT domain of PfhB2 contains a well conserved Cys–His–Asp catalytic triad that defines YopT family members, and shares high sequence similarity with the prototype YopT from Yersinia sp. To date, only one crystal structure of a YopT family member has been reported; however, additional structural information is needed to help characterize the varied substrate specificity and enzymatic action of this large protease family. Here, a catalytically inactive C3733S mutant of PfhB2 YopT that provides enhanced protein stability was used with the aim of gaining structural insight into the diversity within the YopT protein family. To this end, the C3733S mutant of PfhB2 YopT has been successfully cloned, overexpressed, purified and crystallized. Diffraction data sets were collected from native crystals to 3.5 Å resolution and a single-wavelength anomalous data set was collected from an iodide-derivative crystal to 3.2 Å resolution. Data pertaining to crystals belonging to space group P31, with unit-cell parameters a = 136.9, b = 136.9, c = 74.7 Å for the native crystals and a = 139.2, b = 139.2, c = 74.7 Å for the iodide-derivative crystals, are discussed.


Author(s):  
Fatima Foflonker ◽  
Crysten E Blaby-Haas

Abstract Diverging from the classic paradigm of random gene order in eukaryotes, gene proximity can be leveraged to systematically identify functionally related gene neighborhoods in eukaryotes, utilizing techniques pioneered in bacteria. Current methods of identifying gene neighborhoods typically rely on sequence similarity to characterized gene products. However, this approach is not robust for nonmodel organisms like algae, which are evolutionarily distant from well-characterized model organisms. Here, we utilize a comparative genomic approach to identify evolutionarily conserved proximal orthologous gene pairs conserved across at least two taxonomic classes of green algae. A total of 317 gene neighborhoods were identified. In some cases, gene proximity appears to have been conserved since before the streptophyte–chlorophyte split, 1,000 Ma. Using functional inferences derived from reconstructed evolutionary relationships, we identified several novel functional clusters. A putative mycosporine-like amino acid, “sunscreen,” neighborhood contains genes similar to either vertebrate or cyanobacterial pathways, suggesting a novel mosaic biosynthetic pathway in green algae. One of two putative arsenic-detoxification neighborhoods includes an organoarsenical transporter (ArsJ), a glyceraldehyde 3-phosphate dehydrogenase-like gene, homologs of which are involved in arsenic detoxification in bacteria, and a novel algal-specific phosphoglycerate kinase-like gene. Mutants of the ArsJ-like transporter and phosphoglycerate kinase-like genes in Chlamydomonas reinhardtii were found to be sensitive to arsenate, providing experimental support for the role of these identified neighbors in resistance to arsenate. Potential evolutionary origins of neighborhoods are discussed, and updated annotations for formerly poorly annotated genes are presented, highlighting the potential of this strategy for functional annotation.


2019 ◽  
Vol 36 (6) ◽  
pp. 1940-1941
Author(s):  
Nicolaas C Kist ◽  
Robert A Power ◽  
Andrew Skelton ◽  
Seth D Seegobin ◽  
Moira Verbelen ◽  
...  

Abstract Summary Mistakes in linking a patient’s biological samples with their phenotype data can confound RNA-Seq studies. The current method for avoiding such sample mix-ups is to test for inconsistencies between biological data and known phenotype data such as sex. However, in DNA studies a common QC step is to check for unexpected relatedness between samples. Here, we extend this method to RNA-Seq, which allows the detection of duplicated samples without relying on identifying inconsistencies with phenotype data. Results We present RNASeq_similarity_matrix: an automated tool to generate a sequence similarity matrix from RNA-Seq data, which can be used to visually identify sample mix-ups. This is particularly useful when a study contains multiple samples from the same individual, but can also detect contamination in studies with only one sample per individual. Availability and implementation RNASeq_similarity_matrix has been made available as a documented GPL licensed Docker image on www.github.com/nicokist/RNASeq_similarity_matrix.


2020 ◽  
Vol 8 (10) ◽  
pp. 1517
Author(s):  
Imen Nouioui ◽  
Carlos Cortés-Albayay ◽  
Meina Neumann-Schaal ◽  
Diego Vicente ◽  
Gustavo Cilla ◽  
...  

Strains 335427T and 234509T, isolated from two 76-year-old patients with chronic pulmonary diseases, were the subject of polyphasic taxonomic studies and comparative genomic analyses for virulence factors. The 16 rRNA gene sequence similarity between strains 335427T and 234509T and their closest phylogenetic neighbors Nocardia asiatica NBRC 100129T and Nocardia abscessus NBRC 100374T were 99.5% and 100%, respectively. Digital DNA–DNA hybridization values between the aforementioned studied strains were well below the 70% threshold for assigning prokaryotic strains to a novel species. Strains 335427T and 234509T have genome sizes of 8.49 Mpb and 8.07 Mpb, respectively, with G + C content of 68.5%. Isolate 335427T has C16:0, C18:1 ω9c, C18:0 and C18:0 10 methyl as major fatty acids (>15%) and mycolic acids formed of 52–54 carbon atoms. However, only C18:1 ω9c was detected for isolate 234509T, which had mycolic acids with 44–56 carbon. Based on phenotypic and genetic data, strains 335427T (DSM 109819T = CECT 9924T) and 234509T (DSM 111366T = CECT 30129T) merit recognition as novel species, which are named Nocardia barduliensis sp. nov. and Nocardia gipuzkoensis sp. nov., respectively. All the strains studied had homologous VF-associated genes to those described in M. tuberculosis, including experimentally verified virulence genes in humans related to tuberculosis. The narGHIJ (nitrate reduction pathway) and gvpAFGOJLMK (gas vesicles) genetic maps of strains 335427T, 234509T, NBRC 100129T and NBRC 100374T showed the same syntenic block and raise the question of whether their functions are interlinked during the infection of the human host. However, further research is required to decipher the role of the gas vesicle in the pathogenicity mechanism of Nocardia spp.


mBio ◽  
2019 ◽  
Vol 10 (3) ◽  
Author(s):  
Kira S. Makarova ◽  
Yuri I. Wolf ◽  
Svetlana Karamycheva ◽  
Dapeng Zhang ◽  
L. Aravind ◽  
...  

ABSTRACTNumerous, diverse, highly variable defense and offense genetic systems are encoded in most bacterial genomes and are involved in various forms of conflict among competing microbes or their eukaryotic hosts. Here we focus on the offense and self-versus-nonself discrimination systems encoded by archaeal genomes that so far have remained largely uncharacterized and unannotated. Specifically, we analyze archaeal genomic loci encoding polymorphic and related toxin systems and ribosomally synthesized antimicrobial peptides. Using sensitive methods for sequence comparison and the “guilt by association” approach, we identified such systems in 141 archaeal genomes. These toxins can be classified into four major groups based on the structure of the components involved in the toxin delivery. The toxin domains are often shared between and within each system. We revisit halocin families and substantially expand the halocin C8 family, which was identified in diverse archaeal genomes and also certain bacteria. Finally, we employ features of protein sequences and genomic locus organization characteristic of archaeocins and polymorphic toxins to identify candidates for analogous but not necessarily homologous systems among uncharacterized protein families. This work confidently predicts that more than 1,600 archaeal proteins, currently annotated as “hypothetical” in public databases, are components of conflict and self-versus-nonself discrimination systems.IMPORTANCEDiverse and highly variable systems involved in biological conflicts and self-versus-nonself discrimination are ubiquitous in bacteria but much less studied in archaea. We performed comprehensive comparative genomic analyses of the archaeal systems that share components with analogous bacterial systems and propose an approach to identify new systems that could be involved in these functions. We predict polymorphic toxin systems in 141 archaeal genomes and identify new, archaea-specific toxin and immunity protein families. These systems are widely represented in archaea and are predicted to play major roles in interactions between species and in intermicrobial conflicts. This work is expected to stimulate experimental research to advance the understanding of poorly characterized major aspects of archaeal biology.


Blood ◽  
2013 ◽  
Vol 122 (21) ◽  
pp. 81-81
Author(s):  
Guangzhen Hu ◽  
Thomas E Witzig ◽  
Mamta Gupta

Abstract Long noncoding RNAs (lncRNAs) are defined as RNA-like transcripts that are over 200 nucleotides and lack significant open reading frames. Some lncRNAs such as HOTAIR, MALAT1 and H19 have been found to be associated with clinical prognosis and are potential drivers of cancer progression in cancers of the breast, lung, and liver respectively. The role of lncRNAs in lymphoma is unknown. Dysregulation of eIF4E (a key component of the translation initiation complex eIF4F) influences global protein translation, especially the translation of “weak” mRNAs that can be malignancy-related. We and others have found that eIF4E is dysregulated in B-cell lymphoma. The aim of this study is to identify eIF4E-associated lncRNAs through next generation RNA-Sequencing (NGS RNA-Seq) and delineate their role in protein translation in lymphoma. RNA-immunoprecipitation (RNA-IP) was used to pull down eIF4E-bound lncRNA in lymphoma cells. eIF4E-bound lncRNAs were immunoprecipitated with eIF4E antibody or IgG control in Jeko, a mantle cell lymphoma (MCL) cell line and sent for microarray analysis and NGS-RNA-Seq for identification of lncRNAs. The microarray analysis showed that several lncRNAs were enriched with eIF4E antibody compared to IgG control. These included SNHG4 (13.6 fold), SNHG12 (4.8 fold), NCRNA00171 (4.8 fold) and IPW (4.6 fold), GNASAS (3.5 fold), SNHG7 (3.3 fold), NCRNA00182 (2.7 fold), NCRNA00094 (2.6 fold), NCRNA00188 (2.4 fold) and NCRNA00201 (2.1 fold). The binding of these lncRNAs to eIF4E was further confirmed by RT-PCR in Jeko, Mino and Granta MCL cell lines. Next, we looked the expression of these lncRNAs by qRT- PCR in the MCL cell lines and normal controls. We found SNHG4 and IPW to be overexpressed in all the MCL cell lines, while SNHG12 and NCRNA00201 were overexpressed in the selected cell lines. No significant difference was found for the expression of NCRNA00171 and NCRNA00182 in any of the MCL cell lines compared to controls. Overall, these data suggest that several lncRNA have altered expression in malignant B-cells. Considering that the microarray assay only covered a limited number of lncRNAs, we further confirmed eIF4E bound lncRNA by NGS RNA-Seq in Jeko MCL and normal control. The binding of 10/13 lncRNA mentioned above with eIF4E were found upregulated by NGS-RNA-Seq. In addition several novel lncRNAs such as SNHG1 (161.6), AC091814.2 (98.8) and RP11-304L19.5 (64.2) showed up in NGS-RNA-Seq data. These data suggest that lncRNAs, such as SNHG12, SNHG4, and SNHG1 bind to eIF4E with high affinity in malignant B-cells and might play a role in protein translation. We knocked down the expression of SNHG4 through siRNA and demonstrated that cell proliferation and global protein translation was inhibited in lymphoma cells. To further confirm the role of SNHG4 in translation regulation, a plasmid, which contains a renilla luciferase driven by SV40 promoter, was co-transfected with SNHG4 siRNA into Mino cells. The luciferase signal, decreased compared with the cells transfected with nontargeting siRNA. These data suggest that SNHG4 is involved in the regulation of protein translation. In order to clarify the mechanism of lncRNAs bound to eIF4E we searched for RNA binding sites or motifs in eIF4E protein using the web-based tools, BindN and PPRInt. Interestingly two RNA binding motifs, KNKRGGRWLITLNKQQRRS and SHADTATKSGSTTKNR, were found in eIF4E based on the prediction. To examine whether lncRNAs bind with eIF4E through these RNA binding motifs, an eIF4E mutant plasmid with both RNA binding motifs deleted (eIF4EDel), was constructed and transfected transiently into HEK-293T cells along with eIF4EWT plasmid. RNA-IP data showed that the lncRNAs SNHG12, SNHG4 and SNHG1 were not able to bind with eIF4E in eIF4EDel-transfected cells compared with that of eIF4EWT, suggesting that these lncRNAs complex with eIF4E through RNA-binding motifs within the eIF4E. Overall, our results show that the lncRNAs, SNHG1 and SNHG4 are able to bind with eIF4E and regulate protein translation. Since lncRNAs had been found to play roles in the regulation of gene expression, including transcription, splicing and mRNA stability, our results may broaden the view of the functional role of lncRNAs in translation in lymphoma cells and in other cancers. Furthermore, our results also suggested that SNHG4 lncRNAs might be served as potential biomarkers for MCL and other B cell lymphomas for translation therapy. Disclosures: No relevant conflicts of interest to declare.


2015 ◽  
Vol 33 (7_suppl) ◽  
pp. 316-316
Author(s):  
Yohann Loriot ◽  
Christophe Massard ◽  
Celine Lefebvre ◽  
Semih Dogan ◽  
Elena Ileanaecaterina ◽  
...  

316 Background: Although several studies have reported genomic characterisation of primary urothelial carcinoma (UC), little is known about the genomic alterations of the metastatic cisplatin-resistant UC. We sought to define the prevalence and co-occurrence of actionable genomic alterations in patients with metastatic cisplatin-resistant lethal UC. Methods: Patients with metastatic UC, who progressed after at least one line of standard therapy, were enrolled in a prospective molecular screening trial (MOSCATO 01) at Gustave Roussy. CT-Scan or ultrasound-guided biopsies were performed in metastases to carry out a comprehensive molecular characterization. DNA was extracted from fresh tumor samples and analyzed by comparative genomic hybridization (CGH) (≥ 30% tumor cells) and by Next Generation Sequencing (NGS) for up to 74 target genes (≥ 10% tumor cells). Whole-exome (WES) and RNA seq sequencing were performed retrospectively in selected cases. Results: From 12/2011 to 12/2013, 30 heavily pretreated patients with metastatic UC were included. Median age was 61 (37-73); all patients had been treated with platin-based chemotherapy with a median number of lines of 2.5 (1-5). A tumor biopsy could be performed in 26 patients (87%). CGH and targeted exome sequencing profiles were assessed in 19 (73%) and 23 (88%) of them, respectively. A total of 19 patients (73%) were profiled for both sequencing and CGH. WES and RNA seq was performed in 16 (62%) pairs of metastatic tissue and normal DNA (Integragen Inc, Hiseq platform). Our analyses identified potential therapeutic targets in 61 % of the tumours, including 31% with targets in the PI3K/AKT/mTOR pathway, 35% with targets in the FGF/FGFR pathway and 15% with targets in the RTK/MAPK pathway (including ERBB2). Others frequent aberrations were found in the chromatin regulatory genes (MLL gene family) and cell-cycle regulatory genes (E2F3, CDKN2A/B genes). FGFR gene fusions were found in 1 out of 16 screened patients (6%) Conclusions: Lethal platin-resistant UC exhibit high-level of genomic heterogeneity. The majority of these tumors harbour actionable genomic alterations that can be targeted with selective agents currently in clinical development (phase I trials) or registered.


Sign in / Sign up

Export Citation Format

Share Document