scholarly journals Structural variations in papaya genomes

BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Zhenyang Liao ◽  
Xunxiao Zhang ◽  
Shengcheng Zhang ◽  
Zhicong Lin ◽  
Xingtan Zhang ◽  
...  

Abstract Background Structural variations (SVs) are a type of mutations that have not been widely detected in plant genomes and studies in animals have shown their role in the process of domestication. An in-depth study of SVs will help us to further understand the impact of SVs on the phenotype and environmental adaptability during papaya domestication and provide genomic resources for the development of molecular markers. Results We detected a total of 8083 SVs, including 5260 deletions, 552 tandem duplications and 2271 insertions with deletion being the predominant, indicating the universality of deletion in the evolution of papaya genome. The distribution of these SVs is non-random in each chromosome. A total of 1794 genes overlaps with SV, of which 1350 genes are expressed in at least one tissue. The weighted correlation network analysis (WGCNA) of these expressed genes reveals co-expression relationship between SVs-genes and different tissues, and functional enrichment analysis shows their role in biological growth and environmental responses. We also identified some domesticated SVs genes related to environmental adaptability, sexual reproduction, and important agronomic traits during the domestication of papaya. Analysis of artificially selected copy number variant genes (CNV-genes) also revealed genes associated with plant growth and environmental stress. Conclusions SVs played an indispensable role in the process of papaya domestication, especially in the reproduction traits of hermaphrodite plants. The detection of genome-wide SVs and CNV-genes between cultivated gynodioecious populations and wild dioecious populations provides a reference for further understanding of the evolution process from male to hermaphrodite in papaya.

2020 ◽  
Vol 19 ◽  
pp. 153303382097748
Author(s):  
Shao-wei Zhang ◽  
Nan Zhang ◽  
Na Wang

Background: Esophageal cancer (EC) is a primary malignant tumor originating from the esophageal of the epithelium. Surgical resection is a potential treatment for EC, but this is only appropriate for patients who have locally resectable lesions suitable for surgery. However, most patients with EC are at a late stage when diagnosed. Therefore, there is an urgent need to further explore the pathogenesis of EC to enable early diagnosis and treatment. Methods: Our study downloaded 2 expression spectrum datasets (GSE92396 and GSE100942) in the Gene Expression Omnibus (GEO) database. GEO2 R was used to identify the Differentially expressed genes (DEGs) between the samples of EC and control. Using the DAVID tool to make the Functional enrichment analysis. Constructing A protein–protein interaction (PPI) network. Identifying the Hub genes. The impact of hub gene expression on overall survival and their expression based on immunohistochemistry were analyzed. Associated microRNAs were also predicted. Results: There were 36 common DEGs identified. The analysis of GO and KEGG results shown that the variations were predominantly concentrated in the extracellular matrix (ECM), ECM organization, DNA binding, platelet activation, and ECM-receptor interactions. COL3A1 and POSTN had high expression in EC tissues which was compared with their expression in healthy tissues. Analysis of pathologic stages showed that when COL3A1 and POSTN were highly expressed, the stage of the pathologic of EC patients was relatively high (P < 0.005). Conclusions: COL3A1 and POSTN may play an important role in the advancement and occurrence of EC. These genes could provide some novel ideas and basis for the diagnosis and targeted treatment of EC.


2020 ◽  
Author(s):  
HongBo Ma ◽  
XiaoLi Wu ◽  
MiaoMiao Tao ◽  
Uthaman Jithin ◽  
GenDou Zhou ◽  
...  

Abstract Non-small cell lung cancer (NSCLC) is one of the most common causes of cancer-related death globally, and lung adenocarcinoma (LUAD) accounts for almost 40% of all lung cancer cases. In recent years, despite better understanding of the pathogenesis of the disease and achievements in the multimodal treatment of tumors, there is an urgent need to identify new diagnostic and prognostic biomarkers In this study, we aim to identify the potential key genes related to the pathogenesis and prognosis of LUAD by using comprehensive bioinformatics analysis. The gene expression profile was downloaded from The Cancer Genome Atlas (TCGA) database, and we calculated the LUAD immune scores and stromal scores by using the ESTIMATE algorithm. Based on these scores, we further quantified the immune and stromal components and obtained the differentially expressed genes (DEGs) in the tumor. Overall survival analysis could better reflect the impact of genes related to immune and stromal cells on the prognosis. Moreover, Gene Ontology (GO) functional enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment were conducted, while protein-protein interaction (PPI) network was obtained from STRING. Analysis of the correlation revealed that these genes are mainly involved in the immune/inflammatory response. In conclusion, our study showed that 11 prognostic genes (CD33, IRF8, CD80, CD53, IL16, LY86, CD79B, TYROBP, CD1E, CD1C, and CD1B) might show a potentially good performance in predicting overall survival in patients with LUAD. In summary, we identified the key genes related to the microenvironment, which can further serve as the prognostic biomarkers and therapeutic targets for LUAD.


2020 ◽  
Author(s):  
HongBo Ma ◽  
XiaoLi Wu ◽  
MiaoMiao Tao ◽  
Uthaman Jithin ◽  
GenDou Zhou ◽  
...  

Abstract Non-small cell lung cancer (NSCLC) is one of the most common causes of cancer-related death globally, and lung adenocarcinoma (LUAD) accounts for almost 40% of all lung cancer cases. In recent years, despite better understanding of the pathogenesis of the disease and achievements in the multimodal treatment of tumors, there is an urgent need to identify new diagnostic and prognostic biomarkers In this study, we aim to identify the potential key genes related to the pathogenesis and prognosis of LUAD by using comprehensive bioinformatics analysis. The gene expression profile was downloaded from The Cancer Genome Atlas (TCGA) database, and we calculated the LUAD immune scores and stromal scores by using the ESTIMATE algorithm. Based on these scores, we further quantified the immune and stromal components and obtained the differentially expressed genes (DEGs) in the tumor. Overall survival analysis could better reflect the impact of genes related to immune and stromal cells on the prognosis. Moreover, Gene Ontology (GO) functional enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment were conducted, while protein-protein interaction (PPI) network was obtained from STRING. Analysis of the correlation revealed that these genes are mainly involved in the immune/inflammatory response. In conclusion, our study showed that 11 prognostic genes (CD33, IRF8, CD80, CD53, IL16, LY86, CD79B, TYROBP, CD1E, CD1C, and CD1B) might show a potentially good performance in predicting overall survival in patients with LUAD. In summary, we identified the key genes related to the microenvironment, which can further serve as the prognostic biomarkers and therapeutic targets for LUAD.


2020 ◽  
Author(s):  
HongBo Ma ◽  
XiaoLi Wu ◽  
MiaoMiao Tao ◽  
Uthaman Jithin ◽  
GenDou Zhou ◽  
...  

Abstract Non-small cell lung cancer (NSCLC) is one of the most common causes of cancer-related death globally, and lung adenocarcinoma (LUAD) accounts for almost 40% of all lung cancer cases. In recent years, despite better understanding of the pathogenesis of the disease and achievements in the multimodal treatment of tumors, there is an urgent need to identify new diagnostic and prognostic biomarkers In this study, we aim to identify the potential key genes related to the pathogenesis and prognosis of LUAD by using comprehensive bioinformatics analysis. The gene expression profile was downloaded from The Cancer Genome Atlas (TCGA) database, and we calculated the LUAD immune scores and stromal scores by using the ESTIMATE algorithm. Based on these scores, we further quantified the immune and stromal components and obtained the differentially expressed genes (DEGs) in the tumor. Overall survival analysis could better reflect the impact of genes related to immune and stromal cells on the prognosis. Moreover, Gene Ontology (GO) functional enrichment analysis and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment were conducted, while protein-protein interaction (PPI) network was obtained from STRING. Analysis of the correlation revealed that these genes are mainly involved in the immune/inflammatory response. In conclusion, our study showed that 11 prognostic genes (CD33, IRF8, CD80, CD53, IL16, LY86, CD79B, TYROBP, CD1E, CD1C, and CD1B) might show a potentially good performance in predicting overall survival in patients with LUAD. In summary, we identified the key genes related to the microenvironment, which can further serve as the prognostic biomarkers and therapeutic targets for LUAD.


2020 ◽  
Author(s):  
Gulden Olgun ◽  
Oznur Tastan

AbstractThe dysregulation of long non-coding RNAs’ (lncRNAs) expressions has been implicated in cancer. Since most of the lncRNAs’ are not functionally characterized well, investigating the set of perturbed lncRNAs are is challenging. Existing methods that inspect lncRNAs functionally rely on the co-expressed coding genes, which are far better characterized functionally. LncRNAs can be known to act as transcriptional regulators; they may activate or repress the neighborhood’s coding genes on the genome. Based on this, in this work, we aim to analyze the deregulated lncRNAs in cancer by taking into account their ability to regulate nearby loci on the genome. We perform functional analysis on differentially expressed lncRNAs for 28 different cancers considering their adjacent coding genes. We identify that some deregulated lncRNAs are cancer-specific, but a substantial number of lncRNAs are shared across cancers. Next, we assess the similarities of the cancer types based on the functional enrichment of the deregulated lncRNA sets. We find some cancers are very similar in the functions and biological processes related to the deregulated lncRNAs. We observe that some of the cancers for which we find similarity can be linked through primary, metastatic site relations. We investigate the similarity of enriched functional terms for the deregulated lncRNAs and the mRNAs. We further assess the enriched functions’ similarity to the functions and processes that the known cancer driver genes take place. We believe that our methodology help to understand the impact of the lncRNAs in cancer functionally.


2019 ◽  
Vol 14 (7) ◽  
pp. 591-601 ◽  
Author(s):  
Aravind K. Konda ◽  
Parasappa R. Sabale ◽  
Khela R. Soren ◽  
Shanmugavadivel P. Subramaniam ◽  
Pallavi Singh ◽  
...  

Background: Chickpea is a nutritional rich premier pulse crop but its production encounters setbacks due to various stresses and understanding of molecular mechanisms can be ascribed foremost importance. Objective: The investigation was carried out to identify the differentially expressed WRKY TFs in chickpea in response to herbicide stress and decipher their interacting partners. Methods: For this purpose, transcriptome wide identification of WRKY TFs in chickpea was done. Behavior of the differentially expressed TFs was compared between other stress conditions. Orthology based cofunctional gene networks were derived from Arabidopsis. Gene ontology and functional enrichment analysis was performed using Blast2GO and STRING software. Gene Coexpression Network (GCN) was constructed in chickpea using publicly available transcriptome data. Expression pattern of the identified gene network was studied in chickpea-Fusarium interactions. Results: A unique WRKY TF (Ca_08086) was found to be significantly (q value = 0.02) upregulated not only under herbicide stress but also in other stresses. Co-functional network of 14 genes, namely Ca_08086, Ca_19657, Ca_01317, Ca_20172, Ca_12226, Ca_15326, Ca_04218, Ca_07256, Ca_14620, Ca_12474, Ca_11595, Ca_15291, Ca_11762 and Ca_03543 were identified. GCN revealed 95 hub genes based on the significant probability scores. Functional annotation indicated role in callose deposition and response to chitin. Interestingly, contrasting expression pattern of the 14 network genes was observed in wilt resistant and susceptible chickpea genotypes, infected with Fusarium. Conclusion: This is the first report of identification of a multi-stress responsive WRKY TF and its associated GCN in chickpea.


Open Medicine ◽  
2020 ◽  
Vol 15 (1) ◽  
pp. 672-688
Author(s):  
Yanbo Dong ◽  
Siyu Lu ◽  
Zhenxiao Wang ◽  
Liangfa Liu

AbstractThe chaperonin-containing T-complex protein 1 (CCT) subunits participate in diverse diseases. However, little is known about their expression and prognostic values in human head and neck squamous cancer (HNSC). This article aims to evaluate the effects of CCT subunits regarding their prognostic values for HNSC. We mined the transcriptional and survival data of CCTs in HNSC patients from online databases. A protein–protein interaction network was constructed and a functional enrichment analysis of target genes was performed. We observed that the mRNA expression levels of CCT1/2/3/4/5/6/7/8 were higher in HNSC tissues than in normal tissues. Survival analysis revealed that the high mRNA transcriptional levels of CCT3/4/5/6/7/8 were associated with a low overall survival. The expression levels of CCT4/7 were correlated with advanced tumor stage. And the overexpression of CCT4 was associated with higher N stage of patients. Validation of CCTs’ differential expression and prognostic values was achieved by the Human Protein Atlas and GEO datasets. Mechanistic exploration of CCT subunits by the functional enrichment analysis suggests that these genes may influence the HNSC prognosis by regulating PI3K-Akt and other pathways. This study implies that CCT3/4/6/7/8 are promising biomarkers for the prognosis of HNSC.


2021 ◽  
Vol 28 (1) ◽  
pp. 20-33
Author(s):  
Lydia-Eirini Giannakou ◽  
Athanasios-Stefanos Giannopoulos ◽  
Chrissi Hatzoglou ◽  
Konstantinos I. Gourgoulianis ◽  
Erasmia Rouka ◽  
...  

Haemophilus influenzae (Hi), Moraxella catarrhalis (MorCa) and Pseudomonas aeruginosa (Psa) are three of the most common gram-negative bacteria responsible for human respiratory diseases. In this study, we aimed to identify, using the functional enrichment analysis (FEA), the human gene interaction network with the aforementioned bacteria in order to elucidate the full spectrum of induced pathogenicity. The Human Pathogen Interaction Database (HPIDB 3.0) was used to identify the human proteins that interact with the three pathogens. FEA was performed via the ToppFun tool of the ToppGene Suite and the GeneCodis database so as to identify enriched gene ontologies (GO) of biological processes (BP), cellular components (CC) and diseases. In total, 11 human proteins were found to interact with the bacterial pathogens. FEA of BP GOs revealed associations with mitochondrial membrane permeability relative to apoptotic pathways. FEA of CC GOs revealed associations with focal adhesion, cell junctions and exosomes. The most significantly enriched annotations in diseases and pathways were lung adenocarcinoma and cell cycle, respectively. Our results suggest that the Hi, MorCa and Psa pathogens could be related to the pathogenesis and/or progression of lung adenocarcinoma via the targeting of the epithelial cellular junctions and the subsequent deregulation of the cell adhesion and apoptotic pathways. These hypotheses should be experimentally validated.


AMB Express ◽  
2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Zhiyong Liu ◽  
Kai Dang ◽  
Cunzhi Li ◽  
Junhong Gao ◽  
Hong Wang ◽  
...  

Abstract Hexanitrohexaazaisowurtzitane (CL-20) is a compound with a polycyclic cage and an N-nitro group that has been shown to play an unfavorable role in environmental fate, biosafety, and physical health. The aim of this study was to isolate the microbial community and to identify a single microbial strain that can degrade CL-20 with desirable efficiency. Metagenomic sequencing methods were performed to investigate the dynamic changes in the composition of the community diversity. The most varied genus among the microbial community was Pseudomonas, which increased from 1.46% to 44.63% during the period of incubation (MC0–MC4). Furthermore, the new strain was isolated and identified from the activated sludge by bacterial morphological and 16s rRNA sequencing analyses. The CL-20 concentrations decreased by 75.21 μg/mL and 74.02 μg/mL in 48 h by MC4 and Pseudomonas sp. ZyL-01, respectively. Moreover, ZyL-01 could decompose 98% CL-20 of the real effluent in 14 day’s incubation with the glucose as carbon source. Finally, a draft genome sequence was obtained to predict possible degrading enzymes involved in the biodegradation of CL-20. Specifically, 330 genes that are involved in energy production and conversion were annotated by Gene Ontology functional enrichment analysis, and some of these candidates may encode enzymes that are responsible for CL-20 degradation. In summary, our studies indicate that microbes might be a valuable biological resource for the treatment of environmental contamination caused by CL-20 and that Pseudomonas sp. ZyL-01 might be a promising candidate for eradicating CL-20 to achieve a more biosafe environment and improve public health.


Sign in / Sign up

Export Citation Format

Share Document