core gene
Recently Published Documents


TOTAL DOCUMENTS

405
(FIVE YEARS 167)

H-INDEX

41
(FIVE YEARS 7)

2022 ◽  
Author(s):  
Gamze Boluk ◽  
Shefali Dobhal ◽  
Dario Arizala ◽  
Anne M. Alvarez ◽  
Mohammad Arif

Bacterial pathogens identified as Dickeya sp. have recently been associated with a corm rot of wetland taro on Oahu, Hawaii, but the species designation of these strains was unclear. A Gram-negative, pectinolytic bacterial strain PL65T isolated from an infected taro corm was subjected to polyphasic analysis to determine its genomic and phenotypic characteristics. Multi-locus sequence analyses (MLSA) based on five housekeeping genes (dnaA, gapA, gyrB, atpD, and purA) revealed that Dickeya zeae and D. oryzae, were the closest relatives. Phylogenetic analysis based on 463 core gene sequences clearly showed two potentially new species within Dickeya oryzae. In silico DNA-DNA hybridization value of strain PL65T with 12 Type strains of Dickeya species was <68%. Average nucleotide identity (ANI) analysis revealed that PL65T was at the margin of the species delineation cut-off values with a 96% ANI value. The metabolic profile of strain PL65T using BIOLOG differentiated it from the type strains of all other known species of Dickeya. Based on the results of genome-to-genome comparisons and phenotypic data presented in this report, we propose establishment of a new species, Dickeya colocasiae sp. nov. with strain PL65T as the type strain (ICMP 24361T).


2022 ◽  
Author(s):  
Tang Li ◽  
Yanbin Yin

Background: Large scale metagenome assembly and binning to generate metagenome-assembled genomes (MAGs) has become possible in the past five years. As a result, millions of MAGs have been produced and increasingly included in pan-genomics workflow. However, pan-genome analyses of MAGs may suffer from the known issues with MAGs: fragmentation, incompleteness, and contamination, due to mis-assembly and mis-binning. Here, we conducted a critical assessment of including MAGs in pan-genome analysis, by comparing pan-genome analysis results of complete bacterial genomes and simulated MAGs. Results: We found that incompleteness led to more significant core gene loss than fragmentation. Contamination had little effect on core genome size but had major influence on accessory genomes. The core gene loss remained when using different pan-genome analysis tools and when using a mixture of MAGs and complete genomes. Importantly, the core gene loss was partially alleviated by lowering the core gene threshold and using gene prediction algorithms that consider fragmented genes, but to a less degree when incompleteness was higher than 5%. The core gene loss also led to incorrect pan-genome functional predictions and inaccurate phylogenetic trees. Conclusions: We conclude that lowering core gene threshold and predicting genes in metagenome mode (as Anvio does with Prodigal) are necessary in pan-genome analysis of MAGs to alleviate the accuracy loss. Better quality control of MAGs and development of new pan-genome analysis tools specifically designed for MAGs are needed in future studies.


2021 ◽  
Author(s):  
Pathum Kossinna ◽  
Weijia Cai ◽  
Xuewen Lu ◽  
Carrie S Shemanko ◽  
Qingrun Zhang

Approaches systematically characterizing interactions via transcriptomic data usually follow two systems: (1) co-expression network analyses focusing on correlations between genes; (2) linear regressions (usually regularized) to select multiple genes jointly. Both suffer from the problem of stability: a slight change of parameterization or dataset could lead to dramatic alternations of outcomes. Here, we propose Stabilized Core gene and Pathway Election, or SCOPE, a tool integrating bootstrapped LASSO and co-expression analysis, leading to robust outcomes insensitive to variations in data. By applying SCOPE to six cancer expression datasets (BRCA, COAD, KIRC, LUAD, PRAD and THCA) in The Cancer Genome Atlas, we identified core genes capturing interaction effects in crucial pan-cancer pathways related to genome instability and DNA damage response. Moreover, we highlighted the pivotal role of CD63 as an oncogenic driver and a potential therapeutic target in kidney cancer. SCOPE enables stabilized investigations towards complex interactions using transcriptome data.


PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e12624
Author(s):  
Zhaodong Li ◽  
Hongtian Fei ◽  
Siyu Lei ◽  
Fengtong Hao ◽  
Lijie Yang ◽  
...  

Background Lung adenocarcinoma (LUAD) is the most prevalent tumor in lung carcinoma cases and threatens human life seriously worldwide. Here we attempt to identify a prognostic biomarker and potential therapeutic target for LUAD patients. Methods Differentially expressed genes (DEGs) shared by GSE18842, GSE75037, GSE101929 and GSE19188 profiles were determined and used for protein-protein interaction analysis, enrichment analysis and clinical correlation analysis to search for the core gene, whose expression was further validated in multiple databases and LUAD cells (A549 and PC-9) by quantitative real-time PCR (qRT-PCR) and western blot analyses. Its prognostic value was estimated using the Kaplan-Meier method, meta-analysis and Cox regression analysis based on the Cancer Genome Atlas (TCGA) dataset and co-expression analysis was conducted using the Oncomine database. Gene Set Enrichment Analysis (GSEA) was performed to illuminate the potential functions of the core gene. Results A total of 115 shared DEGs were found, of which 24 DEGs were identified as candidate hub genes with potential functions associated with cell cycle and FOXM1 transcription factor network. Among these candidates, HMMR was identified as the core gene, which was highly expressed in LUAD as verified by multiple datasets and cell samples. Besides, high HMMR expression was found to independently predict poor survival in patients with LUAD. Co-expression analysis showed that HMMR was closely related to FOXM1 and was mainly involved in cell cycle as suggested by GSEA. Conclusion HMMR might be served as an independent prognostic biomarker for LUAD patients, which needs further validation in subsequent studies.


2021 ◽  
Author(s):  
Wen-Ming Chen ◽  
Che-Chia Yang ◽  
Chiu-Chung Young ◽  
Shih-Yao Lin ◽  
Shih-Yi Sheu

Abstract Bacterial strain designated CSW-27T was isolated from a freshwater pond in Taiwan. Cells were Gram-stain-negative, aerobic, oxidase-positive, catalase-negative, rod-shaped and motile by flagella. Strain CSW-27T grew at 20-40 oC (optimum, 30-37 oC), at pH 5-9 (optimum, pH 6-7) and in the presence of 0-4% NaCl (optimum, 0%). Phylogenetic analyses based on 16S rRNA gene sequences and an up-to-date bacterial core gene set revealed that strain CSW-27T was affiliated with species in the genus Rhizobium. Analysis of 16S rRNA gene sequences showed that strain CSW-27T had the highest similarity to Rhizobium straminoryzae CC-LY845T (98.5%) followed by Rhizobium capsici CC-SKC2T (96.9%). The average nucleotide identity, average amino acid identity and digital DNA-DNA hybridization values between strain CSW-27T and the closely related Rhizobium species were 73.4-86.5, 66.0-88.8 and 13.3-22.1%, respectively. The principal fatty acid was summed feature 8 (C18:1ω7c and/or C18:1ω6c). The main polar lipids were phosphatidylethanolamine, phosphatidylglycerol, diphosphatidylglycerol, phosphatidylmonomethylethanolamine, phosphatidyldimethylethanolamine, phosphatidylcholine, one uncharacterized aminophospholipid, three uncharacterized aminolipids and two uncharacterized lipids. The predominant polyamine was spermidine. The major isoprenoid quinone was Q-10. Genomic DNA G+C content of strain CSW-27T was 63.3%. These polyphasic taxonomic data indicited that strain CSW-27T should be considered as representing a novel species in the genus Rhizobium, for which the name Rhizobium lacunae sp. nov. is proposed with strain CSW-27T (=BCRC 81244T =LMG 31684T) as the type strain.


PLoS ONE ◽  
2021 ◽  
Vol 16 (12) ◽  
pp. e0261487
Author(s):  
Wenli Jiao ◽  
Mengxue Xu ◽  
Rujun Zhou ◽  
Yiwei Fu ◽  
Zibo Li ◽  
...  

Elsinochromes (ESCs) are virulence factors produced by Elsinoë arachidis which is the cause of peanut scab. However, the biosynthesis pathway of ESCs in E. arachidis has not been elucidated and the potential pathogenic mechanism of E. arachidis is poorly understood. In this study, we report a high-quality genome sequence of E. arachidis. The size of the E. arachidis genome is 33.18Mb, which is comparable to the Ascomycota genome (average 36.91 Mb), encoding 9174 predicted genes. The self-detoxification family including transporters and cytochrome P450 enzymes were analysis, candidate effectors and cell wall degrading enzymes were investigated as the pathogenicity genes by using PHI and CAZy databases. Additionally, the E. arachidis genome contains 24 secondary metabolism gene clusters, in which ESCB1 was identified as the core gene of ESC biosynthesis. Taken together, the genome sequence of E. arachidis provides a new route to explore its potential pathogenic mechanism and the biosynthesis pathway of ESCs.


2021 ◽  
Vol 12 ◽  
Author(s):  
Haoyi Yang ◽  
Mingda Hu ◽  
Boqian Wang ◽  
Yuan Jin ◽  
Xingfei Gong ◽  
...  

Reassortment among avian influenza viruses is the main source of novel avian influenza virus subtypes. Studies have shown that the H9N2 virus often donates internal segments to generate novel reassortant avian influenza viruses, acting as a reassortment template. However, the characteristics of the internal pattern of reassortment remain unclear. In this article, we first defined the core gene pool of the internal segments of the H9N2 virus that provide templates for reassortment. We used genetic distance and sequence similarity to define typical clusters in the core gene pool. Then, we analyzed the phylogenetic relationships, feature vector distances, geographic distributions and mutation sites of strains related to the core gene pool. Strains in the same typical clusters have close phylogenetic relationships and feature vector distances. We also found that these typical clusters can be divided into three categories according to their main geographic distribution area. Furthermore, typical clusters in the same geographic area contain some common mutation patterns. Our results suggest that typical clusters in the core gene pool affect the reassortment events of the H9N2 virus in many respects, such as geographic distribution and amino acid mutation sites.


2021 ◽  
Author(s):  
David A Constant ◽  
Jacob A Van Winkle ◽  
Eden VanderHoek ◽  
Simone E Dekker ◽  
M Anthony Sofia ◽  
...  

The three types of interferon (IFN) have roles in antimicrobial immunity and inflammation that must be properly balanced to maintain tissue homeostasis. For example, IFNs are elevated in the context of inflammatory bowel disease and may synergize with inflammatory cytokines such as tumor necrosis factor alpha (TNFα) to promote tissue damage. Prior studies suggest that in mouse intestinal epithelial cells (IECs), type III IFNs are preferentially produced during viral infections and are less cytotoxic than type I IFN. Here, we generated human IEC organoid lines from biopsies of ileum, ascending colon, and sigmoid colon of three healthy subjects to establish the baseline responses of normal human IECs to types I, II, and III IFN. We found that all IFN types elicited responses that were qualitatively consistent across intestinal biopsy sites. However, IFN types differed in magnitude of STAT1 phosphorylation and identity of genes in their downstream transcriptional programs. Specifically, there was a core transcriptional module shared by IFN types, but types I and II IFN stimulated unique transcriptional modules beyond this core gene signature. The transcriptional modules of type I and II IFN included pro-apoptotic genes, and expression of these genes correlated with potentiation of TNFα cytotoxicity. These data define the response profiles of healthy human IEC organoids across IFN types, and suggest that cytotoxic effects mediated by TNFα in inflamed tissues may be amplified by a simultaneous high-magnitude IFN response.


2021 ◽  
Vol 43 (3) ◽  
pp. 2177-2188
Author(s):  
Hakjoon Choi ◽  
Wan Seok Kang ◽  
Jin Seok Kim ◽  
Chang-Su Na ◽  
Sunoh Kim

Scutellaria L. (family Lamiaceae) includes approximately 470 species found in most parts of the world and is commonly known as skullcaps. Scutellaria L. is a medicinal herb used as a folk remedy in Korea and East Asia, but it is difficult to identify and classify various subspecies by morphological methods. Since Scutellaria L. has not been studied genetically, to expand the knowledge of species in the genus Scutellaria L., de novo whole-genome assembly was performed in Scutellaria indica var. tsusimensis (H. Hara) Ohwi using the Illumina sequencing platform. We aimed to develop a molecular method that could be used to classify S.indica var. tsusimensis (H. Hara) Ohwi, S. indica L. and three other Scutellaria L. species. The assembly results for S.indica var. tsusimensis (H. Hara) Ohwi revealed a genome size of 318,741,328 bp and a scaffold N50 of 78,430. The assembly contained 92.08% of the conserved BUSCO core gene set and was estimated to cover 94.65% of the genome. The obtained genes were compared with previously registered Scutellaria nucleotide sequences and similar regions using the NCBI BLAST service, and a total of 279 similar nucleotide sequences were detected. By selecting the 279 similar nucleotide sequences and nine chloroplast DNA barcode genes, primers were prepared so that the size of the PCR product was 100 to 1000 bp. As a result, a species-specific primer set capable of distinguishing five species of Scutellaria L. was developed.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Lipeng Niu ◽  
Ce Gao ◽  
Yang Li

AbstractColorectal carcinoma (CRC) is one of the most prevalent malignant tumors worldwide. Meanwhile, the majority of CRC related deaths results from liver metastasis. Gene expression profile of CRC patients with liver Metastasis was identified using 4 datasets. The data was analyzed using GEO2R tool. GO and KEGG pathway analysis were performed. PPI network of the DEGs between 1 and 2 gene sets was also constructed. The set 1 is named between primary CRC tissues and metastatic CRC tissues. The set 2 is named between primary CRC tissues and normal tissues. Finally, the prognostic value of hub genes was also analyzed. 35 DEGs (set 1) and 142 DEGs (set 2) were identified between CRC liver metastatic cancer patients. The PPI network was constructed using the top 10 set 1 hub genes which included AHSG, SERPINC1, FGA, F2, CP, ITIH2, APOA2, HPX, PLG, HRG and set 2 hub genes which included TIMP1, CXCL1, COL1A2, MMP1, AURKA, UBE2C, CXCL12, TOP2A, ALDH1A1 and PRKACB. Therefore, ITIH2 might represent the potential core gene for colon cancer liver metastasis. COL1A2 behaves as a key gene in colorectal carcinoma.


Sign in / Sign up

Export Citation Format

Share Document