gene annotation
Recently Published Documents


TOTAL DOCUMENTS

651
(FIVE YEARS 368)

H-INDEX

36
(FIVE YEARS 9)

2022 ◽  
Author(s):  
Caroline M. Weisman ◽  
Andrew M. Murray ◽  
Sean R Eddy

Comparisons of genomes of different species are used to identify lineage-specific genes, those genes that appear unique to one species or clade. Lineage-specific genes are often thought to represent genetic novelty that underlies unique adaptations. Identification of these genes depends not only on genome sequences, but also on inferred gene annotations. Comparative analyses typically use available genomes that have been annotated using different methods, increasing the risk that orthologous DNA sequences may be erroneously annotated as a gene in one species but not another, appearing lineage-specific as a result. To evaluate the impact of such 'annotation heterogeneity', we identified four clades of species with sequenced genomes with more than one publicly available gene annotation, allowing us to compare the number of lineage-specific genes inferred when differing annotation methods are used to those resulting when annotation method is uniform across the clade. In these case studies, annotation heterogeneity increases the apparent number of lineage-specific genes by up to 15-fold, suggesting that annotation heterogeneity is a substantial source of potential artifact.


2022 ◽  
Author(s):  
Hugo Darras ◽  
Natalia de Souza Araujo ◽  
Lyam Baudry ◽  
Nadege Guiglielmoni ◽  
Pedro Lorite ◽  
...  

Cataglyphis are thermophilic ants that forage during the day when temperatures are highest and sometimes close to their critical thermal limit. Several Cataglyphis species have evolved unusual reproductive systems such as facultative queen parthenogenesis or social hybridogenesis, which have not yet been investigated in detail at the molecular level. We generated high-quality genome assemblies for two hybridogenetic lineages of the Iberian ant Cataglyphis hispanica using long-read Nanopore sequencing and exploited chromosome conformation capture (3C) sequencing to assemble contigs into 26 and 27 chromosomes, respectively. Males of one lineage were karyotyped to confirm the number of chromosomes inferred from 3C data. We obtained transcriptomic data to assist gene annotation and built custom repeat libraries for each of the two assemblies. Comparative analyses with 19 other published ant genomes were also conducted. These new genomic resources pave the way for exploring the genetic mechanisms underlying the remarkable thermal adaptation and the molecular mechanisms associated with transitions between different genetic systems characteristics of the ant genus Cataglyphis.


BMC Genomics ◽  
2022 ◽  
Vol 23 (1) ◽  
Author(s):  
Chang Pyo Hong ◽  
Chang-Kug Kim ◽  
Dong Jin Lee ◽  
Hee Jeong Jeong ◽  
Yi Lee ◽  
...  

Abstract Background Schisandra chinensis, an ancient member of the most basal angiosperm lineage which is known as the ANITA, is a fruit-bearing vine with the pharmacological effects of a multidrug system, such as antioxidant, anti-inflammatory, cardioprotective, neuroprotective, anti-osteoporosis effects. Its major bioactive compound is represented by lignans such as schisandrin. Molecular characterization of lignan biosynthesis in S. chinensis is of great importance for improving the production of this class of active compound. However, the biosynthetic mechanism of schisandrin remains largely unknown. Results To understand the potential key catalytic steps and their regulation of schisandrin biosynthesis, we generated genome-wide transcriptome data from three different tissues of S. chinensis cultivar Cheongsoon, including leaf, root, and fruit, via long- and short-read sequencing technologies. A total of 132,856 assembled transcripts were generated with an average length of 1.9 kb and high assembly completeness. Overall, our data presented effective, accurate gene annotation in the prediction of functional pathways. In particular, the annotation revealed the abundance of transcripts related to phenylpropanoid biosynthesis. Remarkably, transcriptome profiling during fruit development of S. chinensis cultivar Cheongsoon revealed that the phenylpropanoid biosynthetic pathway, specific to coniferyl alcohol biosynthesis, showed a tendency to be upregulated at the postfruit development stage. Further the analysis also revealed that the pathway forms a transcriptional network with fruit ripening-related genes, especially the ABA signaling-related pathway. Finally, candidate unigenes homologous to isoeugenol synthase 1 (IGS1) and dirigent-like protein (DIR), which are subsequently activated by phenylpropanoid biosynthesis and thus catalyze key upstream steps in schisandrin biosynthesis, were identified. Their expression was increased at the postfruit development stage, suggesting that they may be involved in the regulation of schisandrin biosynthesis in S. chinensis. Conclusions Our results provide new insights into the production and accumulation of schisandrin in S. chinensis berries and will be utilized as a valuable transcriptomic resource for improving the schisandrin content.


2022 ◽  
Vol 12 ◽  
Author(s):  
Yawei Li ◽  
Li Chu ◽  
Xiaofeng Liu ◽  
Nannan Zhang ◽  
Yufei Xu ◽  
...  

Soybean pubescence plays an important role in insect resistance, drought tolerance, and other stresses. Hence, a deep understanding of the molecular mechanism underlying pubescence is a prerequisite to a deeper understanding of insect resistance and drought tolerance. In the present study, quantitative trait loci (QTL) mapping of pubescence traits was performed using a high-density inter-specific linkage map of one recombinant inbred line (RIL) population, designated NJRINP. It was observed that pubescence length (PL) was negatively correlated with pubescence density (PD). A total of 10 and 9 QTLs distributed on six and five chromosomes were identified with phenotypic variance (PV) of 3.0–9.9% and 0.8–15.8% for PL and PD, respectively, out of which, eight and five were novel. Most decreased PL (8 of 10) and increased PD (8 of 9) alleles were from the wild soybean PI 342618B. Based on gene annotation, Protein ANalysis THrough Evolutionary Relationships and literature search, 21 and 12 candidate genes were identified related to PL and PD, respectively. In addition, Glyma.12G187200 from major QTLs qPL-12-1 and qPD-12-2, was identified as Ps (sparse pubescence) before, having an expression level of fivefold greater in NN 86-4 than in PI 342618B, hence it might be the candidate gene that is conferring both PL and PD. Based on gene expression and cluster analysis, three and four genes were considered as the important candidate genes of PL and PD, respectively. Besides, leaves with short and dense (SD) pubescence, which are similar to the wild soybean pubescence morphology, had the highest resistance to common cutworm (CCW) in soybean. In conclusion, the findings in the present study provide a better understanding of genetic basis and candidate genes information of PL and PD and the relationship with resistance to CCW in soybean.


2022 ◽  
Vol 12 (1) ◽  
Author(s):  
C. Tara Satyavathi ◽  
Rukam S. Tomar ◽  
Supriya Ambawat ◽  
Jasminkumar Kheni ◽  
Shital M. Padhiyar ◽  
...  

AbstractPearl millet is an important staple food crop of poor people and excels all other cereals due to its unique features of resilience to adverse climatic conditions. It is rich in micronutrients like iron and zinc and amenable for focused breeding for these micronutrients along with high yield. Hence, this is a key to alleviate malnutrition and ensure nutritional security. This study was conducted to identify and validate candidate genes governing grain iron and zinc content enabling the desired modifications in the genotypes. Transcriptome sequencing using ION S5 Next Generation Sequencer generated 43.5 million sequence reads resulting in 83,721 transcripts with N50 of 597 bp and 84.35% of transcripts matched with the pearl millet genome assembly. The genotypes having high iron and zinc showed differential gene expression during different stages. Of which, 155 were up-regulated and 251 were down-regulated while during flowering stage and milking stage 349 and 378 transcripts were differentially expressed, respectively. Gene annotation and GO term showed the presence of transcripts involved in metabolic activities associated with uptake and transport of iron and zinc. Information generated will help in gaining insights into iron and zinc metabolism and develop genotypes with high yield, grain iron and zinc content.


2022 ◽  
Author(s):  
Dong Xu ◽  
Kangming Jin ◽  
Heling Jiang ◽  
Desheng Gong ◽  
Jinbao Yang ◽  
...  

Sequence alignment is the basis of gene functional annotation for unknow sequences. Selecting closely related species as the reference species should be an effective way to improve the accuracy of gene annotation for plants, compared with only based on one or some model plants. Therefore, limited species number in previous software or website is disadvantageous for plant gene annotation. Here, we collected the protein sequences of 236 plant species with known genomic information from 63 families. After that, these sequences were annotated by pfam, Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) databases to construct our databases. Furthermore, we developed the software, Gene Annotation Software for Plants (GFAP), to perform gene annotation using our databases. GFAP, an open-source software running on Windows and MacOS systems, is an efficient and network independent tool. GFAP can search the protein domain, GO and KEGG information for 43000 genes within 4 minutes. In addition, GFAP can also perform the sequence alignment, statistical analysis and drawing. The website of https://gitee.com/simon198912167815/gfap-database provides the software, databases, testing data and video tutorials for users. GFAP contained large amount of plant-species information. We believe that it will become a powerful tool in gene annotation using closely related species for phytologists.


2022 ◽  
Vol 22 (1) ◽  
Author(s):  
Yanyan Gao ◽  
Jun Ji ◽  
Yujin Zhang ◽  
Ningxian Yang ◽  
Mingsheng Zhang

Abstract Background Cremastra appendiculata is a rare terrestrial orchid with a high market value as an ornamental and medicinal plant. However, the species depends entirely on fungi for seed germination under natural conditions. In a previous study, we have successfully isolated and identified the mycorrhizal fungus Coprinellus disseminatus which was able to induce the germination of C. appendiculata seeds. We then speculated that C. disseminatus may do so by breaking the testa imposed dormancy of the seeds. In this study, biochemical and transcriptomic analyses were used to characterize the germination of C. appendiculata seeds, collected at different stages of germination, as affected by C. disseminatus. Results The lignocellulose in the seeds coat of C. appendiculata was degraded by the mycorrhizal fungus resulting in facilitated absorption of water. The rate of decline in lignin content was 67 and 73% at 6 and 12 days after sowing, respectively. The water content increased from 13 to 90% during symbiosis. A total of 15,382 genes showing significantly different levels of expression (log2 FPKM≥2.0, Qvalue≤0.05) were successfully identified among all libraries, where the highest number of DEGs was shared between 6 days versus 0 day after symbiotic germination. Gene annotation results suggested that 15 key genes related water-status, such as DHN gene family and Xero 1 were down-regulated. The genes zeaxanthin epoxidase ZEP, 9-cis-epoxycarotenoid dioxygenase NCED3 and β-carotene hydroxylase involved in the biosynthesis of abscisic acid (ABA) were significantly down-regulated in 6 days as compared to 0 day after symbiotic germination. Conclusions This work demonstrates that mycorrhizal fungus C. disseminatus can stimulate C. appendiculata seeds germination through a mechanism of breaking the testa imposed dormancy and inducing water absorption of the embryo.


2022 ◽  
Vol 8 ◽  
Author(s):  
Xieyan Chen ◽  
Zihao He ◽  
Jichen Zhao ◽  
Minze Liao ◽  
Yuan Xue ◽  
...  

Biofloc technology (BFT) is one of the most promising technologies in global aquaculture for the purpose of improving water quality, waste treatment, and disease prevention in intensive aquaculture systems. However, characterization of the microbial species and antibiotic resistance potentially present in biofloc-based aquaculture environments is needed. In this study, we used high-throughput sequencing technology to comprehensively compare the bacterial communities in mariculture ponds of Penaeus monodon (P. monodon), by testing of water, biofloc, and intestine of P. monodon. Operational taxonomic units (OTUs) cluster analysis showed that the nine samples tested divided into 45 phyla and 457 genera. Proteobacteria was the dominant bacteria in water, biofloc and prawn intestine. In biofloc and intestine, the Ruegeria (2.23–6.31%) genus represented the largest proportion of bacteria, with Marivita (14.01–20.94%) the largest group in water. Microbial functional annotation revealed that in all the samples, genes encoding metabolism were predominant. The antibiotic resistance gene annotation showed the highest absolute abundance of patB, adeF, OXA-243, and Brucella_suis_mprF from Proteobacteria. PatB (11.33–15.01%), adeF (15.79–18.16%), OXA-243 (35.65%), and Brucella_suis_mprF (10.03%) showed the highest absolute abundance of antibiotic resistance genes in water, biofloc, and intestines, respectively. These findings may greatly increase our understanding of the characteristics of the microbiota of shrimp biofloc-based aquaculture systems and the complex interactions among shrimp, ambient microflora, and environmental variables. It provides a reference basis for policy on breeding, environmental safety, and maintaining food safety in the production of P. monodon.


2022 ◽  
Vol 13 (1) ◽  
Author(s):  
Mengqin Shen ◽  
Ruixue Zhang ◽  
Wenzhi Jia ◽  
Zongping Zhu ◽  
Li Zhao ◽  
...  

AbstractNuclear-localized epidermal growth factor receptor (EGFR) highly correlates with the malignant progression and may be a promising therapeutic target for breast cancer. However, molecular mechanisms of nuclear EGFR in triple-negative breast cancer (TNBC) have not been fully elucidated. Here, we performed gene-annotation enrichment analysis for the interactors of nuclear EGFR and found that RNA-binding proteins (RBPs) were closely associated with nuclear EGFR. We further demonstrated p54nrb/NONO, one of the RBPs, significantly interacted with nuclear EGFR. NONO was upregulated in 80 paired TNBC tissues and indicated a poor prognosis. Furthermore, NONO knockout significantly inhibited TNBC proliferation in vitro and in vivo. Mechanistically, NONO increased the stability of nuclear EGFR and recruited CREB binding protein (CBP) and its accompanying E1A binding protein p300, thereby enhancing the transcriptional activity of EGFR. In turn, EGFR positively regulated the affinity of NONO to mRNAs of nuclear EGFR downstream genes. Furthermore, the results indicated that the nuclear EGFR/NONO complex played a critical role in tumorigenesis and chemotherapy resistance. Taken together, our findings indicate that NONO enhances nuclear EGFR-mediated tumorigenesis and may be a potential therapeutic target for TNBC patients with nuclear EGFR expression.


GigaScience ◽  
2022 ◽  
Vol 11 (1) ◽  
Author(s):  
Youngik Yang ◽  
Ji Yong Yoo ◽  
Sang Ho Baek ◽  
Ha Yeun Song ◽  
Seonmi Jo ◽  
...  

Abstract Background The shuttles hoppfish (mudskipper), Periophthalmus modestus, is one of the mudskippers, which are the largest group of amphibious teleost fishes, which are uniquely adapted to live on mudflats. Because mudskippers can survive on land for extended periods by breathing through their skin and through the lining of the mouth and throat, they were evaluated as a model for the evolutionary sea-land transition of Devonian protoamphibians, ancestors of all present tetrapods. Results A total of 39.6, 80.2, 52.9, and 33.3 Gb of Illumina, Pacific Biosciences, 10X linked, and Hi-C data, respectively, was assembled into 1,419 scaffolds with an N50 length of 33 Mb and BUSCO score of 96.6%. The assembly covered 117% of the estimated genome size (729 Mb) and included 23 pseudo-chromosomes anchored by a Hi-C contact map, which corresponded to the top 23 longest scaffolds above 20 Mb and close to the estimated one. Of the genome, 43.8% were various repetitive elements such as DNAs, tandem repeats, long interspersed nuclear elements, and simple repeats. Ab initio and homology-based gene prediction identified 30,505 genes, of which 94% had homology to the 14 Actinopterygii transcriptomes and 89% and 85% to Pfam familes and InterPro domains, respectively. Comparative genomics with 15 Actinopterygii species identified 59,448 gene families of which 12% were only in P. modestus. Conclusions We present the high quality of the first genome assembly and gene annotation of the shuttles hoppfish. It will provide a valuable resource for further studies on sea-land transition, bimodal respiration, nitrogen excretion, osmoregulation, thermoregulation, vision, and mechanoreception.


Sign in / Sign up

Export Citation Format

Share Document