scholarly journals Prokaryotic gene-family size correlations

Author(s):  
Gon Carmi ◽  
Alexander Bolshoy

The existence of multiple copies of genes is a well-known phenomenon. A gene family is a set of sufficiently similar genes, formed by gene duplication. In earlier works conducted on limited number of completely sequenced and annotated genomes it was found that size of gene family and size of genome are positively correlated. Additionally, it was found that several atypical microbes deviated from the observed general trend. In this study, we reexamined these associations on a larger dataset consisting of 1484 prokaryotic genomes and using several ranking approaches. We applied ranking methods in such a way that genomes with lower number of paralogs would have lower rank. Until now only simple ranking methods were used; we applied the Kemeny optimal aggregation approach as well. Regression and correlation analysis were utilized in order to accurately quantify and characterize the relationships between measures of paralog indices and genome size. In addition, boxplot analysis was employed as a method for outlier detection. We found that, in general, all paralog indexes positively correlate with an increase of genome size. As expected, different groups of atypical prokaryotic genomes were found for different types of paralog quantities.

2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Katie Emelianova ◽  
Andrea Martínez Martínez ◽  
Lucia Campos-Dominguez ◽  
Catherine Kidner

AbstractBegonia is an important horticultural plant group, as well as one of the most speciose Angiosperm genera, with over 2000 described species. Genus wide studies of genome size have shown that Begonia has a highly variable genome size, and analysis of paralog pairs has previously suggested that Begonia underwent a whole genome duplication. We address the contribution of gene duplication to the generation of diversity in Begonia using a multi-tissue RNA-seq approach. We chose to focus on chalcone synthase (CHS), a gene family having been shown to be involved in biotic and abiotic stress responses in other plant species, in particular its importance in maximising the use of variable light levels in tropical plants. We used RNA-seq to sample six tissues across two closely related but ecologically and morphologically divergent species, Begonia conchifolia and B. plebeja, yielding 17,012 and 19,969 annotated unigenes respectively. We identified the chalcone synthase gene family members in our Begonia study species, as well as in Hillebrandia sandwicensis, the monotypic sister genus to Begonia, Cucumis sativus, Arabidopsis thaliana, and Zea mays. Phylogenetic analysis suggested the CHS gene family has high duplicate turnover, all members of CHS identified in Begonia arising recently, after the divergence of Begonia and Cucumis. Expression profiles were similar within orthologous pairs, but we saw high inter-ortholog expression variation. Sequence analysis showed relaxed selective constraints on some ortholog pairs, with substitutions at conserved sites. Evidence of pseudogenisation and species specific duplication indicate that lineage specific differences are already beginning to accumulate since the divergence of our study species. We conclude that there is evidence for a role of gene duplication in generating diversity through sequence and expression divergence in Begonia.


2021 ◽  
Author(s):  
Katie Emelianova ◽  
Andrea Martínez Martínez ◽  
Lucia Campos-Dominguez ◽  
Catherine Kidner

Abstract Begonia is an important horticultural plant, as well as one of the most speciose Angiosperm genera, with over 2000 described species. Genus wide studies of genome size have shown that Begonia has a highly variable genome size, and analysis of paralog pairs has previously suggested that Begonia underwent a whole genome duplication. We address the contribution of gene duplication to the generation of diversity in Begonia using a multi-tissue RNA-seq approach. We chose to focus on the chalcone synthase (CHS) gene family due to its role in biotic and abiotic stress response, and in particular its importance in maximising the use of variable light levels in tropical plants. We used RNA-seq to sample six tissues across two closely related but ecologically and morphologically divergent species, Begonia conchifolia and B. plebeja, yielding 17,012 and 19,969 annotated unigenes respectively. We identified the chalcone synthase gene family members in our Begonia study species, as well as in Hillebrandia sandwicensis, the monotypic sister genus to Begonia, Cucumis sativus, Arabidopsis thaliana, and Zea mays. Phylogenetic and expression analysis revealed the recent origin of CHS duplicates in Begonia, which showed both conserved and divergent expression profiles between duplicates. We conclude that there is evidence for a role of gene duplication in generating diversity through expression divergence in Begonia.


1989 ◽  
Vol 9 (3) ◽  
pp. 1224-1232 ◽  
Author(s):  
W F Ng ◽  
F Sarangi ◽  
R L Zastawny ◽  
L Veinot-Drebot ◽  
V Ling

Overproduction of P-glycoprotein is intimately associated with multidrug resistance. This protein appears to be encoded by a multigene family. Thus, differential expression of different members of this family may contribute to the complexity of the multidrug resistance phenotype. Three lambda genomic clones isolated from a hamster genomic library represent different members of the hamster P-glycoprotein gene family. Using a highly conserved exon probe, we found that the hamster P-glycoprotein gene family consists of three genes. We also found that the P-glycoprotein gene family consists of three genes in mice but has only two genes in humans and rhesus monkeys. The hamster P-glycoprotein genes have similar exon-intron organizations within the 3' region encoding the cytoplasmic domains. We propose that the hamster P-glycoprotein gene family arose from gene duplication. The hamster pgp1 and pgp2 genes appear to be more closely related to each other than either gene is to the pgp3 gene. We speculate that the hamster pgp1 and pgp2 genes arose from a recent gene duplication event and that primates did not undergo this duplication and therefore contain only two P-glycoprotein genes.


2021 ◽  
Vol 8 ◽  
Author(s):  
Kai-Lu Zhang ◽  
Jian-Li Zhou ◽  
Jing-Fang Yang ◽  
Yu-Zhen Zhao ◽  
Debatosh Das ◽  
...  

As a pivotal regulator of 5’ splice site recognition, U1 small nuclear ribonucleoprotein (U1 snRNP)-specific protein C (U1C) regulates pre-mRNA splicing by interacting with other components of the U1 snRNP complex. Previous studies have shown that U1 snRNP and its components are linked to a variety of diseases, including cancer. However, the phylogenetic relationships and expression profiles of U1C have not been studied systematically. To this end, we identified a total of 110 animal U1C genes and compared them to homologues from yeast and plants. Bioinformatics analysis shows that the structure and function of U1C proteins is relatively conserved and is found in multiple copies in a few members of the U1C gene family. Furthermore, the expression patterns reveal that U1Cs have potential roles in cancer progression and human development. In summary, our study presents a comprehensive overview of the animal U1C gene family, which can provide fundamental data and potential cues for further research in deciphering the molecular function of this splicing regulator.


2015 ◽  
Vol 26 (06) ◽  
pp. 1550067 ◽  
Author(s):  
Chao Gao ◽  
Lu Zhong ◽  
Xianghua Li ◽  
Zili Zhang ◽  
Ning Shi

Identifying influential nodes is of theoretical significance in many domains. Although lots of methods have been proposed to solve this problem, their evaluations are under single-source attack in scale-free networks. Meanwhile, some researches have speculated that the combinations of some methods may achieve more optimal results. In order to evaluate this speculation and design a universal strategy suitable for different types of networks under the consideration of multi-source attacks, this paper proposes an attribute fusion method with two independent strategies to reveal the correlation of existing ranking methods and indicators. One is based on feature union (FU) and the other is based on feature ranking (FR). Two different propagation models in the fields of recommendation system and network immunization are used to simulate the efficiency of our proposed method. Experimental results show that our method can enlarge information spreading and restrain virus propagation in the application of recommendation system and network immunization in different types of networks under the condition of multi-source attacks.


Sign in / Sign up

Export Citation Format

Share Document