scholarly journals Genome-Wide Analysis of Protein Family Diversity Provides Insights into Species-Specific Protein Family Expansions in Insects

Author(s):  
Mehmet DAYI

Abstract Insects are one of the earliest land animals with more than 400 million years old history on Earth, and they compose more than 80% of species. Insects invade a wide range of ecosystems and are considered one of the most evolutionary successful organism groups. Today, many insect species’ genomes have been sequenced to encode molecular mechanisms behind this magnificent evolutionary plasticity. However, only limited genome-wide studies have been carried out to compare protein family diversity in insects. A total of 20 insect species belonging to seven insect orders and two morphogenesis groups were investigated for evolutionary relationships and to uncover protein family diversity in the present study. The phylogenetic analysis inferred from a total of 530 one-to-one single-copy ortholog genes were separated insects into two evolutionary clades based on morphogenesis. Protein family analyses showed that insects share core protein families that perform essential tasks in development and metabolic processes, such as Pkinase and Zinc Finger, cellular signaling and odorant perception (7tm), digestion of food molecules (Trypsin), and detoxification (p450) with copy number expansion compared to other protein families. Additionally, species-specific protein family expansion was observed in various protein families. This study provided insights into protein family diversity and variation among insects and highlights high copy number variation in protein families species-wide.

2019 ◽  
Author(s):  
Junhua Rao ◽  
Lihua Peng ◽  
Fang Chen ◽  
Hui Jiang ◽  
Chunyu Geng ◽  
...  

AbstractBackgroundNext-generation sequence (NGS) has rapidly developed in past years which makes whole-genome sequencing (WGS) becoming a more cost- and time-efficient choice in wide range of biological researches. We usually focus on some variant detection via WGS data, such as detection of single nucleotide polymorphism (SNP), insertion and deletion (Indel) and copy number variant (CNV), which playing an important role in many human diseases. However, the feasibility of CNV detection based on WGS by DNBSEQ™ platforms was unclear. We systematically analysed the genome-wide CNV detection power of DNBSEQ™ platforms and Illumina platforms on NA12878 with five commonly used tools, respectively.ResultsDNBSEQ™ platforms showed stable ability to detect slighter more CNVs on genome-wide (average 1.24-fold than Illumina platforms). Then, CNVs based on DNBSEQ™ platforms and Illumina platforms were evaluated with two public benchmarks of NA12878, respectively. DNBSEQ™ and Illumina platforms showed similar sensitivities and precisions on both two benchmarks. Further, the difference between tools for CNV detection was analyzed, and indicated the selection of tool for CNV detection could affected the CNV performance, such as count, distribution, sensitivity and precision.ConclusionThe major contribution of this paper is providing a comprehensive guide for CNV detection based on WGS by DNBSEQ™ platforms for the first time.


Blood ◽  
2005 ◽  
Vol 106 (11) ◽  
pp. 2946-2946
Author(s):  
Justyna Rawluk ◽  
Dietmar Pfeifer ◽  
Milena Pantic ◽  
Hendrik Veelken

Abstract Chronic lymphocytic leukemia (CLL) is a heterogeneous disease characterized by different recurrent chromosomal aberrations with prognostic significance. Therefore, convenient genome-wide assays to identify genetic changes with high accuracy are required to assess the risk category of an individual patient, and to identify genetic mechanisms underlying the malignant transformation. Array-based CGH with custom-made chips has been proposed for this purpose (Schwaenen et al., 2004). We analyzed unselected peripheral blood cells from 50 consecutive CLL patients (33 with mutated IgVH genes) for the presence of genetic changes using Affymetrix 10k and 50k SNP mapping arrays that allow copy number analysis with a mean intermarker distance 300 kb and 60 kb, respectively. The detection limit of the assay with respect to tumor cell content was 25 to 40 % as assessed by validation experiments with a commercially available panel of FISH probes for typical recurrent CLL aberrations. 14 cases (28%) failed to show any detectable abnormalities. Among the common prognostically important aberrations, we readily identified del13q14 in 26 (52%), trisomy 12 in 9 (18%), del11q22 in 6 (12%), and del17p13 in 3 cases (6%). In 9 cases, additional losses in 17q (3 cases), 14q (2 cases), 5q (2 cases), 1q, 18p and 20q were detected. Likewise, multiple gains like trisomy 3, dup17q and dup2p (2 cases) were diagnosed. In addition to their convenient detection, size and extent of genomic aberrations can be defined with high resolution due to the precise positional information of the SNPs: In 2 out of 6 patients with a del11q22, in which deletion of the ATM (ataxia-telangiectasia mutated) gene is thought to represent the critical event, the ATM gene was present, but a 1Mb deletion around the RDX (radixin) gene was revealed. These deletions were missed by FISH analysis with ATM-spanning probes. Similarly, TP53 was still present in 1 of 3 patients with del17p13, despite the common assumption that this deletion causes the loss of a TP53 allele. An observed gain of 2p16 could be narrowed down to a 3Mb region containing the REL gene. Overexpression of REL may contribute to resistance to DNA-damage-induced apoptosis in CLL cells in vitro (Vallat et al., 2003). Finally, both the 10k and 50k SNP arrays showed the extent of the most frequent del13q14 in a wide range with a size distribution of 500kb to over 30Mb. However, for a more reliable detection of the del13q14 core region, the 50k array may be preferable to the 10k version, since this region is covered by 34 compared to 4 SNPs, respectively. Unexpectedly, 3 of 6 cases with a homozygous del13q14 showed loss of heterozygosity (LOH) associated with a normal copy number distal to the del13q14 extending to the telomere. A similar phenomenon has recently been described in AML patients (Raghavan et al., 2005) and can be explained by a somatic recombination event. Preliminary analyses demonstrated additional large regions of homozygosity ranging from 3 to over 20Mb on various chromosomes in at least 12 CLL cases. In conclusion, genomic profiling with Affymetrix SNP microarrays is an efficient screening method for simultaneous genome-wide detection of submicroscopic aberrations. Information on genotype with dosis information reveals hitherto undetected regions of homozygosity that may harbor imprinted genes or loss-of-function alleles with importance for the pathogenesis of CLL.


Blood ◽  
2018 ◽  
Vol 132 (Supplement 1) ◽  
pp. 1917-1917
Author(s):  
Jiye Liu ◽  
Wenrong Zhou ◽  
Lijie Xing ◽  
Liang Lin ◽  
Kenneth Wen ◽  
...  

Abstract Major improvements in patient outcome have resulted from the development of novel agents targeting multiple myeloma (MM) cells in the bone marrow (BM) microenvironment. Immunomodulatory drugs (IMiDs) including lenalidomide and pomalidomide bind to the CRL4CRBN ubiquitin ligase and promote proteasomal degradation of IKZF1/3 followed by induction of growth inhibition in MM cells. Although IMiDs-based treatments have achieved significant responses and improved outcomes in MM patients, acquired resistance to IMiDs commonly underlies relapse. Although the molecular mechanisms regulating sensitivity to IMiDs have not been fully defined, cereblon (CRBN) is considered to be the primary binding target of IMiDs. However, some MM cells show resistance to IMiDs despite harboring high CRBN expression levels. In this study, we show that TRAF2, a member of TNF receptor associated factor protein family, represents a novel regulator of IMiDs sensitivity in MM cells. To study the molecular mechanisms underlying IMiDs resistance, we first performed genome-wide knockout screening in IMiDs-sensitive MM.1S cells using a CRISPR-Cas9 GeCKOv2 library containing 6 unique sgRNAs against each of 19,050 genes and 4 sgRNAs against each of 1,864 miRNAs Twenty-eight genes and one miRNA were identified which were associated with resistance to IMiDs. Of note, all six sgRNAs targeting CRBN were identified, consistent with previous studies. Among these novel genes, we found that three different sgRNAs targeting TRAF2 were enriched after IMiDs selection. Therefore, we next individually cloned the sgRNAs of TRAF2 into the Cas9 lentiviral vector, and then re-introduced them into MM.1S cells. Importantly, TRAF2 knockout (KO) MM.1S cells acquire significant resistance to pomalidomide and lenalidomide treatments. To examine whether TRAF2 KO-induced IMiDs resistance was CRBN-pathway dependent, we assessed CRBN and its downstream protein levels. TRAF2 KO showed no effect on CRBN expression; IMiDs can still induce IKZF1/3 degradation, associated with downregulation of IRF4 in TRAF2 KO cells. Taken together, these data suggest that TRAF2 mediates sensitivity of IMiDs, independent of CRBN-IKZF1/3 axis. Since TRAF2 is a member of the TNF receptor associated factor (TRAF) protein family required for activation of several signal pathways including NF-ĸB and JNK, we next examined signaling cascades modulated by TRAF2 KO. Importantly, TRAF2-KO MM cells revealed increased processing of p52 (NF-ĸB2) from its precursor p100, resulting in hyperactivation of the non-canonical NF-κB pathway. The soluble factors (ie, cytokines) secreted by MM cells have important paracrine role on MM cell growth and drug resistance. We therefore performed cytokine analysis of proteins secreted by TRAF2 KO cells, and found that chemokine (C-C motif) ligand 5 (CCL5), a target gene of NF-kB pathway, was upregulated in TRAF2 KO cells. Moreover, upregulation of CCL5 was also observed in acquired IMiDs resistant MM1.S cells, which was confirmed by quantitative real-time PCR. In addition, exogenous CCL5 in the culture medium partially downregulated sensitivity to IMiDs treatment. In conclusion, we here identify and validate TRAF2 as a novel regulator of IMiDs sensitivity, independent of CRBN. Our mechanistic studies provide the preclinical rationale for combination treatment strategies with non-canonical NF-kB inhibitors to overcome IMiDs resistance in MM. Disclosures Zhou: WuXi App Tec Inc: Employment. Anderson:Oncopep: Equity Ownership; C4 Therapeutics: Equity Ownership; Celgene: Consultancy; Takeda Millennium: Consultancy; Bristol Myers Squibb: Consultancy; Gilead: Membership on an entity's Board of Directors or advisory committees.


2019 ◽  
Vol 12 (1) ◽  
Author(s):  
Minqiu Lu ◽  
Ying Hu ◽  
Yin Wu ◽  
Huixing Zhou ◽  
Yuan Jian ◽  
...  

Abstract Background Long noncoding RNAs (lncRNAs) are involved in a wide range of biological processes in tumorigenesis. However, the role of lncRNA expression in the biology, prognosis, and molecular classification of human multiple myeloma (MM) remains unclear, especially the biological functions of the vast majority of lncRNAs. Recently, lncRNAs have been identified in neoplastic hematologic disorders. Evidence has accumulated on the molecular mechanisms of action of lncRNAs, providing insight into their functional roles in tumorigenesis. This study aimed to characterize potential lncRNAs in patients with MM. Methods In this study, the whole-transcriptome strand-specific RNA sequencing of samples from three newly diagnosed patients with MM was performed. The whole transcriptome, including lncRNAs, microRNAs, and mRNAs, was analyzed. Using these data, MM lncRNAs were systematically analyzed, and the lncRNAs involved in the occurrence of MM were identified. Results The results revealed that MM lncRNAs had distinctive characteristics different from those of other malignant tumors. Further, the functions of a set of lncRNAs preferentially expressed in MM were verified, and several lncRNAs were identified as competing endogenous RNAs. More importantly, the aberrant expression of certain lncRNAs, including maternally expressed gene3, colon cancer–associated transcript1, and coiled-coil domain-containing 26, as well as some novel lncRNAs involved in the occurrence of MM was established. Further, lncRNAs were related to some microRNAs, regulated each other, and participated in MM development. Conclusions Genome-wide screening and functional analysis enabled the identification of a set of lncRNAs involved in the occurrence of MM. The interaction exists among microRNAs and lncRNAs.


2018 ◽  
Vol 115 (25) ◽  
pp. 6482-6487 ◽  
Author(s):  
Hui Shi ◽  
Mohan Lyu ◽  
Yiwen Luo ◽  
Shoucheng Liu ◽  
Yue Li ◽  
...  

Three families of transcription factors have been reported to play key roles in light control of Arabidopsis seedling morphogenesis. Among them, bHLH protein PIFs and plant-specific protein EIN3/EIN3-LIKE 1 (EIN3/EIL1) accumulate in the dark to maintain skotomorphogenesis. On the other hand, HY5 and HY5 HOMOLOG (HYH), two related bZIP proteins, are stabilized in light and promote photomorphogenic development. To systemically investigate the transcriptional regulation of light-controlled seedling morphogenesis, we generated HY5ox/pifQein3eil1, which contained mutations of EIN3/EIL1 and four PIF genes (pifQein3eil1) and overexpression of HY5. Our results show that dark-grown HY5ox/pifQein3eil1 seedlings display a photomorphogenesis highly similar to that of wild-type seedlings grown in continuous light, with remarkably enhanced photomorphogenic phenotypes compared with the pifQ mutants. Consistent with the genetic evidence, transcriptome analysis indicated that PIFs, EIN3/EIL1, and HY5 are dominant transcription factors in collectively mediating a wide range of light-caused genome-wide transcriptional changes. Moreover, PIFs and EIN3/EIL1 independently control the expression of light-regulated genes such as HLS1 to cooperatively regulate apical hook formation, hypocotyl elongation, and cotyledon opening and expansion. This study illustrates a comprehensive regulatory network of transcription activities that correspond to specific morphological aspects in seedling skotomorphogenesis and photomorphogenesis.


2011 ◽  
Vol 43 (14) ◽  
pp. 855-872 ◽  
Author(s):  
Ajit N. Shah ◽  
Daniela Cadinu ◽  
R. Michael Henke ◽  
Xiantong Xin ◽  
Ranita Ghosh Dastidar ◽  
...  

Hypoxia is a widely occurring condition experienced by diverse organisms under numerous physiological and disease conditions. To probe the molecular mechanisms underlying hypoxia responses and tolerance, we performed a genome-wide screen to identify mutants with enhanced hypoxia tolerance in the model eukaryote, the yeast Saccharomyces cerevisiae . Yeast provides an excellent model for genomic and proteomic studies of hypoxia. We identified five genes whose deletion significantly enhanced hypoxia tolerance. They are RAI1, NSR1, BUD21, RPL20A, and RSM22, all of which encode functions involved in ribosome biogenesis. Further analysis of the deletion mutants showed that they minimized hypoxia-induced changes in polyribosome profiles and protein synthesis. Strikingly, proteomic analysis by using the iTRAQ profiling technology showed that a substantially fewer number of proteins were changed in response to hypoxia in the deletion mutants, compared with the parent strain. Computational analysis of the iTRAQ data indicated that the activities of a group of regulators were regulated by hypoxia in the wild-type parent cells, but such regulation appeared to be diminished in the deletion strains. These results show that the deletion of one of the genes involved in ribosome biogenesis leads to the reversal of hypoxia-induced changes in gene expression and related regulators. They suggest that modifying ribosomal function is an effective mechanism to minimize hypoxia-induced specific protein changes and to confer hypoxia tolerance. These results may have broad implications in understanding hypoxia responses and tolerance in diverse eukaryotes ranging from yeast to humans.


2018 ◽  
Vol 58 (7) ◽  
pp. 1192 ◽  
Author(s):  
K. Karimi ◽  
A. Esmailizadeh ◽  
D. D. Wu ◽  
C. Gondro

The objective of this study was to present the first map of the copy number variations (CNVs) in Iranian indigenous cattle based on a high-density single nucleotide polymorphism (SNP) dataset. A total of 90 individuals were genotyped using the Illumina BovineHD BeadChip containing 777 962 SNPs. The QuantiSNP algorithm was used to perform a genome-wide CNV detection across autosomal genome. After merging the overlapping CNV, a total of 221 CNV regions were identified encompassing 36.4 Mb or 1.44% of the bovine autosomal genome. The length of the CNV regions ranged from 3.5 to 2252.8 Kb with an average of 163.8 Kb. These regions included 147 loss (66.52%) and 74 gain (33.48%) events containing a total of 637 annotated Ensembl genes. Gene ontology analysis revealed that most of genes in the CNV regions were involved in environmental responses, disease susceptibility and immune system functions. Furthermore, 543 of these genes corresponded to the human orthologous genes, which involved in a wide range of biological functions. Altogether, 73% of the 221 CNV regions overlapped either completely or partially with those previously reported in other cattle studies. Moreover, novel CNV regions involved several quantitative trait loci (QTL)-related to adaptative traits of Iranian indigenous cattle. These results provided a basis to conduct future studies on association between CNV regions and phenotypic variations in the Iranian indigenous cattle.


2017 ◽  
Author(s):  
Jacob L. Steenwyk ◽  
Antonis Rokas

AbstractIn recent years, copy number (CN) variation has emerged as a new and significant source of genetic polymorphisms contributing to the phenotypic diversity of populations. CN variants are defined as genetic loci that, due to duplication and deletion, vary in their number of copies across individuals in a population. CN variants range in size from 50 base pairs to whole chromosomes, can influence gene activity, and are associated with a wide range of phenotypes in diverse organisms, including the budding yeastSaccharomyces cerevisiae.In this review, we introduce CN variation, discuss the genetic and molecular mechanisms implicated in its generation, how they can contribute to genetic and phenotypic diversity in fungal populations, and consider how CN variants may influence wine yeast adaptation in fermentation-related processes. In particular, we focus on reviewing recent work investigating the contribution of changes in CN of fermentation-related genes associated with the adaptation and domestication of yeast wine strains and offer notable illustrations of such changes, including the high levels of CN variation among theCUPgenes, which confer resistance to copper, and the preferential deletion and duplication of theMALIandMAL3loci, respectively, which are responsible for metabolizing maltose and sucrose. Based on the available data, we propose that CN variation is a substantial dimension of yeast genetic diversity that occurs largely independent of single nucleotide polymorphisms. As such, CN variation harbors considerable potential for understanding and manipulating yeast strains in the wine fermentation environment and beyond.


2019 ◽  
Vol 9 (1) ◽  
Author(s):  
Francois Olivier Hebert ◽  
Luca Freschi ◽  
Gwylim Blackburn ◽  
Catherine Béliveau ◽  
Ken Dewar ◽  
...  

Abstract Two subspecies of Asian gypsy moth (AGM), Lymantria dispar asiatica and L. dispar japonica, pose a serious alien invasive threat to North American forests. Despite decades of research on the ecology and biology of this pest, limited AGM-specific genomic resources are currently available. Here, we report on the genome sequences and functional content of these AGM subspecies. The genomes of L.d. asiatica and L.d. japonica are the largest lepidopteran genomes sequenced to date, totaling 921 and 999 megabases, respectively. Large genome size in these subspecies is driven by the accumulation of specific classes of repeats. Genome-wide metabolic pathway reconstructions suggest strong genomic signatures of energy-related pathways in both subspecies, dominated by metabolic functions related to thermogenesis. The genome sequences reported here will provide tools for probing the molecular mechanisms underlying phenotypic traits that are thought to enhance AGM invasiveness.


2016 ◽  
Vol 149 (3) ◽  
pp. 176-181 ◽  
Author(s):  
Olutobi A. Oluwole ◽  
Tamas Revay ◽  
Kiana Mahboubi ◽  
Laura A. Favetta ◽  
W. Allan King

Somatic mosaicism has become a focus in human research due to the implications of individual genetic variability in disease. Here, we assessed somatic copy number variations (CNVs) in Holstein bulls in 2 respects. We estimated genome-wide CNVs and assayed CNVs of the TSPY gene, the most variable bovine gene from the Y chromosome. Somatic tissues (blood, lung, heart, muscle, testis, and brain) of 4 bulls were arrayed on the Illumina Bovine SNP50k chip and qPCR tested for TSPY copy numbers. Our results showed extensive copy number divergence in tissues within the same animal as well as significant copy number alterations of TSPY. We detected a mean of 31 CNVs per animal among which 14 were of germline origin, as they were constantly present in all investigated tissues of the animal, while 18 were specific to 1 tissue. Thus, 57% of the total number of detected CNVs was the result of de novo somatic events. Further, TSPY copy number was found to vary significantly among tissues as well as among the same tissue type from different animals in a wide range from 7 to 224% of the calibrator. Our study shows significant autosomal and Y-chromosomal de novo somatic CNV in bulls.


Sign in / Sign up

Export Citation Format

Share Document