scholarly journals A global analysis of CNVs in Chinese indigenous fine-wool sheep populations using whole-genome resequencing

BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Chao Yuan ◽  
Zengkui Lu ◽  
Tingting Guo ◽  
Yaojing Yue ◽  
Xijun Wang ◽  
...  

Abstract Background Copy number variation (CNV) is an important source of genetic variation that has a significant influence on phenotypic diversity, economically important traits and the evolution of livestock species. In this study, the genome-wide CNV distribution characteristics of 32 fine-wool sheep from three breeds were analyzed using resequencing. Results A total of 1,747,604 CNVs were detected in this study, and 7228 CNV regions (CNVR) were obtained after merging overlapping CNVs; these regions accounted for 2.17% of the sheep reference genome. The average length of the CNVRs was 4307.17 bp. “Deletion” events took place more frequently than “duplication” or “both” events. The CNVRs obtained overlapped with previously reported sheep CNVRs to variable extents (4.39–55.46%). Functional enrichment analysis showed that the CNVR-harboring genes were mainly involved in sensory perception systems, nutrient metabolism processes, and growth and development processes. Furthermore, 1855 of the CNVRs were associated with 166 quantitative trait loci (QTL), including milk QTLs, carcass QTLs, and health-related QTLs, among others. In addition, the 32 fine-wool sheep were divided into horned and polled groups to analyze for the selective sweep of CNVRs, and it was found that the relaxin family peptide receptor 2 (RXFP2) gene was strongly influenced by selection. Conclusions In summary, we constructed a genomic CNV map for Chinese indigenous fine-wool sheep using resequencing, thereby providing a valuable genetic variation resource for sheep genome research, which will contribute to the study of complex traits in sheep.

2021 ◽  
Author(s):  
Chao Yuan ◽  
Zengkui Lu ◽  
Tingting Guo ◽  
Yaojing Yue ◽  
Xijun Wang ◽  
...  

Abstract Background Copy number variation (CNV) is an important source of genetic variation that has a significant influence on phenotypic diversity, economically important traits and the evolution of livestock species. In this study, the genome-wide CNV distribution characteristics of 32 fine-wool sheep from three breeds were analyzed using resequencing.Results A total of 1,747,604 CNVs were detected in this study, and 7,228 CNV regions (CNVR) were obtained after merging overlapping CNVs; these regions accounted for 2.17% of the sheep reference genome. The average length of the CNVRs was 4,307.17 bp. “Deletion” events took place more frequently than “duplication” or “both” events. The CNVRs obtained overlapped with previously reported sheep CNVRs to variable extents (4.39%–55.46%). Functional enrichment analysis showed that the CNVR-harboring genes were mainly involved in sensory perception systems, nutrient metabolism processes, and growth and development processes. Furthermore, 1,855 of the CNVRs were associated with 166 quantitative trait loci (QTL), including milk QTLs, carcass QTLs, and health-related QTLs, among others. In addition, the 32 fine-wool sheep were divided into horned and polled groups to analyze for the selective sweep of CNVRs, and it was found that the relaxin family peptide receptor 2 (RXFP2) gene was strongly influenced by selection.Conclusions In summary, we constructed a genomic CNV map for Chinese indigenous fine-wool sheep using resequencing, thereby providing a valuable genetic variation resource for sheep genome research, which will contribute to the study of complex traits in sheep.


2020 ◽  
Author(s):  
Chao Yuan ◽  
Zengkui Lu ◽  
Tingting Guo ◽  
Yaojing Yue ◽  
Xijun Wang ◽  
...  

Abstract Background: Copy number variation (CNV) is an important source of genetic variation that has a significant influence on phenotypic diversity, important economic traits and the evolution of livestock species. In this study, the genome-wide CNV distribution characteristics of 32 fine-wool sheep from three breeds were analyzed using resequencing. Results: A total of 1747604 CNVs were detected in this study, and 7228 CNV regions (CNVR) were obtained after merging overlapping CNVs; these regions accounted for 2.17% of the sheep reference genome. The average length of the CNVRs was 4307.17 bp. “Deletion” events took place more frequently than “duplication” or “both” events. The CNVRs obtained overlapped with previously reported sheep CNVRs to variable extents (4.39%–55.46%). Functional enrichment analysis showed that the CNVR-harboring genes were mainly involved in sensory perception systems, nutrient metabolism processes, and growth and development processes. Furthermore, 1855 of the CNVRs were associated with 166 quantitative trait loci (QTL), including milk QTLs, carcass QTLs, and health-related QTLs, among others. In addition, the 32 fine-wool sheep were divided into horned and polled groups to analyze for the selective elimination of CNVRs, and it was found that the relaxin family peptide receptor 2 ( RXFP2 ) gene was strongly influenced by selection. Conclusions: In summary, we constructed a genomic CNV map for Chinese indigenous fine-wool sheep using resequencing, thereby providing a valuable genetic variation resource for sheep genome research, which will contribute to the study of complex traits in sheep.


2019 ◽  
Author(s):  
chao yuan ◽  
Zengkui Lu ◽  
Tingting Guo ◽  
yaojing Yue ◽  
Xijun Wang ◽  
...  

Abstract Background Copy number variation (CNV) is an important source of genetic variation that has a significant influence on phenotypic diversity, important economic traits and the evolution of livestock species. In this study, the genome-wide CNV distribution characteristics of 32 fine-wool sheep from three breeds were analyzed using resequencing.Results A total of 1747604 CNVs were detected in this study, and 7228 CNV regions (CNVR) were obtained after merging overlapping CNVs; these regions accounted for 2.17% of the sheep reference genome. The average length of the CNVRs was 4307.17 bp. “Deletion” events took place more frequently than “duplication” or “both” events. The CNVRs obtained overlapped with previously reported sheep CNVRs to variable extents (4.39%–55.46%). Functional enrichment analysis showed that the CNVR-harboring genes were mainly involved in sensory perception systems, nutrient metabolism processes, and growth and development processes. Furthermore, 1855 of the CNVRs were associated with 166 quantitative trait loci (QTL), including milk QTLs, carcass QTLs, and health-related QTLs, among others. In addition, the 32 fine-wool sheep were divided into horned and polled groups to analyze for the selective elimination of CNVRs, and it was found that the RXFP2 gene was strongly influenced by selection.Conclusions In summary, we constructed a genomic CNV map for Chinese indigenous fine-wool sheep using resequencing, thereby providing a valuable genetic variation resource for sheep genome research, which will contribute to the study of complex traits in sheep.


2020 ◽  
Author(s):  
Chao Yuan ◽  
Zengkui Lu ◽  
Tingting Guo ◽  
Yaojing Yue ◽  
Xijun Wang ◽  
...  

Abstract BackgroundCopy number variation (CNV) is an important source of genetic variation that has a significant influence on phenotypic diversity, important economic traits and the evolution of livestock species. In this study, the genome-wide CNV distribution characteristics of 32 fine-wool sheep from three breeds were analyzed using resequencing.ResultsA total of 1,747,604 CNVs were detected in this study, and 7,228 CNV regions (CNVR) were obtained after merging overlapping CNVs; these regions accounted for 2.17% of the sheep reference genome. The average length of the CNVRs was 4,307.17 bp. “Deletion” events took place more frequently than “duplication” or “both” events. The CNVRs obtained overlapped with previously reported sheep CNVRs to variable extents (4.39%–55.46%). Functional enrichment analysis showed that the CNVR-harboring genes were mainly involved in sensory perception systems, nutrient metabolism processes, and growth and development processes. Furthermore, 1,855 of the CNVRs were associated with 166 quantitative trait loci (QTL), including milk QTLs, carcass QTLs, and health-related QTLs, among others. In addition, the 32 fine-wool sheep were divided into horned and polled groups to analyze for the selective sweep of CNVRs, and it was found that the relaxin family peptide receptor 2 (RXFP2) gene was strongly influenced by selection.ConclusionsIn summary, we constructed a genomic CNV map for Chinese indigenous fine-wool sheep using resequencing, thereby providing a valuable genetic variation resource for sheep genome research, which will contribute to the study of complex traits in sheep.


2018 ◽  
Author(s):  
Jicai Jiang ◽  
John B. Cole ◽  
Yang Da ◽  
Paul M. VanRaden ◽  
Li Ma

AbstractImputation has been routinely used to infer sequence variants in large genotyped populations based on reference populations of sequenced individuals. With increasing numbers of animals sequenced and the implementation of the 1000 Bull Genomes Project, fine-mapping of causal variants for complex traits is becoming possible in cattle. Using 404 ancestor bull sequences as reference, we imputed over 3 million selected sequence variants to 27,214 Holstein bulls with highly reliable phenotypes (breeding values) for 35 production, reproduction, and body conformation traits. We first performed whole-genome single-marker scans for each of the 35 traits using a mixed-model association test. The single-trait association statistics were then merged into multi-trait tests of 3 trait groups: production, reproduction, and body conformation. Both single- and multi-trait GWAS results were used to identify 282 candidate QTL regions for fine-mapping in the cattle genome. To facilitate fast and powerful fine-mapping analyses, we developed a Bayesian Fine-MAPping approach (BFMAP) to integrate fine-mapping with functional enrichment analysis. Our fine-mapping results identified 69 promising candidate genes for dairy traits, including ABCC9, VPS13B, MGST1, SCD, MKL1, and CSN1S1 for production traits; CHEK2, GC, and KALRN for reproduction traits; and TMTC2, ARRDC3, ZNF613, CCND2, and FGF6 for body conformation traits. Based on existing functional annotation data for cattle, we revealed biologically meaningful enrichment in our fine-mapped variants that can be readily tested in functional validation studies. In summary, these results demonstrated the utility of a fast Bayesian approach for fine-mapping and functional enrichment analysis, identified candidate causative genes and variants, and enhanced our understanding of the genetic basis of complex traits in dairy cattle.


2019 ◽  
Vol 14 (7) ◽  
pp. 591-601 ◽  
Author(s):  
Aravind K. Konda ◽  
Parasappa R. Sabale ◽  
Khela R. Soren ◽  
Shanmugavadivel P. Subramaniam ◽  
Pallavi Singh ◽  
...  

Background: Chickpea is a nutritional rich premier pulse crop but its production encounters setbacks due to various stresses and understanding of molecular mechanisms can be ascribed foremost importance. Objective: The investigation was carried out to identify the differentially expressed WRKY TFs in chickpea in response to herbicide stress and decipher their interacting partners. Methods: For this purpose, transcriptome wide identification of WRKY TFs in chickpea was done. Behavior of the differentially expressed TFs was compared between other stress conditions. Orthology based cofunctional gene networks were derived from Arabidopsis. Gene ontology and functional enrichment analysis was performed using Blast2GO and STRING software. Gene Coexpression Network (GCN) was constructed in chickpea using publicly available transcriptome data. Expression pattern of the identified gene network was studied in chickpea-Fusarium interactions. Results: A unique WRKY TF (Ca_08086) was found to be significantly (q value = 0.02) upregulated not only under herbicide stress but also in other stresses. Co-functional network of 14 genes, namely Ca_08086, Ca_19657, Ca_01317, Ca_20172, Ca_12226, Ca_15326, Ca_04218, Ca_07256, Ca_14620, Ca_12474, Ca_11595, Ca_15291, Ca_11762 and Ca_03543 were identified. GCN revealed 95 hub genes based on the significant probability scores. Functional annotation indicated role in callose deposition and response to chitin. Interestingly, contrasting expression pattern of the 14 network genes was observed in wilt resistant and susceptible chickpea genotypes, infected with Fusarium. Conclusion: This is the first report of identification of a multi-stress responsive WRKY TF and its associated GCN in chickpea.


BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Zhenyang Liao ◽  
Xunxiao Zhang ◽  
Shengcheng Zhang ◽  
Zhicong Lin ◽  
Xingtan Zhang ◽  
...  

Abstract Background Structural variations (SVs) are a type of mutations that have not been widely detected in plant genomes and studies in animals have shown their role in the process of domestication. An in-depth study of SVs will help us to further understand the impact of SVs on the phenotype and environmental adaptability during papaya domestication and provide genomic resources for the development of molecular markers. Results We detected a total of 8083 SVs, including 5260 deletions, 552 tandem duplications and 2271 insertions with deletion being the predominant, indicating the universality of deletion in the evolution of papaya genome. The distribution of these SVs is non-random in each chromosome. A total of 1794 genes overlaps with SV, of which 1350 genes are expressed in at least one tissue. The weighted correlation network analysis (WGCNA) of these expressed genes reveals co-expression relationship between SVs-genes and different tissues, and functional enrichment analysis shows their role in biological growth and environmental responses. We also identified some domesticated SVs genes related to environmental adaptability, sexual reproduction, and important agronomic traits during the domestication of papaya. Analysis of artificially selected copy number variant genes (CNV-genes) also revealed genes associated with plant growth and environmental stress. Conclusions SVs played an indispensable role in the process of papaya domestication, especially in the reproduction traits of hermaphrodite plants. The detection of genome-wide SVs and CNV-genes between cultivated gynodioecious populations and wild dioecious populations provides a reference for further understanding of the evolution process from male to hermaphrodite in papaya.


Open Medicine ◽  
2020 ◽  
Vol 15 (1) ◽  
pp. 672-688
Author(s):  
Yanbo Dong ◽  
Siyu Lu ◽  
Zhenxiao Wang ◽  
Liangfa Liu

AbstractThe chaperonin-containing T-complex protein 1 (CCT) subunits participate in diverse diseases. However, little is known about their expression and prognostic values in human head and neck squamous cancer (HNSC). This article aims to evaluate the effects of CCT subunits regarding their prognostic values for HNSC. We mined the transcriptional and survival data of CCTs in HNSC patients from online databases. A protein–protein interaction network was constructed and a functional enrichment analysis of target genes was performed. We observed that the mRNA expression levels of CCT1/2/3/4/5/6/7/8 were higher in HNSC tissues than in normal tissues. Survival analysis revealed that the high mRNA transcriptional levels of CCT3/4/5/6/7/8 were associated with a low overall survival. The expression levels of CCT4/7 were correlated with advanced tumor stage. And the overexpression of CCT4 was associated with higher N stage of patients. Validation of CCTs’ differential expression and prognostic values was achieved by the Human Protein Atlas and GEO datasets. Mechanistic exploration of CCT subunits by the functional enrichment analysis suggests that these genes may influence the HNSC prognosis by regulating PI3K-Akt and other pathways. This study implies that CCT3/4/6/7/8 are promising biomarkers for the prognosis of HNSC.


2021 ◽  
Vol 28 (1) ◽  
pp. 20-33
Author(s):  
Lydia-Eirini Giannakou ◽  
Athanasios-Stefanos Giannopoulos ◽  
Chrissi Hatzoglou ◽  
Konstantinos I. Gourgoulianis ◽  
Erasmia Rouka ◽  
...  

Haemophilus influenzae (Hi), Moraxella catarrhalis (MorCa) and Pseudomonas aeruginosa (Psa) are three of the most common gram-negative bacteria responsible for human respiratory diseases. In this study, we aimed to identify, using the functional enrichment analysis (FEA), the human gene interaction network with the aforementioned bacteria in order to elucidate the full spectrum of induced pathogenicity. The Human Pathogen Interaction Database (HPIDB 3.0) was used to identify the human proteins that interact with the three pathogens. FEA was performed via the ToppFun tool of the ToppGene Suite and the GeneCodis database so as to identify enriched gene ontologies (GO) of biological processes (BP), cellular components (CC) and diseases. In total, 11 human proteins were found to interact with the bacterial pathogens. FEA of BP GOs revealed associations with mitochondrial membrane permeability relative to apoptotic pathways. FEA of CC GOs revealed associations with focal adhesion, cell junctions and exosomes. The most significantly enriched annotations in diseases and pathways were lung adenocarcinoma and cell cycle, respectively. Our results suggest that the Hi, MorCa and Psa pathogens could be related to the pathogenesis and/or progression of lung adenocarcinoma via the targeting of the epithelial cellular junctions and the subsequent deregulation of the cell adhesion and apoptotic pathways. These hypotheses should be experimentally validated.


AMB Express ◽  
2020 ◽  
Vol 10 (1) ◽  
Author(s):  
Zhiyong Liu ◽  
Kai Dang ◽  
Cunzhi Li ◽  
Junhong Gao ◽  
Hong Wang ◽  
...  

Abstract Hexanitrohexaazaisowurtzitane (CL-20) is a compound with a polycyclic cage and an N-nitro group that has been shown to play an unfavorable role in environmental fate, biosafety, and physical health. The aim of this study was to isolate the microbial community and to identify a single microbial strain that can degrade CL-20 with desirable efficiency. Metagenomic sequencing methods were performed to investigate the dynamic changes in the composition of the community diversity. The most varied genus among the microbial community was Pseudomonas, which increased from 1.46% to 44.63% during the period of incubation (MC0–MC4). Furthermore, the new strain was isolated and identified from the activated sludge by bacterial morphological and 16s rRNA sequencing analyses. The CL-20 concentrations decreased by 75.21 μg/mL and 74.02 μg/mL in 48 h by MC4 and Pseudomonas sp. ZyL-01, respectively. Moreover, ZyL-01 could decompose 98% CL-20 of the real effluent in 14 day’s incubation with the glucose as carbon source. Finally, a draft genome sequence was obtained to predict possible degrading enzymes involved in the biodegradation of CL-20. Specifically, 330 genes that are involved in energy production and conversion were annotated by Gene Ontology functional enrichment analysis, and some of these candidates may encode enzymes that are responsible for CL-20 degradation. In summary, our studies indicate that microbes might be a valuable biological resource for the treatment of environmental contamination caused by CL-20 and that Pseudomonas sp. ZyL-01 might be a promising candidate for eradicating CL-20 to achieve a more biosafe environment and improve public health.


Sign in / Sign up

Export Citation Format

Share Document