Abstract
Background: Copy number variations (CNVs) are important structural variations that can cause significant phenotypic diversity. Reliable CNVs mapping can be achieved by identification of CNVs from different genetic backgrounds. Investigations on the characteristics of overlapping CNV regions (CNVRs) between protein-coding genes (CNV genes) and miRNAs (CNV-miRNAs) can reveal the potential mechanisms of their regulation. Results: In this study, we used 55K SNP arrays to detect CNVs in Duroc purebred pig. A total number of 211 CNVRs were detected with a total length of 118.48 Mb, accounting for 5.23% of the autosomal genome sequence. Of these CNVRs, 32 were gains, 175 losses, and 4 contained both types (loss and gain within the same region). The CNVRs we detected were non-randomly distributed in the swine genome and were significantly enriched in the segmental duplication and gene density region. Additionally, these CNVRs were overlapping with 1096 protein-coding genes, and 39 miRNAs, respectively. The CNV genes were enriched in terms of dosage-sensitive gene list. The expression of the CNV genes was significantly higher than that of the non-CNV genes in the adult Duroc liver and prostate. Of all detected CNV genes, 252 genes, which accounted for 22.99%, were tissue-specific (TSI > 0.9). Strong negative selection had been underway in the CNV genes as the ones that were located entirely within the loss CNVRs appeared to be evolving rapidly as determined by the median dN plus dS values. Non-CNV genes tended to be miRNA target than CNV genes. Furthermore, CNV-miRNAs tended to target more genes compared to non-CNV-miRNAs, and a combination of two CNV-miRNAs preferentially synergistically regulated the same target genes. We also focused our efforts on examining CNV genes and CNV-miRNAs, which were also involved in the lipid metabolism, including DGAT1, DGAT2, MOGAT2, miR143, miR335, and miRLET7. Conclusions: Our analyses of CNV-genes and CNV-miRNAs provide new insights into the characteristics of CNVRs in Duroc purebred population. Further molecular experiments and independent large studies are needed to confirm our findings.