scholarly journals Development and comparison of RNA-sequencing pipelines for more accurate SNP identification: practical example of functional SNP detection associated with feed efficiency in Nellore beef cattle

BMC Genomics ◽  
2020 ◽  
Vol 21 (1) ◽  
Author(s):  
S. Lam ◽  
J. Zeidan ◽  
F. Miglior ◽  
A. Suárez-Vega ◽  
I. Gómez-Redondo ◽  
...  

Abstract Background Optimization of an RNA-Sequencing (RNA-Seq) pipeline is critical to maximize power and accuracy to identify genetic variants, including SNPs, which may serve as genetic markers to select for feed efficiency, leading to economic benefits for beef production. This study used RNA-Seq data (GEO Accession ID: PRJEB7696 and PRJEB15314) from muscle and liver tissue, respectively, from 12 Nellore beef steers selected from 585 steers with residual feed intake measures (RFI; n = 6 low-RFI, n = 6 high-RFI). Three RNA-Seq pipelines were compared including multi-sample calling from i) non-merged samples; ii) merged samples by RFI group, iii) merged samples by RFI and tissue group. The RNA-Seq reads were aligned against the UMD3.1 bovine reference genome (release 94) assembly using STAR aligner. Variants were called using BCFtools and variant effect prediction (VeP) and functional annotation (ToppGene) analyses were performed. Results On average, total reads detected for Approach i) non-merged samples for liver and muscle, were 18,362,086.3 and 35,645,898.7, respectively. For Approach ii), merging samples by RFI group, total reads detected for each merged group was 162,030,705, and for Approach iii), merging samples by RFI group and tissues, was 324,061,410, revealing the highest read depth for Approach iii). Additionally, Approach iii) merging samples by RFI group and tissues, revealed the highest read depth per variant coverage (572.59 ± 3993.11) and encompassed the majority of localized positional genes detected by each approach. This suggests Approach iii) had optimized detection power, read depth, and accuracy of SNP calling, therefore increasing confidence of variant detection and reducing false positive detection. Approach iii) was then used to detect unique SNPs fixed within low- (12,145) and high-RFI (14,663) groups. Functional annotation of SNPs revealed positional candidate genes, for each RFI group (2886 for low-RFI, 3075 for high-RFI), which were significantly (P < 0.05) associated with immune and metabolic pathways. Conclusion The most optimized RNA-Seq pipeline allowed for more accurate identification of SNPs, associated positional candidate genes, and significantly associated metabolic pathways in muscle and liver tissues, providing insight on the underlying genetic architecture of feed efficiency in beef cattle.

2020 ◽  
Author(s):  
S. Lam ◽  
J. Zeidan ◽  
F. Miglior ◽  
A. Suárez-Vega ◽  
I. Gómez-Redondo ◽  
...  

Abstract Optimization of an RNA-Sequencing (RNA-Seq) pipeline can maximize power and accuracy for identifying genetic variants, including SNPs, which may serve as genetic markers to select for feed efficiency, leading to economic benefits for beef production. This study determined an optimized pipeline for variant detection using a dataset with multiple samples and tissues. The RNA-Seq data (GEO Accession ID: PRJEB7696 and PRJEB15314) from muscle and liver tissue, respectively, from 12 Nellore beef steers selected from 585 steers with residual feed intake measures (RFI; n=6 low-RFI, n=6 high-RFI) were used. Three RNA-Seq pipelines were compared including multi-sample calling from i) non-merged samples; ii) merged samples by group for low-RFI and for high-RFI for each tissue, iii) merged samples by group and tissue for low- and high-RFI for both tissues. The RNA-Seq reads were aligned against the UMD3.1 bovine reference genome (release 94) assembly using STAR. Variants were called using BCFtools and variant effect prediction (VeP) and functional annotation (ToppGene) analyses were performed. Approaches were compared by comparing read depth, overlap of SNP detection results, and following SNP annotation for positional candidate genes. On average, total reads detected for Approach i) individual liver and muscle samples were 18,362,086.3 and 35,645,898.7, respectively. For Approach ii), total reads detected for each merged group of samples was 162,030,705, and for Approach iii) was 324,061,410, revealing the highest read depth. Additionally, Approach iii) encompassed the majority of localized positional genes detected by each approach, suggesting Approach iii) be applied to maximize detection power, read depth, and accuracy of SNP calling, therefore increasing confidence of variant detection and reducing false positive rate. Approach iii) was used to detect unique SNPs fixed within low- (12,145) and high-RFI (14,663) groups. Annotation of moderate to high functional impact SNPs revealed co-localized positional candidate genes for each RFI group (2,886 for low-RFI, 3,075 for high-RFI), which were significantly (P<0.05) associated with immune and metabolism pathways. The most optimized RNA-Seq pipeline allowed for more accurate identification of SNP, associated positional candidate genes, and associated metabolic pathways in muscle and liver tissues, providing insight on the genetic architecture of feed efficiency in beef cattle.


2020 ◽  
Vol 10 (1) ◽  
Author(s):  
H. Sweett ◽  
P. A. S. Fonseca ◽  
A. Suárez-Vega ◽  
A. Livernois ◽  
F. Miglior ◽  
...  

AbstractFertility plays a key role in the success of calf production, but there is evidence that reproductive efficiency in beef cattle has decreased during the past half-century worldwide. Therefore, identifying animals with superior fertility could significantly impact cow-calf production efficiency. The objective of this research was to identify candidate regions affecting bull fertility in beef cattle and positional candidate genes annotated within these regions. A GWAS using a weighted single-step genomic BLUP approach was performed on 265 crossbred beef bulls to identify markers associated with scrotal circumference (SC) and sperm motility (SM). Eight windows containing 32 positional candidate genes and five windows containing 28 positional candidate genes explained more than 1% of the genetic variance for SC and SM, respectively. These windows were selected to perform gene annotation, QTL enrichment, and functional analyses. Functional candidate gene prioritization analysis revealed 14 prioritized candidate genes for SC of which MAP3K1 and VIP were previously found to play roles in male fertility. A different set of 14 prioritized genes were identified for SM and five were previously identified as regulators of male fertility (SOD2, TCP1, PACRG, SPEF2, PRLR). Significant enrichment results were identified for fertility and body conformation QTLs within the candidate windows. Gene ontology enrichment analysis including biological processes, molecular functions, and cellular components revealed significant GO terms associated with male fertility. The identification of these regions contributes to a better understanding of fertility associated traits and facilitates the discovery of positional candidate genes for future investigation of causal mutations and their implications.


2021 ◽  
Vol 12 ◽  
Author(s):  
Chaoyun Yang ◽  
Liyun Han ◽  
Peng Li ◽  
Yanling Ding ◽  
Yun Zhu ◽  
...  

Residual feed intake (RFI) is an important measure of feed efficiency for agricultural animals. Factors associated with cattle RFI include physiology, dietary factors, and the environment. However, a precise genetic mechanism underlying cattle RFI variations in duodenal tissue is currently unavailable. The present study aimed to identify the key genes and functional pathways contributing to variance in cattle RFI phenotypes using RNA sequencing (RNA-seq). Six bulls with extremely high or low RFIs were selected for detecting differentially expressed genes (DEGs) by RNA-seq, followed by conducting GO, KEGG enrichment, protein-protein interaction (PPI), and co-expression network (WGCNA, n = 10) analysis. A total of 380 differentially expressed genes was obtained from high and low RFI groups, including genes related to energy metabolism (ALDOA, HADHB, INPPL1), mitochondrial function (NDUFS1, RFN4, CUL1), and feed intake behavior (CCK). Two key sub-networks and 26 key genes were detected using GO analysis of DEGs and PPI analysis, such as TPM1 and TPM2, which are involved in mitochondrial pathways and protein synthesis. Through WGCNA, a gene network was built, and genes were sorted into 27 modules, among which the blue (r = 0.72, p = 0.03) and salmon modules (r = −0.87, p = 0.002) were most closely related with RFI. DEGs and genes from the main sub-networks and closely related modules were largely involved in metabolism; oxidative phosphorylation; glucagon, ribosome, and N-glycan biosynthesis, and the MAPK and PI3K-Akt signaling pathways. Through WGCNA, five key genes, including FN1 and TPM2, associated with the biological regulation of oxidative processes and skeletal muscle development were identified. Taken together, our data suggest that the duodenum has specific biological functions in regulating feed intake. Our findings provide broad-scale perspectives for identifying potential pathways and key genes involved in the regulation of feed efficiency in beef cattle.


2017 ◽  
Vol 83 (9) ◽  
Author(s):  
Fuyong Li ◽  
Le Luo Guan

ABSTRACT Exploring compositional and functional characteristics of the rumen microbiome can improve the understanding of its role in rumen function and cattle feed efficiency. In this study, we applied metatranscriptomics to characterize the active rumen microbiomes of beef cattle with different feed efficiencies (efficient, n = 10; inefficient, n = 10) using total RNA sequencing. Active bacterial and archaeal compositions were estimated based on 16S rRNAs, and active microbial metabolic functions including carbohydrate-active enzymes (CAZymes) were assessed based on mRNAs from the same metatranscriptomic data sets. In total, six bacterial phyla (Proteobacteria, Firmicutes, Bacteroidetes, Spirochaetes, Cyanobacteria, and Synergistetes), eight bacterial families (Succinivibrionaceae, Prevotellaceae, Ruminococcaceae, Lachnospiraceae, Veillonellaceae, Spirochaetaceae, Dethiosulfovibrionaceae, and Mogibacteriaceae), four archaeal clades (Methanomassiliicoccales, Methanobrevibacter ruminantium, Methanobrevibacter gottschalkii, and Methanosphaera), 112 metabolic pathways, and 126 CAZymes were identified as core components of the active rumen microbiome. As determined by comparative analysis, three bacterial families (Lachnospiraceae, Lactobacillaceae, and Veillonellaceae) tended to be more abundant in low-feed-efficiency (inefficient) animals (P < 0.10), and one archaeal taxon (Methanomassiliicoccales) tended to be more abundant in high-feed-efficiency (efficient) cattle (P < 0.10). Meanwhile, 32 microbial metabolic pathways and 12 CAZymes were differentially abundant (linear discriminant analysis score of >2 with a P value of <0.05) between two groups. Among them, 30 metabolic pathways and 11 CAZymes were more abundant in the rumen of inefficient cattle, while 2 metabolic pathways and 1 CAZyme were more abundant in efficient animals. These findings suggest that the rumen microbiomes of inefficient cattle have more diverse activities than those of efficient cattle, which may be related to the host feed efficiency variation. IMPORTANCE This study applied total RNA-based metatranscriptomics and showed the linkage between the active rumen microbiome and feed efficiency (residual feed intake) in beef cattle. The data generated from the current study provide fundamental information on active rumen microbiome at both compositional and functional levels, which serve as a foundation to study rumen function and its role in cattle feed efficiency. The findings that the active rumen microbiome may contribute to variations in feed efficiency of beef cattle highlight the possibility of enhancing nutrient utilization and improve cattle feed efficiency through modification of rumen microbial functions.


2009 ◽  
Vol 87 (8) ◽  
pp. 2475-2484 ◽  
Author(s):  
E. Marques ◽  
J. D. Nkrumah ◽  
E. L. Sherman ◽  
S. S. Moore

2019 ◽  
Vol 97 (Supplement_3) ◽  
pp. 168-169
Author(s):  
Peter Carmona ◽  
Luis Silva ◽  
Diogo Fleury Azevedo Costa ◽  
Lais Lima

Abstract Nutrition for a positive growth path represents the major cost of any beef cattle enterprise. Improvements on feed efficiency (FE) can lead to significant economic benefits and reduce the environmental footprint. Usually, animals selected for FE on high-protein (HP) diets are expected to perform as efficiently on low-protein (LP) diets. This experiment used 30 Bos indicus steers (398 ± 24 kg BW) to determine the agreement between FE rankings of beef cattle fed a LP or a HP diet. As hypothesis, it was suggested that the agreement would be high. A completely randomized block design was used, where each steer represented an experimental unit. Steers were fed in individual pens for two periods of 70 days, including an adaptation of 10 days, with diets supplying either 70% or 100% of their rumen degradable protein requirements. Average daily gain (ADG) and dry matter intake (DMI) were measured, while residual feed intake (RFI) and residual gain (RG) were calculated. Kappa analysis was used to determine the agreement between FE of both diets. In the LP diet, ADG was 0.93 kg/d (0.38 to 1.47), DMI averaged 9.67 kg/d (7.9 to 12.1), RFI varied between -1.55 and 1.84, and RG from -0.61 to 0.53. In the HP diet, ADG was 1.16 kg/d (0.77 to 1.57) and DMI averaged 9.87 kg/d (4.79 to 11.87). RFI varied between -2.53 and 1.61 and RG from -0.34 to 0.33. Chance-corrected analysis of the ranking between diets showed no agreement for RFI (Kappa=5.6%, P = 0.68) nor for RG (Kappa=9.1%, P = 0.44). These results suggest that different physiological mechanisms are responsible for FE regulation in both diets; thus, appropriate diets targeting each scenario must be used when selecting animals for feed efficiency.


2013 ◽  
Vol 93 (3) ◽  
pp. 295-306 ◽  
Author(s):  
B. K. Karisa ◽  
J. Thomson ◽  
Z. Wang ◽  
H. L. Bruce ◽  
G. S. Plastow ◽  
...  

Karisa, B. K., Thomson, J., Wang, Z., Bruce, H. L., Plastow, G. S. and Moore, S. S. 2013. Candidate genes and biological pathways associated with carcass quality traits in beef cattle. Can. J. Anim. Sci. 93: 295–306. The objective of this study was to use the candidate gene approach to identify the genes associated with carcass quality traits in beef cattle steers at the University of Alberta Ranch at Kinsella, Canada. This approach involved identifying positional candidate genes and prioritizing them according to their functions into functional candidate genes before performing statistical association analysis. The positional candidate genes and single nucleotide polymorphisms (SNP) were identified from previously reported quantitative trait loci for component traits including body weight, average daily gain, metabolic weight, feed efficiency and energy balance. Positional candidate genes were then prioritized into functional candidate genes according to the associated gene ontology terms and their functions. A total of 116 genes were considered functional candidate genes and 117 functional SNPs were genotyped and used for multiple marker association analysis using ASReml®. Seven SNPs were significantly associated with various carcass quality traits (P≤0.005). The significant genes were associated with biological processes such as fat, glucose, protein and steroid metabolism, growth, energy utilization and DNA transcription and translation as inferred from the protein knowledgebase (UniprotKB). Gene network analysis indicated significant involvement of biological processes related to fat and steroid metabolism and regulation of transcription and translation of DNA.


2021 ◽  
Author(s):  
Lili Du ◽  
Tianpeng Chang ◽  
Bingxing An ◽  
Mang Liang ◽  
Xinghai Duan ◽  
...  

Abstract Water holding capacity (WHC) is an important sensory attribute that greatly influences meat quality. However, the molecular mechanism that regulates the beef WHC remains to be elucidated. In this study, the longissimus dorsi (LD) muscles of 49 Chinese Simmental beef cattle were subjected to RNA sequencing (RNA-seq), among which eight individuals with the highest WHC (H-WHC) and the lowest WHC (L-WHC) were selected for transcriptome analysis. A total of 1256 genes were identified as differentially expressed genes (DEGs) between two groups, of which 948 genes were up-regulated and 308 genes were down-regulated. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment revealed that DEGs were significantly enriched in 24 GO terms and 78 pathways. Additionally, based on protein-protein interaction (PPI) network, animal QTL database (QTLdb), and relevant literature, the study not only confirmed seven genes (HSPA12A, HSPA13, PPARg, MYH10, MYL2, MYPN, and TPI1) influenced WHC in accordance with previous studies, but also identified six genes (ITGAV, FGF2, THBS1, DCN, COL4A1, and TGFBR1) as the most promising novel candidate genes affecting the WHC. These findings could offer important insight for exploring the molecular mechanism underlying the WHC trait and facilitate the improvement of beef quality.


Sign in / Sign up

Export Citation Format

Share Document