relationship matrix Latest Research Papers

Genetic and genomic characterization followed by single-step genomic evaluation of withers height in German Warmblood horses

Journal of Applied Genetics ◽

10.1007/s13353-021-00681-w ◽

2022 ◽

Author(s):

Sarah Vosgerau ◽

Nina Krattenmacher ◽

Clemens Falker-Gieske ◽

Anita Seidel ◽

Jens Tetens ◽

...

Keyword(s):

Genome Wide Association Study ◽

Reference Population ◽

Single Step ◽

Relationship Matrix ◽

Nucleotide Polymorphisms ◽

Step Method ◽

Genomic Breeding ◽

Breeding Values ◽

A Genome ◽

Estimation System

Abstract Reliability of genomic predictions is influenced by the size and genetic composition of the reference population. For German Warmblood horses, compilation of a reference population has been enabled through the cooperation of five German breeding associations. In this study, preliminary data from this joint reference population were used to genetically and genomically characterize withers height and to apply single-step methodology for estimating genomic breeding values for withers height. Using data on 2113 mares and their genomic information considering about 62,000 single nucleotide polymorphisms (SNPs), analysis of the genomic relationship revealed substructures reflecting breed origin and different breeding goals of the contributing breeding associations. A genome-wide association study confirmed a known quantitative trait locus (QTL) for withers height on equine chromosome (ECA) 3 close to LCORL and identified a further significant peak on ECA 1. Using a single-step approach with a combined relationship matrix, the estimated heritability for withers height was 0.31 (SE = 0.08) and the corresponding genomic breeding values ranged from − 2.94 to 2.96 cm. A mean reliability of 0.38 was realized for these breeding values. The analyses of withers height showed that compiling a reference population across breeds is a suitable strategy for German Warmblood horses. The single-step method is an appealing approach for practical genomic prediction in horses, because not many genotypes are available yet and animals without genotypes can by this way directly contribute to the estimation system.

The genomic structure of isolation across breed, country and strain for important South African and Australian sheep populations

BMC Genomics ◽

10.1186/s12864-021-08020-3 ◽

2022 ◽

Vol 23 (1) ◽

Author(s):

Cornelius Nel ◽

Phillip Gurman ◽

Andrew Swan ◽

Julius van der Werf ◽

Margaretha Snyman ◽

...

Keyword(s):

South Africa ◽

South African ◽

Genomic Structure ◽

Genomic Relationship Matrix ◽

Relationship Matrix ◽

Effective Population ◽

Absolute Level ◽

Genomic Resources ◽

Sheep Breeds ◽

Population Structures

Abstract Background South Africa and Australia shares multiple important sheep breeds. For some of these breeds, genomic breeding values are provided to breeders in Australia, but not yet in South Africa. Combining genomic resources could facilitate development for across country selection, but the influence of population structures could be important to the compatability of genomic data from varying origins. The genetic structure within and across breeds, countries and strains was evaluated in this study by population genomic parameters derived from SNP-marker data. Populations were first analysed by breed and country of origin and then by subpopulations of South African and Australian Merinos. Results Mean estimated relatedness according to the genomic relationship matrix varied by breed (-0.11 to 0.16) and bloodline (-0.08 to 0.06) groups and depended on co-ancestry as well as recent genetic links. Measures of divergence across bloodlines (FST: 0.04–0.12) were sometimes more distant than across some breeds (FST: 0.05–0.24), but the divergence of common breeds from their across-country equivalents was weak (FST: 0.01–0.04). According to mean relatedness, FST, PCA and Admixture, the Australian Ultrafine line was better connected to the SA Cradock Fine Wool flock than with other AUS bloodlines. Levels of linkage disequilibrium (LD) between adjacent markers was generally low, but also varied across breeds (r2: 0.14–0.22) as well as bloodlines (r2: 0.15–0.19). Patterns of LD decay was also unique to breeds, but bloodlines differed only at the absolute level. Estimates of effective population size (Ne) showed genetic diversity to be high for the majority of breeds (Ne: 128–418) but also for bloodlines (Ne: 137–369). Conclusions This study reinforced the genetic complexity and diversity of important sheep breeds, especially the Merino breed. The results also showed that implications of isolation can be highly variable and extended beyond breed structures. However, knowledge of useful links across these population substructures allows for a fine-tuned approach in the combination of genomic resources. Isolation across country rarely proved restricting compared to other structures considered. Consequently, research into the accuracy of across-country genomic prediction is recommended.

Preprocessing Tools for Data Preparation

Multivariate Statistical Machine Learning Methods for Genomic Prediction ◽

10.1007/978-3-030-89010-0_2 ◽

2022 ◽

pp. 35-70

Author(s):

Osval Antonio Montesinos López ◽

Abelardo Montesinos López ◽

Jose Crossa

Keyword(s):

Machine Learning ◽

Mixed Model ◽

Linear Mixed Model ◽

Genomic Relationship Matrix ◽

Relationship Matrix ◽

Data Preparation ◽

Statistical Machine Learning ◽

Major Allele Frequency ◽

Major Allele ◽

Best Linear Unbiased

AbstractThis data preparation chapter is of paramount importance for implementing statistical machine learning methods for genomic selection. We present the basic linear mixed model that gives rise to BLUE and BLUP and explain how to decide when to use fixed or random effects that give rise to best linear unbiased estimates (BLUE or BLUEs) and best linear unbiased predictors (BLUP or BLUPs). The R codes for fitting linear mixed model for the data are given in small examples. We emphasize tools for computing BLUEs and BLUPs for many linear combinations of interest in genomic-enabled prediction and plant breeding. We present tools for cleaning, imputing, and detecting minor and major allele frequency computation, marker recodification, frequency of heterogeneous, frequency of NAs, and three methods for computing the genomic relationship matrix. In addition, scaling and data compression of inputs are important in statistical machine learning. For a more extensive description of linear mixed models, see Chap. 10.1007/978-3-030-89010-0_5.

Interest of phenomic prediction as an alternative to genomic prediction in grapevine

10.1101/2021.12.16.472608 ◽

2021 ◽

Author(s):

Charlotte Brault ◽

Juliette Lazerges ◽

Agnès Doligez ◽

Miguel Thomas ◽

Martin Ecarnot ◽

...

Keyword(s):

Genomic Prediction ◽

Reflectance Spectrum ◽

Predictive Ability ◽

Relationship Matrix ◽

Diversity Panel ◽

Cereal Species ◽

Genetic Signal ◽

First Time ◽

Half Diallel ◽

Two Populations

Phenomic prediction has been defined as an alternative to genomic prediction by using spectra instead of molecular markers. A reflectance spectrum reflects the biochemical composition within a tissue, under genetic determinism. Thus, a relationship matrix built from spectra could potentially capture genetic signal. This new methodology has been successfully applied in several cereal species but little is known so far about its interest in perennial species. Besides, phenomic prediction has only been tested for a restricted set of traits, mainly related to yield or phenology. This study aims at applying phenomic prediction for the first time in grapevine, using spectra collected on two tissues and over two consecutive years, on two populations and for 15 traits. First, we characterized the genetic signal in spectra and under which condition it could be maximized, then phenomic predictive ability was compared to genomic predictive ability. We found that the co-inertia between spectra and genomic data was stable across tissues or years, but variable across populations, with co-inertia around 0.3 and 0.6 for diversity panel and half-diallel populations, respectively. Differences between populations were also observed for predictive ability of phenomic prediction, with an average of 0.27 for the diversity panel and 0.35 for the half-diallel. For both populations, there was a correlation across traits between predictive ability of genomic and phenomic prediction, with a slope around 1 and an intercept of -0.2, thus suggesting that phenomic prediction could be applied for any trait.

Efficient approximation of reliabilities for single-step genomic BLUP models with the Algorithm for Proven and Young

Journal of Animal Science ◽

10.1093/jas/skab353 ◽

2021 ◽

Author(s):

M Bermann ◽

D Lourenco ◽

I Misztal

Keyword(s):

Mixed Model ◽

Low Cost ◽

Single Step ◽

Reliability Estimation ◽

Genomic Relationship Matrix ◽

Relationship Matrix ◽

Multiple Trait ◽

Left Hand ◽

Computational Performance ◽

Estimated Breeding Values

Abstract The objectives of this study were to develop an efficient algorithm for calculating prediction error variances (PEV) for GBLUP models using the Algorithm for Proven and Young (APY), extend it to single-step GBLUP (ssGBLUP), and to apply this algorithm for approximating the theoretical reliabilities for single and multiple trait models in ssGBLUP. The PEV with APY was calculated by block-sparse inversion, efficiently exploiting the sparse structure of the inverse of the genomic relationship matrix with APY. Single-step GBLUP reliabilities were approximated by combining reliabilities with and without genomic information in terms of effective record contributions. Multi-trait reliabilities relied on single-trait results adjusted using the genetic and residual covariance matrices among traits. Tests involved two datasets provided by the American Angus Association. A small dataset (Data1) was used for comparing the approximated reliabilities with the reliabilities obtained by the inversion of the left-hand side of the mixed model equations. The large dataset (Data2) was used for evaluating the computational performance of the algorithm. Analyses with both datasets used single-trait and three-trait models. The number of animals in the pedigree ranged from 167,951 in Data1 to 10,213,401 in Data2, with 50,000 and 20,000 genotyped animals for single-trait and multiple trait-analysis, respectively, in Data1 and 335,325 in Data2. Correlations between estimated and exact reliabilities obtained by inversion ranged from 0.97 to 0.99, whereas the intercept and slope of the regression of the exact on the approximated reliabilities ranged from 0.00 to 0.04 and from 0.93 to 1.05, respectively. For the three-trait model with the largest dataset (Data2), the elapsed time for the reliability estimation was eleven minutes. The computational complexity of the proposed algorithm increased linearly with the number of genotyped animals and with the number of traits in the model. This algorithm can efficiently approximate the theoretical reliability of genomic estimated breeding values in ssGBLUP with APY for large numbers of genotyped animals at a low cost.

Simulation of pedigree vs. fully-informative marker based relationships matrices in a loblolly pine breeding population

10.1101/2021.11.30.468863 ◽

2021 ◽

Author(s):

Adam R Festa ◽

Ross Whetten

Keyword(s):

Inbreeding Depression ◽

Loblolly Pine ◽

Genetic Gain ◽

Field Experiments ◽

Breeding Value ◽

Genomic Relationship Matrix ◽

Relationship Matrix ◽

Genomic Relationship ◽

Deleterious Alleles

Computer simulations of breeding strategies are an essential resource for tree breeders because they allow exploratory analyses into potential long-term impacts on genetic gain and inbreeding consequences without bearing the cost, time, or resource requirements of field experiments. Previous work has modeled the potential long-term implications on inbreeding and genetic gain using random mating and phenotypic selection. Reduction in sequencing costs has enabled the use of DNA marker-based relationship matrices in addition to or in place of pedigree-based allele sharing estimates; this has been shown to provide a significant increase in the accuracy of progeny breeding value prediction. A potential pitfall of genomic selection using genetic relationship matrices is increased coancestry among selections, leading to the accumulation of deleterious alleles and inbreeding depression. We used simulation to compare the relative genetic gain and risk of inbreeding depression within a breeding program similar to loblolly pine, utilizing pedigree-based or marker-based relationships over ten generations. We saw a faster rate of purging deleterious alleles when using a genomic relationship matrix based on markers that track identity-by-descent of segments of the genome. Additionally, we observed an increase in the rate of genetic gain when using a genomic relationship matrix instead of a pedigree-based relationship matrix. While the genetic variance of populations decreased more rapidly when using genomic-based relationship matrices as opposed to pedigree-based, there appeared to be no long-term consequences on the accumulation of deleterious alleles within the simulated breeding strategy.

Drug-Target Interaction Prediction Based on Multisource Information Weighted Fusion

Contrast Media & Molecular Imaging ◽

10.1155/2021/6044256 ◽

2021 ◽

Vol 2021 ◽

pp. 1-10

Author(s):

Shuaiqi Liu ◽

Jingjie An ◽

Jie Zhao ◽

Shuhuan Zhao ◽

Hui Lv ◽

...

Keyword(s):

Drug Target ◽

Operating Characteristic ◽

State Of The Art ◽

Characteristic Curve ◽

Relationship Matrix ◽

Target Interaction ◽

Interaction Prediction ◽

Weighted Fusion ◽

Precision Recall Curve ◽

The Relationship

Recently, in most existing studies, it is assumed that there are no interaction relationships between drugs and targets with unknown interactions. However, unknown interactions mean the relationships between drugs and targets have just not been confirmed. In this paper, samples for which the relationship between drugs and targets has not been determined are considered unlabeled. A weighted fusion method of multisource information is proposed to screen drug-target interactions. Firstly, some drug-target pairs which may have interactions are selected. Secondly, the selected drug-target pairs are added to the positive samples, which are regarded as known to have interaction relationships, and the original interaction relationship matrix is revised. Finally, the revised datasets are used to predict the interaction derived from the bipartite local model with neighbor-based interaction profile inferring (BLM-NII). Experiments demonstrate that the proposed method has greatly improved specificity, sensitivity, precision, and accuracy compared with the BLM-NII method. In addition, compared with several state-of-the-art methods, the area under the receiver operating characteristic curve (AUC) and the area under the precision-recall curve (AUPR) of the proposed method are excellent.

An Epigenetic Aging Clock for Cattle Using Portable Sequencing Technology

Frontiers in Genetics ◽

10.3389/fgene.2021.760450 ◽

2021 ◽

Vol 12 ◽

Author(s):

Ben J. Hayes ◽

Loan T. Nguyen ◽

Mehrnush Forutan ◽

Bailey N. Engle ◽

Harrison J. Lamb ◽

...

Keyword(s):

Relationship Matrix ◽

Epigenetic Clock ◽

Absolute Deviation ◽

Linear Unbiased Prediction ◽

Methylation Of Dna ◽

Epigenetic Aging ◽

Oxford Nanopore ◽

Best Linear Unbiased ◽

Oxford Nanopore Technologies ◽

Tail Hair

Extensively grazed cattle are often mustered only once a year. Therefore, birthdates are typically unknown or inaccurate. Birthdates would be useful for deriving important traits (growth rate; calving interval), breed registrations, and making management decisions. Epigenetic clocks use methylation of DNA to predict an individual’s age. An epigenetic clock for cattle could provide a solution to the challenges of industry birthdate recording. Here we derived the first epigenetic clock for tropically adapted cattle using portable sequencing devices from tail hair, a tissue which is widely used in industry for genotyping. Cattle (n = 66) with ages ranging from 0.35 to 15.7 years were sequenced using Oxford Nanopore Technologies MinION and methylation was called at CpG sites across the genome. Sites were then filtered and used to calculate a covariance relationship matrix based on methylation state. Best linear unbiased prediction was used with 10-fold cross validation to predict age. A second methylation relationship matrix was also calculated that contained sites associated with genes used in the dog and human epigenetic clocks. The correlation between predicted age and actual age was 0.71 for all sites and 0.60 for dog and human gene epigenetic clock sites. The mean absolute deviation was 1.4 years for animals aged less than 3 years of age, and 1.5 years for animals aged 3–10 years. This is the first reported epigenetic clock using industry relevant samples in cattle.

Trends of genetic changes uncovered by Env- and Eigen-GWAS in wheat and barley

Theoretical and Applied Genetics ◽

10.1007/s00122-021-03991-z ◽

2021 ◽

Author(s):

Rajiv Sharma ◽

James Cockram ◽

Keith A. Gardner ◽

Joanne Russell ◽

Luke Ramsay ◽

...

Keyword(s):

Genome Structure ◽

Genomic Relationship Matrix ◽

Relationship Matrix ◽

Crop Breeding ◽

Phenotypic Data ◽

Genetic Changes ◽

Large Numbers ◽

New Varieties ◽

Environmental Genome ◽

Genomic Regions

Abstract Key message Variety age and population structure detect novel QTL for yield and adaptation in wheat and barley without the need to phenotype. Abstract The process of crop breeding over the last century has delivered new varieties with increased genetic gains, resulting in higher crop performance and yield. However, in many cases, the alleles and genomic regions underpinning this success remain unknown. This is partly due to the difficulty of generating sufficient phenotypic data on large numbers of historical varieties to enable such analyses. Here we demonstrate the ability to circumvent such bottlenecks by identifying genomic regions selected over 100 years of crop breeding using age of a variety as a surrogate for yield. Rather than collecting phenotype data, we deployed ‘environmental genome-wide association scans’ (EnvGWAS) based on variety age in two of the world’s most important crops, wheat and barley, and detected strong signals of selection across both genomes. EnvGWAS identified 16 genomic regions in barley and 10 in wheat with contrasting patterns between spring and winter types of the two crops. To further examine changes in genome structure, we used the genomic relationship matrix of the genotypic data to derive eigenvectors for analysis in EigenGWAS. This detected seven major chromosomal introgressions that contributed to adaptation in wheat. EigenGWAS and EnvGWAS based on variety age avoid costly phenotyping and facilitate the identification of genomic tracts that have been under selection during breeding. Our results demonstrate the potential of using historical cultivar collections coupled with genomic data to identify chromosomal regions under selection and may help guide future plant breeding strategies to maximise the rate of genetic gain and adaptation.

Fuzzy Multicriteria Decision-Making Analysis of Agricultural Product Logistics in Agricultural Economic Management

Mathematical Problems in Engineering ◽

10.1155/2021/1730594 ◽

2021 ◽

Vol 2021 ◽

pp. 1-10

Author(s):

Meizhi Tang ◽

Zhongzheng Wang

Keyword(s):

Decision Making ◽

Supply Chain ◽

Multicriteria Decision Making ◽

Green Supply Chain ◽

Agricultural Product ◽

Relationship Matrix ◽

Supplier Evaluation ◽

Green Logistics ◽

Efficient Operation ◽

Multicriteria Decision

The purpose is to establish a logistics supplier evaluation and selection model for green supply chain, which can effectively take into account the environmental problems, while the enterprise economy is developing. The logistics supplier evaluation index system is established under the background of green logistics based on fuzzy theory and multicriteria decision-making theory. The green competitiveness index is innovatively introduced into the system. Moreover, a fuzzy hybrid multicriteria decision-making method model is established based on the deficiency of the current supplier evaluation and selection methods, so as to select logistics suppliers; finally, the established logistics supplier evaluation and selection method is applied to real enterprises for example verification. The results are as follows. The direct relationship matrix is obtained according to the opinions of the expert group, and the relationship matrix is defuzzified and standardized to obtain the clear value of the secondary index. Finally, the fit between the evaluation results of the five logistics companies and the positive and negative ideal values is calculated. According to the principle of maximum fit, the evaluation priority of the five companies is 4 > 3 > 2 > 5 > 1. Thereby, taking the fuzzy multicriteria decision-making analysis method as the evaluation model has high reference value for the evaluation of green logistics companies, solves the problem of how production and sales enterprises choose green logistics suppliers, fully meets the needs of enterprises and governments that focus on the green degree of suppliers in the supply chain, and can effectively promote the efficient operation of green supply chain management.

relationship matrix
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Genetic and genomic characterization followed by single-step genomic evaluation of withers height in German Warmblood horses

The genomic structure of isolation across breed, country and strain for important South African and Australian sheep populations

Preprocessing Tools for Data Preparation

Interest of phenomic prediction as an alternative to genomic prediction in grapevine

Efficient approximation of reliabilities for single-step genomic BLUP models with the Algorithm for Proven and Young

Simulation of pedigree vs. fully-informative marker based relationships matrices in a loblolly pine breeding population

Drug-Target Interaction Prediction Based on Multisource Information Weighted Fusion

An Epigenetic Aging Clock for Cattle Using Portable Sequencing Technology

Trends of genetic changes uncovered by Env- and Eigen-GWAS in wheat and barley

Fuzzy Multicriteria Decision-Making Analysis of Agricultural Product Logistics in Agricultural Economic Management

Export Citation Format

relationship matrixRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Genetic and genomic characterization followed by single-step genomic evaluation of withers height in German Warmblood horses

The genomic structure of isolation across breed, country and strain for important South African and Australian sheep populations

Preprocessing Tools for Data Preparation

Interest of phenomic prediction as an alternative to genomic prediction in grapevine

Efficient approximation of reliabilities for single-step genomic BLUP models with the Algorithm for Proven and Young

Simulation of pedigree vs. fully-informative marker based relationships matrices in a loblolly pine breeding population

Drug-Target Interaction Prediction Based on Multisource Information Weighted Fusion

An Epigenetic Aging Clock for Cattle Using Portable Sequencing Technology

Trends of genetic changes uncovered by Env- and Eigen-GWAS in wheat and barley

Fuzzy Multicriteria Decision-Making Analysis of Agricultural Product Logistics in Agricultural Economic Management

relationship matrix
Recently Published Documents