gaussian kernels
Recently Published Documents


TOTAL DOCUMENTS

173
(FIVE YEARS 58)

H-INDEX

17
(FIVE YEARS 4)

Heredity ◽  
2021 ◽  
Author(s):  
Marco Lopez-Cruz ◽  
Yoseph Beyene ◽  
Manje Gowda ◽  
Jose Crossa ◽  
Paulino Pérez-Rodríguez ◽  
...  

AbstractGenomic prediction models are often calibrated using multi-generation data. Over time, as data accumulates, training data sets become increasingly heterogeneous. Differences in allele frequency and linkage disequilibrium patterns between the training and prediction genotypes may limit prediction accuracy. This leads to the question of whether all available data or a subset of it should be used to calibrate genomic prediction models. Previous research on training set optimization has focused on identifying a subset of the available data that is optimal for a given prediction set. However, this approach does not contemplate the possibility that different training sets may be optimal for different prediction genotypes. To address this problem, we recently introduced a sparse selection index (SSI) that identifies an optimal training set for each individual in a prediction set. Using additive genomic relationships, the SSI can provide increased accuracy relative to genomic-BLUP (GBLUP). Non-parametric genomic models using Gaussian kernels (KBLUP) have, in some cases, yielded higher prediction accuracies than standard additive models. Therefore, here we studied whether combining SSIs and kernel methods could further improve prediction accuracy when training genomic models using multi-generation data. Using four years of doubled haploid maize data from the International Maize and Wheat Improvement Center (CIMMYT), we found that when predicting grain yield the KBLUP outperformed the GBLUP, and that using SSI with additive relationships (GSSI) lead to 5–17% increases in accuracy, relative to the GBLUP. However, differences in prediction accuracy between the KBLUP and the kernel-based SSI were smaller and not always significant.


2021 ◽  
Author(s):  
Peter Feher ◽  
Ádám Madarász ◽  
András Stirling

<div>Theoretical prediction of electronic absorption spectra without input from experiment is no easy feat as it requires addressing all the factors that affect line shapes. In practice, however, the methodologies are limited to treat these ingredients only to a certain extent. Here we present a multiscale protocol that addresses the temperature, solvent and nuclear quantum effects, anharmonicity and reconstruction of the final spectra from the individual transitions. First, QM/MM molecular dynamics is conducted to obtain trajectories of solute-solvent configurations, from which the corresponding quantum corrected ensembles are generated through the Generalized Smoothed Trajectory Analysis (GSTA). The optical spectra of the ensembles are then produced by calculating vertical transitions using TDDFT with implicit solvation. To obtain the final spectral shapes, the stick spectra from TDDFT are convoluted with Gaussian kernels where the half-widths are determined by a statistically motivated strategy. We have tested our method by calculating the UV-vis spectra of a recently discovered acridine photocatalyst in two redox states and evaluated the impact of each step. Nuclear quantization affects the relative peak intensities and widths, which is necessary to reproduce the experimental spectrum. We have also found that using only the optimized geometry of each molecule works surprisingly well if a proper empirical broadening factor is applied. This is explained by the rigidity of the conjugated chromophore moieties of the selected molecules which are mainly responsible for the excitations in the spectra. In contrast, we have also shown that the molecules are flexible enough to feature anharmonicities that impair the Wigner sampling. </div>


2021 ◽  
Author(s):  
Peter Feher ◽  
Ádám Madarász ◽  
András Stirling

<div>Theoretical prediction of electronic absorption spectra without input from experiment is no easy feat as it requires addressing all the factors that affect line shapes. In practice, however, the methodologies are limited to treat these ingredients only to a certain extent. Here we present a multiscale protocol that addresses the temperature, solvent and nuclear quantum effects, anharmonicity and reconstruction of the final spectra from the individual transitions. First, QM/MM molecular dynamics is conducted to obtain trajectories of solute-solvent configurations, from which the corresponding quantum corrected ensembles are generated through the Generalized Smoothed Trajectory Analysis (GSTA). The optical spectra of the ensembles are then produced by calculating vertical transitions using TDDFT with implicit solvation. To obtain the final spectral shapes, the stick spectra from TDDFT are convoluted with Gaussian kernels where the half-widths are determined by a statistically motivated strategy. We have tested our method by calculating the UV-vis spectra of a recently discovered acridine photocatalyst in two redox states and evaluated the impact of each step. Nuclear quantization affects the relative peak intensities and widths, which is necessary to reproduce the experimental spectrum. We have also found that using only the optimized geometry of each molecule works surprisingly well if a proper empirical broadening factor is applied. This is explained by the rigidity of the conjugated chromophore moieties of the selected molecules which are mainly responsible for the excitations in the spectra. In contrast, we have also shown that the molecules are flexible enough to feature anharmonicities that impair the Wigner sampling. </div>


2021 ◽  
Vol 5 (1) ◽  
pp. 1-9
Author(s):  
Pulung Nurtantio Andono ◽  
Eko Hari Rachmawanto

Batik as one of Indonesia's cultural heritages has various types, motifs and colors. A batik may have almost the same motif with a different color or vice versa, therefore it requires a classification of batik motifs. In this study, a printed batik was used with various coastal batik motifs in Central Java. The algorithm for classification is selected Support Vector Machine (SVM) with feature extraction of the Gray Level Co-Occurrence Matrix (GLCM) and Local Binary Pattern (LBP). SVM has the advantage of grouping data with small amounts and short operation times. GLCM as an extractive feature for recognizing batik textures and LBP was chosen to do spot pattern recognition. In the experiment, we have used 160 images of batik motifs which are divided into two, namely 128 training data and 32 testing data. The accuracy results obtained from the SVM, GLCM and LBP algorithms produce 100% accuracy in polyniomial, linear and gaussian kernels with distances at GLCM 1, 3, and 5, where at a distance of 1 linear kernel is 78.1%, gaussian 93.7%. At a distance of 3 linear kernels 75%, gaussian 87.5% and at a distance of 5 linear kernels 84.3%, gaussian 87.5%. In the SVM and GLCM algorithms the resulting accuracy is at a distance of 1 with a polynomial kernel 96.8%, linear 68.7%, and gaussian 75%. At distance 3, the polynomial kernel is 100%, linear 71.8%, and gaussian 78.1%, while for distance 5, the polynomial kernel is 87.5%, linear 75%, and gaussian 81.2%.


Author(s):  
Dominic Knoch ◽  
Christian R. Werner ◽  
Rhonda C. Meyer ◽  
David Riewe ◽  
Amine Abbadi ◽  
...  

Abstract Key message Complementing or replacing genetic markers with transcriptomic data and use of reproducing kernel Hilbert space regression based on Gaussian kernels increases hybrid prediction accuracies for complex agronomic traits in canola. In plant breeding, hybrids gained particular importance due to heterosis, the superior performance of offspring compared to their inbred parents. Since the development of new top performing hybrids requires labour-intensive and costly breeding programmes, including testing of large numbers of experimental hybrids, the prediction of hybrid performance is of utmost interest to plant breeders. In this study, we tested the effectiveness of hybrid prediction models in spring-type oilseed rape (Brassica napus L./canola) employing different omics profiles, individually and in combination. To this end, a population of 950 F1 hybrids was evaluated for seed yield and six other agronomically relevant traits in commercial field trials at several locations throughout Europe. A subset of these hybrids was also evaluated in a climatized glasshouse regarding early biomass production. For each of the 477 parental rapeseed lines, 13,201 single nucleotide polymorphisms (SNPs), 154 primary metabolites, and 19,479 transcripts were determined and used as predictive variables. Both, SNP markers and transcripts, effectively predict hybrid performance using (genomic) best linear unbiased prediction models (gBLUP). Compared to models using pure genetic markers, models incorporating transcriptome data resulted in significantly higher prediction accuracies for five out of seven agronomic traits, indicating that transcripts carry important information beyond genomic data. Notably, reproducing kernel Hilbert space regression based on Gaussian kernels significantly exceeded the predictive abilities of gBLUP models for six of the seven agronomic traits, demonstrating its potential for implementation in future canola breeding programmes.


2021 ◽  
Vol 62 ◽  
pp. 101513
Author(s):  
Ingo Steinwart ◽  
Simon Fischer

Sign in / Sign up

Export Citation Format

Share Document