scholarly journals Gradient boosting for linear mixed models

2021 ◽  
Vol 0 (0) ◽  
Author(s):  
Colin Griesbach ◽  
Benjamin Säfken ◽  
Elisabeth Waldmann

Abstract Gradient boosting from the field of statistical learning is widely known as a powerful framework for estimation and selection of predictor effects in various regression models by adapting concepts from classification theory. Current boosting approaches also offer methods accounting for random effects and thus enable prediction of mixed models for longitudinal and clustered data. However, these approaches include several flaws resulting in unbalanced effect selection with falsely induced shrinkage and a low convergence rate on the one hand and biased estimates of the random effects on the other hand. We therefore propose a new boosting algorithm which explicitly accounts for the random structure by excluding it from the selection procedure, properly correcting the random effects estimates and in addition providing likelihood-based estimation of the random effects variance structure. The new algorithm offers an organic and unbiased fitting approach, which is shown via simulations and data examples.

PLoS ONE ◽  
2021 ◽  
Vol 16 (7) ◽  
pp. e0254178
Author(s):  
Colin Griesbach ◽  
Andreas Groll ◽  
Elisabeth Bergherr

Boosting techniques from the field of statistical learning have grown to be a popular tool for estimating and selecting predictor effects in various regression models and can roughly be separated in two general approaches, namely gradient boosting and likelihood-based boosting. An extensive framework has been proposed in order to fit generalized mixed models based on boosting, however for the case of cluster-constant covariates likelihood-based boosting approaches tend to mischoose variables in the selection step leading to wrong estimates. We propose an improved boosting algorithm for linear mixed models, where the random effects are properly weighted, disentangled from the fixed effects updating scheme and corrected for correlations with cluster-constant covariates in order to improve quality of estimates and in addition reduce the computational effort. The method outperforms current state-of-the-art approaches from boosting and maximum likelihood inference which is shown via simulations and various data examples.


Biometrika ◽  
2010 ◽  
Vol 97 (4) ◽  
pp. 773-789 ◽  
Author(s):  
Sonja Greven ◽  
Thomas Kneib

Abstract In linear mixed models, model selection frequently includes the selection of random effects. Two versions of the Akaike information criterion, aic, have been used, based either on the marginal or on the conditional distribution. We show that the marginal aic is not an asymptotically unbiased estimator of the Akaike information, and favours smaller models without random effects. For the conditional aic, we show that ignoring estimation uncertainty in the random effects covariance matrix, as is common practice, induces a bias that can lead to the selection of any random effect not predicted to be exactly zero. We derive an analytic representation of a corrected version of the conditional aic, which avoids the high computational cost and imprecision of available numerical approximations. An implementation in an R package (R Development Core Team, 2010) is provided. All theoretical results are illustrated in simulation studies, and their impact in practice is investigated in an analysis of childhood malnutrition in Zambia.


1975 ◽  
Vol 26 ◽  
pp. 395-407
Author(s):  
S. Henriksen

The first question to be answered, in seeking coordinate systems for geodynamics, is: what is geodynamics? The answer is, of course, that geodynamics is that part of geophysics which is concerned with movements of the Earth, as opposed to geostatics which is the physics of the stationary Earth. But as far as we know, there is no stationary Earth – epur sic monere. So geodynamics is actually coextensive with geophysics, and coordinate systems suitable for the one should be suitable for the other. At the present time, there are not many coordinate systems, if any, that can be identified with a static Earth. Certainly the only coordinate of aeronomic (atmospheric) interest is the height, and this is usually either as geodynamic height or as pressure. In oceanology, the most important coordinate is depth, and this, like heights in the atmosphere, is expressed as metric depth from mean sea level, as geodynamic depth, or as pressure. Only for the earth do we find “static” systems in use, ana even here there is real question as to whether the systems are dynamic or static. So it would seem that our answer to the question, of what kind, of coordinate systems are we seeking, must be that we are looking for the same systems as are used in geophysics, and these systems are dynamic in nature already – that is, their definition involvestime.


Methodology ◽  
2018 ◽  
Vol 14 (4) ◽  
pp. 177-188 ◽  
Author(s):  
Martin Schultze ◽  
Michael Eid

Abstract. In the construction of scales intended for the use in cross-cultural studies, the selection of items needs to be guided not only by traditional criteria of item quality, but has to take information about the measurement invariance of the scale into account. We present an approach to automated item selection which depicts the process as a combinatorial optimization problem and aims at finding a scale which fulfils predefined target criteria – such as measurement invariance across cultures. The search for an optimal solution is performed using an adaptation of the [Formula: see text] Ant System algorithm. The approach is illustrated using an application to item selection for a personality scale assuming measurement invariance across multiple countries.


2019 ◽  
Vol 37 (1) ◽  
pp. 89-110
Author(s):  
Rachel Fensham

The Viennese modern choreographer Gertrud Bodenwieser's black coat leads to an analysis of her choreography in four main phases – the early European career; the rise of Nazism; war's brutality; and postwar attempts at reconciliation. Utilising archival and embodied research, the article focuses on a selection of Bodenwieser costumes that survived her journey from Vienna, or were remade in Australia, and their role in the dramaturgy of works such as Swinging Bells (1926), The Masks of Lucifer (1936, 1944), Cain and Abel (1940) and The One and the Many (1946). In addition to dance history, costume studies provides a distinctive way to engage with the question of what remains of performance, and what survives of the historical conditions and experience of modern dance-drama. Throughout, Hannah Arendt's book The Human Condition (1958) provides a critical guide to the acts of reconstruction undertaken by Bodenwieser as an émigré choreographer in the practice of her craft, and its ‘materializing reification’ of creative thought. As a study in affective memory, information regarding Bodenwieser's personal life becomes interwoven with the author's response to the material evidence of costumes, oral histories and documents located in various Australian archives. By resurrecting the ‘dead letters’ of this choreography, the article therefore considers how dance costumes offer the trace of an artistic resistance to totalitarianism.


Author(s):  
YuE Kravchenko ◽  
SV Ivanov ◽  
DS Kravchenko ◽  
EI Frolova ◽  
SP Chumakov

Selection of antibodies using phage display involves the preliminary cloning of the repertoire of sequences encoding antigen-binding domains into phagemid, which is considered the bottleneck of the method, limiting the resulting diversity of libraries and leading to the loss of poorly represented variants before the start of the selection procedure. Selection in cell-free conditions using a ribosomal display is devoid from this drawback, however is highly sensitive to PCR artifacts and the RNase contamination. The aim of the study was to test the efficiency of a combination of both methods, including pre-selection in a cell-free system to enrich the source library, followed by cloning and final selection using phage display. This approach may eliminate the shortcomings of each method and increase the efficiency of selection. For selection, alpaca VHH antibody sequences suitable for building an immune library were used due to the lack of VL domains. Analysis of immune libraries from the genes of the VH3, VHH3 and VH4 families showed that the VHH antibodies share in the VH3 and VH4 gene groups is insignificant, and selection from the combined library is less effective than from the VHH3 family of sequences. We found that the combination of ribosomal and phage displays leads to a higher enrichment of high-affinity fragments and avoids the loss of the original diversity during cloning. The combined method allowed us to obtain a greater number of different high-affinity sequences, and all the tested VHH fragments were able to specifically recognize the target, including the total protein extracts of cell cultures.


Kybernetes ◽  
2019 ◽  
Vol 49 (4) ◽  
pp. 1083-1102
Author(s):  
Georgios N. Aretoulis ◽  
Jason Papathanasiou ◽  
Fani Antoniou

Purpose This paper aims to rank and identify the most efficient project managers (PMs) based on personality traits, using Preference Ranking Organization METHod for Enrichment Evaluations (PROMETHEE) methodology. Design/methodology/approach The proposed methodology relies on the five personality traits. These were used as the selection criteria. A questionnaire survey among 82 experienced engineers was used to estimate the required weights per personality trait. A second two-part questionnaire survey aimed at recording the PMs profile and assess the performance of personality traits per PM. PMs with the most years of experience are selected to be ranked through Visual PROMETHEE. Findings The findings suggest that a competent PM is the one that scores low on the “Neuroticism” trait and high especially on the “Conscientiousness” trait. Research limitations/implications The research applied a psychometric test specifically designed for Greek people. Furthermore, the proposed methodology is based on the personality characteristics to rank the PMs and does not consider the technical skills. Furthermore, the type of project is not considered in the process of ranking PMs. Practical implications The findings could contribute in the selection of the best PM that maximizes the project team’s performance. Social implications Improved project team communication and collaboration leading to improved project performance through better communication and collaboration. This is an additional benefit for the society, especially in the delivery of public infrastructure projects. A lot of public infrastructure projects deviate largely as far as cost and schedule is concerned and this is an additional burden for public and society. Proper project management through efficient PMs would save people’s money and time. Originality/value Identification of the best PMbased on a combination of multicriteria decision-making and psychometric tests, which focus on personality traits.


2021 ◽  
Vol 5 (1) ◽  
Author(s):  
Osman Mamun ◽  
Madison Wenzlick ◽  
Arun Sathanur ◽  
Jeffrey Hawk ◽  
Ram Devanathan

AbstractThe Larson–Miller parameter (LMP) offers an efficient and fast scheme to estimate the creep rupture life of alloy materials for high-temperature applications; however, poor generalizability and dependence on the constant C often result in sub-optimal performance. In this work, we show that the direct rupture life parameterization without intermediate LMP parameterization, using a gradient boosting algorithm, can be used to train ML models for very accurate prediction of rupture life in a variety of alloys (Pearson correlation coefficient >0.9 for 9–12% Cr and >0.8 for austenitic stainless steels). In addition, the Shapley value was used to quantify feature importance, making the model interpretable by identifying the effect of various features on the model performance. Finally, a variational autoencoder-based generative model was built by conditioning on the experimental dataset to sample hypothetical synthetic candidate alloys from the learnt joint distribution not existing in both 9–12% Cr ferritic–martensitic alloys and austenitic stainless steel datasets.


Author(s):  
Hai Tao ◽  
Maria Habib ◽  
Ibrahim Aljarah ◽  
Hossam Faris ◽  
Haitham Abdulmohsin Afan ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document