A Proposition of Generalized Method for Forward Selection of Variables

1980 ◽  
Vol 7 (7) ◽  
pp. 95-107 ◽  
Author(s):  
Haruo Yanai
2014 ◽  
Vol 11 (11) ◽  
pp. 15889-15909 ◽  
Author(s):  
S. Blanco

Abstract. Diatoms play a key role in the development of quantitative methods for environmental reconstruction in lake ecosystems. Diatom-based calibration datasets developed during the last decades allow the inference of past limnological variables such as TP, pH or conductivity and provide information on the autecology and distribution of diatom taxa. However, little is known about the relationships between diatoms and climatic or geographic factors. The response of surface sediment diatom assemblages to abiotic factors is usually examined using canonical correspondence analysis (CCA) and subsequent forward selection of variables based on Monte Carlo permutation tests that show the set of predictors best explaining the distributions of diatom species. The results reported in 40 previous studies using this methodology in different regions of the world are re-analyzed in this paper. Bi- and multivariate statistics (canonical correlation and two-block partial least-squares) were used to explore the correspondence between physical, chemical and physiographical factors and the variables that explain most of the variance in the diatom datasets. Results show that diatom communities respond mainly to chemical variables (pH, nutrients) with lake depth being the most important physiographical factor. However, the relative importance of certain parameters varied along latitudinal and trophic gradients. Canonical analyses demonstrated a strong concordance with regard to the predictor variables and the amount of variance they captured, suggesting that, on a broad scale, lake diatoms give a robust indication of past and present environmental conditions.


2020 ◽  
Author(s):  
Lucía A. Azibeiro ◽  
Michal Kucera ◽  
Lukas Jonkers ◽  
Francisco J. Sierro ◽  
Angela Cloke-Hayes

<p>La reconstrucción de la temperatura de la superficie del mar (TSM) ha estado durante mucho tiempo en el centro de la investigación paleoceanográfica. Los estudios en el Mediterráneo no han sido una excepción, ya que la reconstrucción cuantitativa de TSM en esta cuenca semicerrada es crucial para comprender el cambio climático pasado en la región. Muchos de estos métodos se basaron en foraminíferos planctónicos, tanto en su geoquímica de caparazón como en la composición de los ensamblajes (por ejemplo, funciones de transferencia). Comprender y modelar las relaciones entre el censo actual y las variables ambientales es la base para transformar los datos fósiles en estimaciones cuantitativas de estas variables. Aunque globalmente, los conjuntos de foraminíferos parecen estar determinados principalmente por la temperatura, en cuencas marginales como el Mediterráneo, </p><p>In this study we attempt to determine which environmental parameters may control the variability of planktonic foraminifer assemblages in the modern Mediterranean. For this purpose, census counts of planktonic foraminifera assemblages from Mediterranean coretops (ForCenS data base) have been integrated with monthly estimates of SST, chlorophyll concentration, and vertical gradients of various parameters as proxies for water column stratification/mixing (WOA 1998).  Redundancy Analysis (RDA) was used to evaluating the explanatory power and the collinearity among tested environmental parameters and a forward selection of variables was carried out to identify those explaining independently the largest share of the variance in the composition of planktonic foraminifera assemblages.</p><p>Se identificaron nueve variables significativas. Tres de ellos corresponden a TSM, mientras que los otros seis se distribuyen entre las concentraciones de clorofila superficial (2) y los gradientes térmicos verticales (4). Las variables más explicativas son la <em>TSM de junio</em> (R <sup>2</sup> 0.43) y <em>el gradiente térmico vertical de diciembre</em> (R <sup>2</sup> 0.15).</p>


Author(s):  
Willi Sauerbrei ◽  
◽  
Aris Perperoglou ◽  
Matthias Schmid ◽  
Michal Abrahamowicz ◽  
...  

2020 ◽  
Vol V (IV) ◽  
pp. 1-9
Author(s):  
Aftab Anwar ◽  
Muhammad Masood Anwar ◽  
Ghulam Yahya Khan

Since inflation and trade openness rate are considered as critical measure of an economy's health. This article analyze the relation of Economic growth with Investment, Inflation and Trade Openness of Pakistan for 1970- 2019. The policy guide lines from analysis include promotion of policies to increase Investment and Trade-openness in short and long-terms. The study used ARDL bound-testing for long-term and Un-Restricted-Error Correction techniques to discover short-term interrelation amongst a selection of variables. Results of study revealed inflation negatively related to economic performance and positively linked to Investment and Trade-Openness. Findings of enquiry suggested government should focus more on investment friendly policies in the country.


2015 ◽  
Author(s):  
Dorothy V Bishop ◽  
Paul A Thompson

Background: The p-curve is a plot of the distribution of p-values below .05 reported in a set of scientific studies. Comparisons between ranges of p-values have been used to evaluate fields of research in terms of the extent to which studies have genuine evidential value, and the extent to which they suffer from bias in the selection of variables and analyses for publication, p-hacking. We argue that binomial tests on the p-curve are not robust enough to be used for this purpose. Methods: P-hacking can take various forms. Here we used R code to simulate the use of ghost variables, where an experimenter gathers data on several dependent variables but reports only those with statistically significant effects. We also examined a text-mined dataset used by Head et al. (2015) and assessed its suitability for investigating p-hacking. Results: We first show that a p-curve suggestive of p-hacking can be obtained if researchers misapply parametric tests to data that depart from normality, even when no p-hacking occurs. We go on to show that when there is ghost p-hacking, the shape of the p-curve depends on whether dependent variables are intercorrelated. For uncorrelated variables, simulated p-hacked data do not give the "p-hacking bump" just below .05 that is regarded as evidence of p-hacking, though there is a negative skew when simulated variables are inter-correlated. The way p-curves vary according to features of underlying data poses problems when automated text mining is used to detect p-values in heterogeneous sets of published papers. Conclusions: A significant bump in the p-curve just below .05 is not necessarily evidence of p-hacking, and lack of a bump is not indicative of lack of p-hacking. Furthermore, while studies with evidential value will usually generate a right-skewed p-curve, we cannot treat a right-skewed p-curve as an indicator of the extent of evidential value, unless we have a model specific to the type of p-values entered into the analysis. We conclude that it is not feasible to use the p-curve to estimate the extent of p-hacking and evidential value unless there is considerable control over the type of data entered into the analysis.


Sign in / Sign up

Export Citation Format

Share Document