How to obtain valid tests and confidence intervals after propensity score variable selection?

2019 ◽  
Vol 29 (3) ◽  
pp. 677-694 ◽  
Author(s):  
Oliver Dukes ◽  
Stijn Vansteelandt

The problem of how to best select variables for confounding adjustment forms one of the key challenges in the evaluation of exposure or treatment effects in observational studies. Routine practice is often based on stepwise selection procedures that use hypothesis testing, change-in-estimate assessments or the lasso, which have all been criticised for – amongst other things – not giving sufficient priority to the selection of confounders. This has prompted vigorous recent activity in developing procedures that prioritise the selection of confounders, while preventing the selection of so-called instrumental variables that are associated with exposure, but not outcome (after adjustment for the exposure). A major drawback of all these procedures is that there is no finite sample size at which they are guaranteed to deliver treatment effect estimators and associated confidence intervals with adequate performance. This is the result of the estimator jumping back and forth between different selected models, and standard confidence intervals ignoring the resulting model selection uncertainty. In this paper, we will develop insight into this by evaluating the finite-sample distribution of the exposure effect estimator in linear regression, under a number of the aforementioned confounder selection procedures. We will show that by making clever use of propensity scores, a simple and generic solution is obtained in the context of generalized linear models, which overcomes this concern (under weaker conditions than competing proposals). Specifically, we propose to use separate regularized regressions for the outcome and propensity score models in order to construct a doubly robust ‘g-estimator’; when these models are sufficiently sparse and correctly specified, standard confidence intervals for the g-estimator implicitly incorporate the uncertainty induced by the variable selection procedure.

2002 ◽  
Vol 28 (3) ◽  
Author(s):  
E. Cross ◽  
W. Marais ◽  
H. Steel ◽  
C. C. Theron

The validity and credibility of assertions on the efficiency and equity of selection procedures is dependent on the methodology with which the procedure was developed and justified. An ideal approach to the development and justification of a selection procedure was derived from standard guidelines and operationalized in the form of a comprehensive checklist. A psychometric audit on the developmental history of the selection procedure for the selection of commission advisors was undertaken. Various shortcomings were identified and rectified or recommendations were made on rectifying them. The audit found that the selection procedure had zero validity, negative utility and discriminated unfairly. Opsomming Die geldigheid en geloofwaardigheid van uitsprake oor die effektiwiteit en billikheid van ‘n keuringsprosedure is ‘n funksie van die metodologie waarmee die prosedure ontwikkel en regverdig is. ‘n Ideale benadering tot die ontwikkeling en regverdiging van ‘n keuringsprosedure is uit standaard riglyne afgelei en geoperasionaliseer in die vorm van ‘n omvattende kontrolelys. ‘n Psigometriese oudit is onderneem op die ontwikkelingsgeskiedenis van ‘n keuringsprosedure vir die keuring van kommissie-adviseurs. Verskeie tekortkominge is geïdentifiseer en reggestel of aanbevelings ten opsigte van regstelling is gemaak. Die oudit het bevind dat die keuringsprosedure oor zero geldigheid beskik, negatiewe nutwaarde toon en onbillik diskrimineer.


1964 ◽  
Vol 5 (3) ◽  
pp. 341-353 ◽  
Author(s):  
R. N. Curnow

The selection of animals or plants for high values of a certain character may favour not only genotypes associated with these high values but also genotypes associated with high variability. Any differences between genotypes in variability may therefore be of considerable importance in plant and livestock improvement programmes as well as in evolution. The effects of various selection procedures on variability have been studied in three recent experiments [Falconer & Robertson (1956) Falconer (1957) and Prout (1962)]. In these experiments one line was continued by selecting, in each generation, parents with values of a particular character near the population mean. Manning (1955, 1956) has described the effects of this kind of selection applied to cotton. Robertson (1956) derived and discussed the theory of such selection procedures when certain simplifying approximations can be made We shall obtain some more general results and show that Robertson was incorrect in saying that the selection procedure would lead to gene fixation even if the heterozygotes are less variable than the homozygotes. The importance of the results is discussed in section 8.


2013 ◽  
Vol 677 ◽  
pp. 357-362
Author(s):  
Natthasurang Yasungnoen ◽  
Patchanok Srisuradetchai

Model selection procedures play important role in many researches especially quantitative research. . In several area of sciences, the analysis and model selection of experiments are often used and often contains two fundamental goals associated with the experimental response of interest which are to determine the best model. The way to address these goals is to implement a model selection procedure. Then, the objectives of this research are to determine whether or not the final models selected are in agreement or differ substantially across the three approaches to model selection: using Akaike’s Information Criterion, using a p-value criterion, and using a stepwise procedure.. Generally, results from these three models are usually compare to each other. All selected models are based on the heredity principle to design the possible model for each design. The actual data from literature, consisting of the 2x3 and 32 and 3x4 factorial designs are used to determine the final model. The results show that the P-Value WH and Stepwise methods give the highest percentage of matched model.


2016 ◽  
Vol 12 (3) ◽  
pp. 79
Author(s):  
Tomasz Duraj

THE COMPETITIVE SELECTION PROCEDURE FOR MANAGEMENT STAFF: LEGAL ISSUES Summary This analysis relates to the legal issues in the competitive selection of management staff. Under the current provisions in Poland many legal acts pertain to this issue, giving an inhomogeneous set of regulations for the principles of conducting such procedures in particular domains of public, social and economic affairs. The subject of this article is a detailed description of the stages of the procedure for the competitive selection of management staff. Good legislation to regulate the selection procedures for competitions for management appointments will have a significant influence on the effectiveness of the adopted method of selection. The author presents the successive stages of the procedures for such competitions and conducts an in-depth legal analysis, paying specific attention to legal doubts arising in connection with the application of the current law. On the basis of his analysis he formulates some proposals de lege ferenda addressed to the legislator on the introduction of requisite amendments and supplements to the legal regulations for the procedure of competitive selection of management staff.


2020 ◽  
Vol 2020 ◽  
pp. 1-11
Author(s):  
Jinghua Zhang ◽  
Liugen Xue

Semiparametric generalized varying coefficient partially linear models with longitudinal data arise in contemporary biology, medicine, and life science. In this paper, we consider a variable selection procedure based on the combination of the basis function approximations and quadratic inference functions with SCAD penalty. The proposed procedure simultaneously selects significant variables in the parametric components and the nonparametric components. With appropriate selection of the tuning parameters, we establish the consistency, sparsity, and asymptotic normality of the resulting estimators. The finite sample performance of the proposed methods is evaluated through extensive simulation studies and a real data analysis.


2010 ◽  
Vol 61 (5) ◽  
pp. 1267-1278 ◽  
Author(s):  
L. Capelli ◽  
S. Sironi ◽  
R. Del Rosso ◽  
P. Céntola ◽  
S. Bonati

The EN 13725:2003, which standardizes the determination of odour concentration by dynamic olfactometry, fixes the limits for panel selection in terms of individual threshold towards a reference gas (n-butanol in nitrogen) and of standard deviation of the responses. Nonetheless, laboratories have some degrees of freedom in developing their own procedures for panel selection and evaluation. Most Italian olfactometric laboratories use a similar procedure for panel selection, based on the repeated analysis of samples of n-butanol at a concentration of 60 ppm. The first part of this study demonstrates that this procedure may originate a sort of “smartening” of the assessors, which means that they become able to guess the right answers in order to maintain their qualification as panel members, independently from their real olfactory perception. For this reason, the panel selection procedure has been revised with the aim of making it less repetitive, therefore preventing the possibility for panel members to be able to guess the best answers in order to comply with the selection criteria. The selection of new panel members and the screening of the active ones according to this revised procedure proved this new procedure to be more selective than the “standard” one. Finally, the results of the tests with n-butanol conducted after the introduction of the revised procedure for panel selection and regular verification showed an effective improvement of the laboratory measurement performances in terms of accuracy and precision.


Author(s):  
Md Hasinur Rahaman Khan ◽  
Anamika Bhadra ◽  
Tamanna Howlader

Abstract The instability in the selection of models is a major concern with data sets containing a large number of covariates. We focus on stability selection which is used as a technique to improve variable selection performance for a range of selection methods, based on aggregating the results of applying a selection procedure to sub-samples of the data where the observations are subject to right censoring. The accelerated failure time (AFT) models have proved useful in many contexts including the heavy censoring (as for example in cancer survival) and the high dimensionality (as for example in micro-array data). We implement the stability selection approach using three variable selection techniques—Lasso, ridge regression, and elastic net applied to censored data using AFT models. We compare the performances of these regularized techniques with and without stability selection approaches with simulation studies and two real data examples–a breast cancer data and a diffuse large B-cell lymphoma data. The results suggest that stability selection gives always stable scenario about the selection of variables and that as the dimension of data increases the performance of methods with stability selection also improves compared to methods without stability selection irrespective of the collinearity between the covariates.


2020 ◽  
pp. 106591292097171
Author(s):  
Nancy B. Arrington

Much attention is paid to how mechanisms for selecting political officials shape which types of officials hold positions of power, but selection procedures do not always produce the desired outcomes. In the context of the judiciary, many expected “merit” selection procedures to facilitate the selection of women justices to the bench, an expectation that has not been realized. Applying theories of procedural fairness to judicial selection procedures, I argue that observers’ beliefs that merit selection procedures are more “fair” (relative to unilateral selection procedures) makes observers more accepting of all-male benches. Survey experimental evidence demonstrates that respondents do perceive merit selection procedures as more fair than gubernatorial selection procedures, a priori. In turn, respondents are less critical of all-male courts when judges are selected through a merit selection procedure. These findings contribute to our understanding of the ways in which (1) selection institutions shape prospects for gender diversity, (2) institutional design can have unintended consequences, and (3) procedural fairness can obscure accountabilituy for suboptimal outcomes.


2013 ◽  
Vol 06 (03) ◽  
pp. 1350015 ◽  
Author(s):  
JIANG DU ◽  
ZHONGZHAN ZHANG ◽  
ZHIMENG SUN

In this paper, we propose a variable selection procedure for partially linear varying coefficient model under quantile loss function with adaptive Lasso penalty. The functional coefficients are estimated by B-spline approximations. The proposed procedure simultaneously selects significant variables and estimates unknown parameters. The major advantage of the proposed procedures over the existing ones is easy to implement using existing software, and it requires no specification of the error distributions. Under the regularity conditions, we show that the proposed procedure can be as efficient as the Oracle estimator, and derive the optimal convergence rate of the functional coefficients. A simulation study and a real data application are undertaken to assess the finite sample performance of the proposed variable selection procedure.


2019 ◽  
Vol 9 (3) ◽  
pp. 4169-4175
Author(s):  
R. F. Kamala ◽  
P. R. J. Thangaiah

In feature subset selection the variable selection procedure selects a subset of the most relevant features. Filter and wrapper methods are categories of variable selection methods. Feature subsets are similar to data pre-processing and are applied to reduce feature dimensions in a very large dataset. In this paper, in order to deal with this kind of problems, the selection of feature subset methods depending on the fitness evaluation of the classifier is introduced to alleviate the classification task and to progress the classification performance. To curtail the dimensions of the feature space, a novel approach for selecting optimal features on two-stage selection of feature subsets (TSFS) method is done, both theoretically and experimentally. The results of this method include improvements in the performance measures like efficiency, accuracy, and scalability of machine learning algorithms. Comparison of the proposed method is made with known relevant methods using benchmark databases. The proposed method performs better than the earlier hybrid feature selection methodologies discussed in relevant works, regarding classifiers’ accuracy and error.


Sign in / Sign up

Export Citation Format

Share Document