Data envelopment analysis with missing data: a multiple imputation approach

Author(s):  
Ya Chen ◽  
Yongjun Li ◽  
Qiwei Xie ◽  
Qingxian An ◽  
N.A. Liang
2013 ◽  
Vol 37 (9) ◽  
pp. 6135-6145 ◽  
Author(s):  
Yong Zha ◽  
Ali Song ◽  
Chuanyong Xu ◽  
Honglin Yang

2020 ◽  
Vol 12 (4) ◽  
pp. 1432 ◽  
Author(s):  
Tihana Škrinjarić

Measuring the efficiency of research and development (R&D) expenditure and innovation policy has gained attention in recent years. This research examines the efficiency of 29 selected European countries for the period ranging from 2007 to 2017 in achieving and obtaining R&D goals. The methodology applied is the data envelopment analysis approach with the inclusion of the missing data approach. The contributions of this research include the following: dynamic analysis is conducted to track changes of (in)efficiencies over time; the decomposition of the efficiency is done by separating the main variables of interest into the private, higher education, and government sectors; and the robustness of the results is evaluated, which is often ignored in the literature. The results of the analysis are discussed with possible directions for inefficient countries. The rankings provided in the empirical part of the study confirm previous findings on disparities between the European countries with respect to innovation and the R&D sector.


2019 ◽  
Author(s):  
Anni Hämäläinen ◽  
Paul Mick

Missing data can be a significant problem for statistical inference in many disciplines when information is not missing completely at random. In the worst case, it can lead to biased results when participants or subjects with certain characteristics contribute more data than other participants. Multiple imputation methods can be used to alleviate the loss of sample size and correct for this potential bias. Multiple imputation entails filling in the missing data using information from the same and other participants on the variables of interest and potentially other available data that correlate with the variables of interest. The missing data estimates and uncertainty associated with their estimation may then be taken into account in statistical inference from those variables. A complication may arise when using compound variables, such as principal component loadings (PC), which draw on a number of raw variables that themselves have non-overlapping missing data. Here, we propose a sequential multiple imputation approach to facilitate the use of all available data in the raw variables contained in compound variables in a way that conforms to the specifications of the multiple imputation framework. We first use multiple imputation to impute missing data for the subset of raw variables used in a principal component analysis (PCA) and perform the PCA with the imputed data; then, use the factor loadings to calculate PC scores for each individual with complete raw data. Finally, we include these PC scores as part of a global multiple imputation approach to estimate a final statistical model. We demonstrate (including annotated Stata code) the use of this approach by examining which sensory, health, social and cognitive factors explain self-reported sensory difficulties in the Canadian Longitudinal Study of Aging (CLSA) Comprehensive Cohort. The proposed sequential multiple imputation approach allows us to deal with the issue of having large cumulative amount of data that is missing (not completely at random) among a large number of variables, including composite cognitive scores derived from a battery of cognitive tests. We examine the resulting parameter estimates using a range of recommended diagnostic tools to highlight the potential and consequences of the approach to the statistical results.


2016 ◽  
Vol 72 (3) ◽  
Author(s):  
Talat Senel ◽  
Yuksel Terzi ◽  
Serpil Gumustekin ◽  
Mehmet Ali Cengiz

Sign in / Sign up

Export Citation Format

Share Document