multivariate location
Recently Published Documents


TOTAL DOCUMENTS

116
(FIVE YEARS 10)

H-INDEX

22
(FIVE YEARS 1)

2021 ◽  
Vol 9 (5) ◽  
pp. 653-663
Author(s):  
Yik-Siong Pang ◽  
Nor Aishah Ahad ◽  
Sharipah Soaad Syed Yahaya

2021 ◽  
Vol 31 (5) ◽  
Author(s):  
Joachim Schreurs ◽  
Iwein Vranckx ◽  
Mia Hubert ◽  
Johan A. K. Suykens ◽  
Peter J. Rousseeuw

AbstractThe minimum regularized covariance determinant method (MRCD) is a robust estimator for multivariate location and scatter, which detects outliers by fitting a robust covariance matrix to the data. Its regularization ensures that the covariance matrix is well-conditioned in any dimension. The MRCD assumes that the non-outlying observations are roughly elliptically distributed, but many datasets are not of that form. Moreover, the computation time of MRCD increases substantially when the number of variables goes up, and nowadays datasets with many variables are common. The proposed kernel minimum regularized covariance determinant (KMRCD) estimator addresses both issues. It is not restricted to elliptical data because it implicitly computes the MRCD estimates in a kernel-induced feature space. A fast algorithm is constructed that starts from kernel-based initial estimates and exploits the kernel trick to speed up the subsequent computations. Based on the KMRCD estimates, a rule is proposed to flag outliers. The KMRCD algorithm performs well in simulations, and is illustrated on real-life data.


PeerJ ◽  
2021 ◽  
Vol 9 ◽  
pp. e11436
Author(s):  
Thomas R. Etherington

The Mahalanobis distance is a statistical technique that has been used in statistics and data science for data classification and outlier detection, and in ecology to quantify species-environment relationships in habitat and ecological niche models. Mahalanobis distances are based on the location and scatter of a multivariate normal distribution, and can measure how distant any point in space is from the centre of this kind of distribution. Three different methods for calculating the multivariate location and scatter are commonly used: the sample mean and variance-covariance, the minimum covariance determinant, and the minimum volume ellipsoid. The minimum covariance determinant and minimum volume ellipsoid were developed to be robust to outliers by minimising the multivariate location and scatter for a subset of the full sample, with the proportion of the full sample forming the subset being controlled by a user-defined parameter. This outlier robustness means the minimum covariance determinant and the minimum volume ellipsoid are highly relevant for ecological niche analyses, which are usually based on natural history observations that are likely to contain errors. However, natural history observations will also contain extreme bias, to which the minimum covariance determinant and the minimum volume ellipsoid will also be sensitive. To provide guidance for selecting and parameterising a multivariate location and scatter method, a series of virtual ecological niche modelling experiments were conducted to demonstrate the performance of each multivariate location and scatter method under different levels of sample size, errors, and bias. The results show that there is no optimal modelling approach, and that choices need to be made based on the individual data and question. The sample mean and variance-covariance method will perform best on very small sample sizes if the data are free of error and bias. At larger sample sizes the minimum covariance determinant and minimum volume ellipsoid methods perform as well or better, but only if they are appropriately parameterised. Modellers who are more concerned about the prevalence of errors should retain a smaller proportion of the full data set, while modellers more concerned about the prevalence of bias should retain a larger proportion of the full data set. I conclude that Mahalanobis distances are a useful niche modelling technique, but only for questions relating to the fundamental niche of a species where the assumption of multivariate normality is reasonable. Users of the minimum covariance determinant and minimum volume ellipsoid methods must also clearly report their parameterisations so that the results can be interpreted correctly.


Econometrics ◽  
2021 ◽  
Vol 9 (2) ◽  
pp. 19
Author(s):  
Gustavo Canavire-Bacarreza ◽  
Luis Castro Peñarrieta ◽  
Darwin Ugarte Ontiveros

Outliers can be particularly hard to detect, creating bias and inconsistency in the semi-parametric estimates. In this paper, we use Monte Carlo simulations to demonstrate that semi-parametric methods, such as matching, are biased in the presence of outliers. Bad and good leverage point outliers are considered. Bias arises in the case of bad leverage points because they completely change the distribution of the metrics used to define counterfactuals; good leverage points, on the other hand, increase the chance of breaking the common support condition and distort the balance of the covariates, which may push practitioners to misspecify the propensity score or the distance measures. We provide some clues to identify and correct for the effects of outliers following a reweighting strategy in the spirit of the Stahel-Donoho (SD) multivariate estimator of scale and location, and the S-estimator of multivariate location (Smultiv). An application of this strategy to experimental data is also implemented.


Econometrica ◽  
2021 ◽  
Vol 89 (5) ◽  
pp. 2497-2516 ◽  
Author(s):  
Eben Lazarus ◽  
Daniel J. Lewis ◽  
James H. Stock

Heteroskedasticity‐ and autocorrelation‐robust (HAR) inference in time series regression typically involves kernel estimation of the long‐run variance. Conventional wisdom holds that, for a given kernel, the choice of truncation parameter trades off a test's null rejection rate and power, and that this tradeoff differs across kernels. We formalize this intuition: using higher‐order expansions, we provide a unified size‐power frontier for both kernel and weighted orthonormal series tests using nonstandard “fixed‐ b” critical values. We also provide a frontier for the subset of these tests for which the fixed‐ b distribution is t or F. These frontiers are respectively achieved by the QS kernel and equal‐weighted periodogram. The frontiers have simple closed‐form expressions, which show that the price paid for restricting attention to tests with t and F critical values is small. The frontiers are derived for the Gaussian multivariate location model, but simulations suggest the qualitative findings extend to stochastic regressors.


2020 ◽  
Vol 0 (0) ◽  
Author(s):  
Szabolcs Blazsek ◽  
Alvaro Escribano ◽  
Adrian Licht

AbstractFor policy decisions, capturing seasonal effects in impulse responses are important for the correct specification of dynamic models that measure interaction effects for policy-relevant macroeconomic variables. In this paper, a new multivariate method is suggested, which uses the score-driven quasi-vector autoregressive (QVAR) model, to capture seasonal effects in impulse response functions (IRFs). The nonlinear QVAR-based method is compared with the existing linear VAR-based method. The following technical aspects of the new method are presented: (i) mathematical formulation of QVAR; (ii) first-order representation and infinite vector moving average, VMA (∞), representation of QVAR; (iii) IRF of QVAR; (iv) statistical inference of QVAR and conditions of consistency and asymptotic normality of the estimates. Control data are used for the period of 1987:Q1 to 2013:Q2, from the following policy-relevant macroeconomic variables: crude oil real price, United States (US) inflation rate, and US real gross domestic product (GDP). A graphical representation of seasonal effects among variables is provided, by using the IRF. According to the estimation results, annual seasonal effects are almost undetected by using the existing linear VAR tool, but those effects are detected by using the new QVAR tool.


2019 ◽  
Vol 237 (6) ◽  
pp. 831-845
Author(s):  
G. L. Shevlyakov ◽  
A.A. Shagal ◽  
V. I. Shin

2019 ◽  
Vol 26 (2) ◽  
pp. 367-371
Author(s):  
Bruno Meriaux ◽  
Chengfang Ren ◽  
Mohammed Nabil El Korso ◽  
Arnaud Breloy ◽  
Philippe Forster

Sign in / Sign up

Export Citation Format

Share Document