Cheating with Models

2021 ◽  
Vol 3 (4) ◽  
pp. 417-434
Author(s):  
Kfir Eliaz ◽  
Ran Spiegler ◽  
Yair Weiss

Beliefs and decisions are often based on confronting models with data. What is the largest “fake” correlation that a misspecified model can generate, even when it passes an elementary misspecification test? We study an “analyst” who fits a model, represented by a directed acyclic graph, to an objective (multivariate) Gaussian distribution. We characterize the maximal estimated pairwise correlation for generic Gaussian objective distributions, subject to the constraint that the estimated model preserves the marginal distribution of any individual variable. As the number of model variables grows, the estimated correlation can become arbitrarily close to one regardless of the objective correlation. (JEL D83, C13, C46, C51)

2012 ◽  
Vol 591-593 ◽  
pp. 1783-1788 ◽  
Author(s):  
Zhi Yang Jia ◽  
Pu Wang ◽  
Xue Jin Gao

In the process monitoring and fault diagnosis of batch processes, traditional principal component analysis (PCA) and least-squares (PLS), are assuming that the process variables are multivariate Gaussian distribution. But in the practical industrial process, the data observed of process variables do not necessarily be the multivariate Gaussian distribution. Independent component analysis (ICA), as a higher-order statistical method, is more suitable for dynamic systems. Observational data are decomposed into a linear combination of the independent components under statistical significance. The higher order statistics will be extracted and the mixed signals are decomposed into independent non-Gaussian components. Traditional method of ICA has to predefine the number of independent components. This paper proposed an improved MICA method of realizing the automatically choosing the independent components through setting the threshold value of the negentropy. The method can solve the problem of predefining the number of independent components in traditional methods and meanwhile it reduces the complexity of the monitoring model. The proposed method is used to do the process monitoring and fault diagnosis of penicillin fermentation and the results verify the feasibility and effectiveness of the method.


Sign in / Sign up

Export Citation Format

Share Document