Accounting for Endogeneity in Regression Models Using Copulas: A Step-by-Step Guide for Empirical Studies

AbstractMarketing researchers are increasingly taking advantage of the instrumental variable (IV)-free Gaussian copula approach. They use this method to identify and correct endogeneity when estimating regression models with non-experimental data. The Gaussian copula approach’s original presentation and performance demonstration via a series of simulation studies focused primarily on regression models without intercept. However, marketing and other disciplines’ researchers mainly use regression models with intercept. This research expands our knowledge of the Gaussian copula approach to regression models with intercept and to multilevel models. The results of our simulation studies reveal a fundamental bias and concerns about statistical power at smaller sample sizes and when the approach’s primary assumptions are not fully met. This key finding opposes the method’s potential advantages and raises concerns about its appropriate use in prior studies. As a remedy, we derive boundary conditions and guidelines that contribute to the Gaussian copula approach’s proper use. Thereby, this research contributes to ensuring the validity of results and conclusions of empirical research applying the Gaussian copula approach.

Download Full-text

The distributions of the J and Cox non-nested tests in regression models with weakly correlated regressors

10.32920/14640891 ◽

2021 ◽

Author(s):

Leo Michelis

Keyword(s):

Monte Carlo ◽

Alternative Model ◽

Regression Models ◽

Population Distribution ◽

Nuisance Parameter ◽

Linear Regression Models ◽

Null Distributions ◽

Finite Samples ◽

The Monte Carlo Method ◽

Asymptotic Expressions

This paper examines the asymptotic null distributions of the J and Cox non-nested tests in the framework of two linear regression models with nearly orthogonal non-nested regressors. The analysis is based on the concept of near population orthogonality (NPO), according to which the non-nested regressors in the two models are nearly uncorrelated in the population distribution from which they are drawn. New distributional results emerge under NPO. The J and Cox tests tend to two different random variables asymptotically, each of which is expressible as a function of a nuisance parameter, c, a N(0,1) variate and a χ2(q) variate, where q is the number of non-nested regressors in the alternative model. The Monte Carlo method is used to show the relevance of the new results in finite samples and to compute alternative critical values for the two tests under NPO by plugging consistent estimates of c into the relevant asymptotic expressions. An empirical example illustrates the ‘plug in’ procedure.

Download Full-text

SHRINKAGE ESTIMATION OF REGRESSION MODELS WITH MULTIPLE STRUCTURAL CHANGES

Econometric Theory ◽

10.1017/s0266466615000237 ◽

2015 ◽

Vol 32 (6) ◽

pp. 1376-1433 ◽

Cited By ~ 16

Author(s):

Junhui Qian ◽

Liangjun Su

Keyword(s):

Regression Models ◽

Structural Changes ◽

Equity Premium ◽

Asymptotic Distributions ◽

Regression Coefficients ◽

Tuning Parameter ◽

Linear Regression Models ◽

Predictive Regression ◽

Finite Samples ◽

Fundamental Information

In this paper, we consider the problem of determining the number of structural changes in multiple linear regression models via group fused Lasso. We show that with probability tending to one, our method can correctly determine the unknown number of breaks, and the estimated break dates are sufficiently close to the true break dates. We obtain estimates of the regression coefficients via post Lasso and establish the asymptotic distributions of the estimates of both break ratios and regression coefficients. We also propose and validate a data-driven method to determine the tuning parameter. Monte Carlo simulations demonstrate that the proposed method works well in finite samples. We illustrate the use of our method with a predictive regression of the equity premium on fundamental information.

Download Full-text

The distributions of the J and Cox non-nested tests in regression models with weakly correlated regressors

10.32920/14640891.v1 ◽

2021 ◽

Author(s):

Leo Michelis

Keyword(s):

Monte Carlo ◽

Alternative Model ◽

Regression Models ◽

Population Distribution ◽

Nuisance Parameter ◽

Linear Regression Models ◽

Null Distributions ◽

Finite Samples ◽

The Monte Carlo Method ◽

Asymptotic Expressions

This paper examines the asymptotic null distributions of the J and Cox non-nested tests in the framework of two linear regression models with nearly orthogonal non-nested regressors. The analysis is based on the concept of near population orthogonality (NPO), according to which the non-nested regressors in the two models are nearly uncorrelated in the population distribution from which they are drawn. New distributional results emerge under NPO. The J and Cox tests tend to two different random variables asymptotically, each of which is expressible as a function of a nuisance parameter, c, a N(0,1) variate and a χ2(q) variate, where q is the number of non-nested regressors in the alternative model. The Monte Carlo method is used to show the relevance of the new results in finite samples and to compute alternative critical values for the two tests under NPO by plugging consistent estimates of c into the relevant asymptotic expressions. An empirical example illustrates the ‘plug in’ procedure.

Download Full-text

FAKTOR DETERMINAN PENYALURAN KREDIT BANK PERSERO

Journal of Business Economics ◽

10.35760/eb.2018.v23i1.1812 ◽

2018 ◽

Vol 23 (1) ◽

pp. 60-71

Author(s):

Wigiyanti Masodah

Keyword(s):

Interest Rate ◽

Linear Regression ◽

Interest Rates ◽

Regression Models ◽

Linear Regression Models ◽

Negative Impacts ◽

Main Activity ◽

The Impact ◽

The Given ◽

Multiple Linear Regression Models

Offering credit is the main activity of a Bank. There are some considerations when a bank offers credit, that includes Interest Rates, Inflation, and NPL. This study aims to find out the impact of Variable Interest Rates, Inflation variables and NPL variables on credit disbursed. The object in this study is state-owned banks. The method of analysis in this study uses multiple linear regression models. The results of the study have shown that Interest Rates and NPL gave some negative impacts on the given credit. Meanwhile, Inflation variable does not have a significant effect on credit given. Keywords: Interest Rate, Inflation, NPL, offered Credit.

Download Full-text

A Note on the Estimation of Linear Regression Models with Heteroskedastic Measurement Errors

SSRN Electronic Journal ◽

10.2139/ssrn.295567 ◽

2002 ◽

Cited By ~ 2

Author(s):

Daniel G. Sullivan

Keyword(s):

Linear Regression ◽

Measurement Errors ◽

Regression Models ◽

Linear Regression Models

Download Full-text

Linear Regression Models for Interval-Valued Data using Log-transformation

Anais do 14. Congresso Brasileiro de Inteligência Computacional ◽

10.21528/cbic2019-3 ◽

2020 ◽

Author(s):

Nykolas Mayko Maia Barbosa ◽

João Paulo Pordeus Gomes ◽

César Lincoln Cavalcante Mattos ◽

Diêgo Farias Oliveira

Keyword(s):

Linear Regression ◽

Regression Models ◽

Linear Regression Models ◽

Log Transformation ◽

Interval Valued

Download Full-text

THE PREDICTIVE CONTENT OF DISAGGREGATED NORMAL INCOME: An Empirical Study in the JSX

Gadjah Mada International Journal of Business ◽

10.22146/gamaijb.5633 ◽

2003 ◽

Vol 5 (3) ◽

pp. 363 ◽

Cited By ~ 1

Author(s):

Slamet Sugiri

Keyword(s):

Linear Regression ◽

Regression Models ◽

Stock Exchange ◽

Manufacturing Firms ◽

Single Step ◽

Model Parameters ◽

Linear Regression Models ◽

Operating Income ◽

Predictive Content ◽

Multiple Step

The main objective of this study is to examine a hypothesis that the predictive content of normal income disaggregated into operating income and nonoperating income outperforms that of aggregated normal income in predicting future cash flow. To test the hypothesis, linear regression models are developed. The model parameters are estimated based on fifty-five manufacturing firms listed in the Jakarta Stock Exchange (JSX) up to the end of 1997.This study finds that empirical evidence supports the hypothesis. This evidence supports arguments that, in reporting income from continuing operations, multiple-step approach is preferred to single-step one.

Download Full-text

With Great Dispersion Comes Greater Resilience: Efficient Poisoning Attacks and Defenses for Linear Regression Models

IEEE Transactions on Information Forensics and Security ◽

10.1109/tifs.2021.3087332 ◽

2021 ◽

pp. 1-1

Author(s):

Jialin Wen ◽

Benjamin Zi Hao Zhao ◽

Minhui Xue ◽

Alina Oprea ◽

Haifeng Qian

Keyword(s):

Linear Regression ◽

Regression Models ◽

Linear Regression Models ◽

Attacks And Defenses

Download Full-text

Relationships between test positivity rate, total laboratory confirmed cases of malaria, and malaria incidence in high burden settings of Uganda: an ecological analysis

Malaria Journal ◽

10.1186/s12936-021-03584-7 ◽

2021 ◽

Vol 20 (1) ◽

Author(s):

Jaffer Okiring ◽

Adrienne Epstein ◽

Jane F. Namuganga ◽

Victor Kamya ◽

Asadu Sserwanga ◽

...

Keyword(s):

Regression Models ◽

Malaria Incidence ◽

Temporal Changes ◽

Linear Regression Models ◽

High Burden ◽

Exponential Regression ◽

Malaria Morbidity ◽

Laboratory Test Results ◽

Combining Data ◽

Catchment Areas

Abstract Background Malaria surveillance is critical for monitoring changes in malaria morbidity over time. National Malaria Control Programmes often rely on surrogate measures of malaria incidence, including the test positivity rate (TPR) and total laboratory confirmed cases of malaria (TCM), to monitor trends in malaria morbidity. However, there are limited data on the accuracy of TPR and TCM for predicting temporal changes in malaria incidence, especially in high burden settings. Methods This study leveraged data from 5 malaria reference centres (MRCs) located in high burden settings over a 15-month period from November 2018 through January 2020 as part of an enhanced health facility-based surveillance system established in Uganda. Individual level data were collected from all outpatients including demographics, laboratory test results, and village of residence. Estimates of malaria incidence were derived from catchment areas around the MRCs. Temporal relationships between monthly aggregate measures of TPR and TCM relative to estimates of malaria incidence were examined using linear and exponential regression models. Results A total of 149,739 outpatient visits to the 5 MRCs were recorded. Overall, malaria was suspected in 73.4% of visits, 99.1% of patients with suspected malaria received a diagnostic test, and 69.7% of those tested for malaria were positive. Temporal correlations between monthly measures of TPR and malaria incidence using linear and exponential regression models were relatively poor, with small changes in TPR frequently associated with large changes in malaria incidence. Linear regression models of temporal changes in TCM provided the most parsimonious and accurate predictor of changes in malaria incidence, with adjusted R2 values ranging from 0.81 to 0.98 across the 5 MRCs. However, the slope of the regression lines indicating the change in malaria incidence per unit change in TCM varied from 0.57 to 2.13 across the 5 MRCs, and when combining data across all 5 sites, the R2 value reduced to 0.38. Conclusions In high malaria burden areas of Uganda, site-specific temporal changes in TCM had a strong linear relationship with malaria incidence and were a more useful metric than TPR. However, caution should be taken when comparing changes in TCM across sites.

Download Full-text