# Endogeneity and IV Estimation
Endogeneity happens when exogeneity is violated, i.e.,
The following are some sources of endogeneity.
- Omitted variable bias
- Simultaneous/Reverse causality
- Measurement error
- Sample selection bias
# Assumption
# Explanatory variables
When exogeneity is violated, we assume
(
then the estimator
and is inconsistent,
# Instrumental variables
The instrumental variable should satisfy:
- Exogeneity. They are uncorrelated with the disturbance
. - Relevance. They are correlated with the independent variable
.
We can also describe it as:
, a finite, positive definite matrix (well-behaved data). , a finite matrix with rank (relevance). (exogeneity).
# IV Estimation
# Situation when L=K
We partition
The asymptotic distribution is
The asymptotic covariance matrix is estimated as
where
In general, we have
while
- Assume
, which is the standard OLS assumption. - Find instrumental variables
, use to estimate .
# Situation when L>K
Estimator:
where
In practice,
# Relevant test
We want to test whether the regressors are correlated with the disturbances.
# Hausman Test
The statistic is
where
# Hausman Statistic
The covariance between an efficient estimator
- which means
- Thus
We have a pair of estimators
# Problems
# Weak Instrument
The asymptotic covariance matrix of the IV estimator,
will be “large” when the instruments are weak.
# Measurement Error
# Single Regressor Model
A regression model with a single regressor and no constant term,
while
While the measurement error on
where
# Multiple Regression Model
In a multiple regression model,
When only a single variable is measured with error (assume the first variable), we have
and for
where