# Hypothesis Tests and Model Selection

# General Linear Hypothesis

The linear regression

y=Xβ+ε

has restrictions

r11β1+r12β2++r1KβK=q1r21β1+r22β2++r2KβK=q2rJ1β1+rJ2β2++rJKβK=qJ

or in the matrix form

Rβ=q

We might have the hypothesis that

H0:Rβq=0H1:Rβq0

There are two types of errors.

  1. Definition (size of a test). Type I error: The null hypothesis is correct, but we rejct it.
  2. Definition (power of a test). Type II error: The null hypothesis is incorrect, but we don't rejct it.

# Two Approaches to Testing Hypotheses

Wald tests. If the hypothesis is correct, the sample discrepancy, 𝑅𝑏𝑞 should be close to zero.

Fit based tests. A measure of how much 𝑅2 falls when we impose the restrictions.

# Wald Tests

# The $ t$ Test

Statistic: t distribution with nK degree of freedom,

tk=bkβk0s2Skk

1α confidence interval

Prob{t(1α2),[nK]<tk<+t(1α2),[nK]}.

# The $ F$ Statistic

F[J,nKX]=(Rbq){R[s2(XX)1]R}1(Rbq)J

For testing a single restriction, the t statistic is the square root of the F statistic

t2=(q^q)2Var(q^qX)=F[1,nK]

# Fit Tests

# The Restricted Least Square Estimator

𝑏 and 𝑏 are the unrestricted and restricted estimators, then

b=b+(XX)1R[R(XX)1R]1[qRb].

Remark 1. 如果 b 满足约束条件,那么b=b;如果不满足约束条件,则全部参数都会发生变化。

Remark 2. 除非b=b, 不然我们总有

SSE(restricted)>SSE(unrestricted)

Remark 3

Var[b|X]=Var[b|X]a nonnegatvie definite matrix 

# The Loss of Fit

For a coefficient restriction

tz2=(RXz2RX2)/1(1RXz2)/(nK)ryz2=tz2tz2+(nK)

And for general restrictions

F[J,nK]=(R2R2)/J(1R2)/(nK)

# Testing the Significance of the Regression

Setting 𝑅2=0 in the general statistic

F[J,nK]=(R2R2)/J(1R2)/(nK),

we have

F[J,nK]=R2/(K1)(1R2)/(nK).