# Hypothesis Tests and Model Selection

# General Linear Hypothesis

Consider the linear regression model

y = X β + ε

subject to $J$ linear restrictions on the $K$ coefficients

\begin{matrix} r_{11} β_{1} + r_{12} β_{2} + \dots + r_{1 K} β_{K} = q_{1} \\ r_{21} β_{1} + r_{22} β_{2} + \dots + r_{2 K} β_{K} = q_{2} \\ \dots \\ r_{J 1} β_{1} + r_{J 2} β_{2} + \dots + r_{J K} β_{K} = q_{J} \end{matrix}

or, in matrix form,

R β = q

The hypothesis to be tested is

\begin{array}{l} H_{0} : R β - q = 0 \\ H_{1} : R β - q \neq 0 \end{array}
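To make the notation concrete, the following is a minimal sketch of how $R$ and $q$ could be written down for one particular hypothesis. The model size ($K = 4$), the two restrictions, and the coefficient vectors are all invented for illustration and do not come from the notes.

```python
import numpy as np

# Hypothetical model with K = 4 coefficients (β1 is the constant term).
K = 4

# Two restrictions (J = 2), chosen only for illustration:
#   β2 + β3 = 1
#   β4 = 0
R = np.array([
    [0.0, 1.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
])
q = np.array([1.0, 0.0])

def discrepancy(beta):
    """Return R β - q, which is the zero vector exactly when H0 holds."""
    return R @ beta - q

beta_ok = np.array([2.0, 0.3, 0.7, 0.0])    # satisfies both restrictions
beta_bad = np.array([2.0, 0.3, 0.5, 0.1])   # violates both restrictions
```

Each row of `R` holds the weights $r_{j1}, \dots, r_{jK}$ of one restriction, and the matching entry of `q` holds $q_j$.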

A test can make two types of errors.

Definition (size of a test). The probability of a Type I error: the null hypothesis is correct, but we reject it.
Definition (power of a test). One minus the probability of a Type II error: the null hypothesis is incorrect, but we do not reject it.
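Size and power can be approximated by simulation. The sketch below assumes a small hypothetical design ($n = 23$, $K = 3$, normal errors with unit variance) and tests $H_0: β_3 = 0$ with a two-sided $t$ test at the 5% level. The critical value 2.086 is the tabulated value for 20 degrees of freedom. None of these choices come from the notes.

```python
import numpy as np

rng = np.random.default_rng(0)

n, K = 23, 3                 # n - K = 20 degrees of freedom
t_crit = 2.086               # two-sided 5% critical value of t with 20 df (t table)
reps = 4000

# Fixed hypothetical design: constant plus two regressors.
X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])
XtX_inv = np.linalg.inv(X.T @ X)

def rejection_rate(beta3):
    """Share of simulated samples in which H0: β3 = 0 is rejected."""
    beta = np.array([1.0, 0.5, beta3])
    # Each row of Y is one simulated sample of y.
    Y = X @ beta + rng.normal(size=(reps, n))
    B = Y @ X @ XtX_inv                    # row r holds the OLS estimate for sample r
    resid = Y - B @ X.T
    s2 = (resid ** 2).sum(axis=1) / (n - K)
    t = B[:, 2] / np.sqrt(s2 * XtX_inv[2, 2])
    return np.mean(np.abs(t) > t_crit)

size = rejection_rate(0.0)    # H0 true: estimates P(Type I error)
power = rejection_rate(0.5)   # H0 false: estimates 1 - P(Type II error)
```

Under these assumptions `size` should land near the nominal 0.05, while `power` depends on how far the true $β_3$ lies from zero.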

Wald tests. If the hypothesis is correct, the sample discrepancy $R b - q$ should be close to zero.

Fit based tests. These measure how much $R^{2}$ falls when we impose the restrictions.

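To test a single coefficient, $H_{0} : β_{k} = β_{k}^{0}$, the statistic below has a $t$ distribution with $n - K$ degrees of freedom under the null hypothesis, where $s^{2} = e^{'} e / (n - K)$ and $S^{k k}$ is the $k$th diagonal element of $(X^{'} X)^{- 1}$: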

t_{k} = \frac{b_{k} - β_{k}^{0}}{\sqrt{s^{2} S^{k k}}}

The $1 - α$ confidence interval for $β_{k}$ follows from

Prob \{- t_{(1 - \frac{α}{2}), [n - K]}^{*} < t_{k} < + t_{(1 - \frac{α}{2}), [n - K]}^{*}\} = 1 - α .
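The $t$ ratio and the interval it implies can be computed directly from the least squares output. The sketch below uses simulated data and a tabulated critical value (2.086 for 20 degrees of freedom and $α = 0.05$). The data-generating values and the choice of coefficient are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(1)

n, K = 23, 3                  # n - K = 20 degrees of freedom
t_crit = 2.086                # t table value for 1 - α/2 = 0.975 and 20 df

# Simulated data, used only to have something to estimate.
X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])
y = X @ np.array([1.0, 0.5, 0.0]) + rng.normal(size=n)

XtX_inv = np.linalg.inv(X.T @ X)
b = XtX_inv @ X.T @ y                      # OLS estimator
e = y - X @ b                              # residuals
s2 = e @ e / (n - K)                       # s² = e'e / (n - K)

k = 1                                      # zero-based index of the coefficient tested
beta0 = 0.0                                # hypothesized value β_k^0
se_k = np.sqrt(s2 * XtX_inv[k, k])         # sqrt(s² S^{kk})
t_k = (b[k] - beta0) / se_k

# Solving -t* < t_k < t* for β_k gives the interval b_k ± t* · se_k.
ci = (b[k] - t_crit * se_k, b[k] + t_crit * se_k)
reject = abs(t_k) > t_crit
```

Rejecting $H_0$ at level $α$ is equivalent to $β_k^0$ falling outside the $1 - α$ interval, which is why `reject` and the position of `beta0` relative to `ci` always agree.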

For the joint hypothesis $R β = q$ with $J$ restrictions, the $F$ statistic is

F [J, n - K ∣ X] = \frac{(R b - q)^{'} \{R [s^{2} (X^{'} X)^{- 1}] R^{'}\}^{- 1} (R b - q)}{J}

For a single restriction ($J = 1$), the square of the $t$ statistic equals the $F$ statistic. With $\hat{q} = R b$ denoting the sample estimate of $q$,

t^{2} = \frac{(\hat{q} - q)^{2}}{Var (\hat{q} - q ∣ X)} = F [1, n - K]
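The quadratic form can be evaluated in a few lines, and the $t^{2} = F$ relationship can be checked numerically. This is a sketch on simulated data. The helper `wald_f` and the particular restrictions are hypothetical names and choices, not part of the notes.

```python
import numpy as np

def wald_f(X, y, R, q):
    """Return (F, b, s2, (X'X)^{-1}) for H0: Rβ = q using the quadratic form."""
    n, K = X.shape
    J = R.shape[0]
    XtX_inv = np.linalg.inv(X.T @ X)
    b = XtX_inv @ X.T @ y
    e = y - X @ b
    s2 = e @ e / (n - K)
    m = R @ b - q                               # sample discrepancy Rb - q
    V = R @ (s2 * XtX_inv) @ R.T                # estimated Var[Rb - q | X]
    F = float(m @ np.linalg.solve(V, m) / J)
    return F, b, s2, XtX_inv

rng = np.random.default_rng(2)
n = 40
X = np.column_stack([np.ones(n), rng.normal(size=(n, 3))])
y = X @ np.array([1.0, 0.5, 0.5, 0.0]) + rng.normal(size=n)

# Joint test (J = 2): β2 + β3 = 1 and β4 = 0.
R = np.array([[0.0, 1.0, 1.0, 0.0],
              [0.0, 0.0, 0.0, 1.0]])
q = np.array([1.0, 0.0])
F_joint, b, s2, XtX_inv = wald_f(X, y, R, q)

# Single restriction (J = 1): β4 = 0, compared with the usual t ratio.
F_single, *_ = wald_f(X, y, R[1:], q[1:])
t_4 = b[3] / np.sqrt(s2 * XtX_inv[3, 3])
```

Using `np.linalg.solve` avoids forming the inverse of the $J \times J$ matrix explicitly. `t_4 ** 2` and `F_single` agree up to floating point error.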

If $b$ and $b_{*}$ are the unrestricted and restricted least squares estimators, then

b_{*} = b + (X^{'} X)^{- 1} R^{'} [R (X^{'} X)^{- 1} R^{'}]^{- 1} [q - R b] .

Remark 1. If $b$ satisfies the restrictions, then $b_{*} = b$; if it does not, all of the coefficients change.

Remark 2. Unless $b_{*} = b$, we always have

SSE(restricted) > SSE(unrestricted)

Remark 3. Imposing the restrictions never increases the variance of the estimator:

Var [b_{*} ∣ X] = Var [b ∣ X] - \text{a nonnegative definite matrix}
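The restricted estimator and the three remarks can be checked numerically. The sketch below uses simulated data and a single hypothetical restriction ($β_2 + β_3 = 1$). For the variance comparison it relies on the standard result that, up to the factor $σ^{2}$, the difference $Var[b ∣ X] - Var[b_{*} ∣ X]$ equals $(X^{'} X)^{-1} R^{'} [R (X^{'} X)^{-1} R^{'}]^{-1} R (X^{'} X)^{-1}$.

```python
import numpy as np

rng = np.random.default_rng(3)
n = 50
X = np.column_stack([np.ones(n), rng.normal(size=(n, 3))])
y = X @ np.array([1.0, 0.4, 0.3, 0.2]) + rng.normal(size=n)

# Hypothetical restriction: β2 + β3 = 1 (J = 1).
R = np.array([[0.0, 1.0, 1.0, 0.0]])
q = np.array([1.0])

XtX_inv = np.linalg.inv(X.T @ X)
b = XtX_inv @ X.T @ y                                   # unrestricted estimator

# C = (X'X)^{-1} R' [R (X'X)^{-1} R']^{-1}
C = XtX_inv @ R.T @ np.linalg.inv(R @ XtX_inv @ R.T)
b_star = b + C @ (q - R @ b)                            # restricted estimator

sse_u = np.sum((y - X @ b) ** 2)
sse_r = np.sum((y - X @ b_star) ** 2)

# Up to the common factor σ², Var[b|X] - Var[b*|X] = C R (X'X)^{-1}.
var_gap = C @ R @ XtX_inv
eigenvalues = np.linalg.eigvalsh((var_gap + var_gap.T) / 2)
```

Here `b_star` satisfies the restriction exactly, `sse_r` is no smaller than `sse_u`, and the eigenvalues of `var_gap` are nonnegative up to rounding, which is what "nonnegative definite" means.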

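For a restriction on a single coefficient, such as excluding a variable $z$ from a regression that also contains $X$, the squared $t$ ratio on $z$ can be written in terms of the change in fit:

t_{z}^{2} = \frac{(R_{X z}^{2} - R_{X}^{2}) / 1}{(1 - R_{X z}^{2}) / (n - K)}

Here $R_{X z}^{2}$ and $R_{X}^{2}$ are the $R^{2}$ values with and without $z$, and $K$ counts the columns of $[X, z]$. The squared partial correlation between $y$ and $z$, controlling for $X$, is

r_{y z}^{* 2} = \frac{t_{z}^{2}}{t_{z}^{2} + (n - K)}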
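Both identities can be verified on simulated data. In the sketch below the helper names `ols` and `r_squared`, the sample size, and the data-generating values are all assumptions. $K$ is taken to be the number of columns of $[X, z]$, and the partial correlation is computed by correlating the residuals of $y$ on $X$ with the residuals of $z$ on $X$.

```python
import numpy as np

def ols(X, y):
    """Return coefficients and residuals from regressing y on X."""
    b, *_ = np.linalg.lstsq(X, y, rcond=None)
    return b, y - X @ b

def r_squared(X, y):
    """R² of the regression of y on X (X is assumed to contain a constant)."""
    _, e = ols(X, y)
    d = y - y.mean()
    return 1.0 - (e @ e) / (d @ d)

rng = np.random.default_rng(4)
n = 60
X = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])
z = rng.normal(size=n) + 0.5 * X[:, 1]
y = X @ np.array([1.0, 0.5, -0.5]) + 0.3 * z + rng.normal(size=n)

Xz = np.column_stack([X, z])
K = Xz.shape[1]                      # K counts the columns of [X, z]

R2_X, R2_Xz = r_squared(X, y), r_squared(Xz, y)
t2_from_fit = (R2_Xz - R2_X) / ((1.0 - R2_Xz) / (n - K))

# Direct t ratio on z in the regression of y on [X, z].
b, e = ols(Xz, y)
s2 = e @ e / (n - K)
t_z = b[-1] / np.sqrt(s2 * np.linalg.inv(Xz.T @ Xz)[-1, -1])

# Partial correlation: correlate the parts of y and z not explained by X.
_, y_res = ols(X, y)
_, z_res = ols(X, z)
r2_partial = np.corrcoef(y_res, z_res)[0, 1] ** 2
r2_from_t = t_z ** 2 / (t_z ** 2 + (n - K))
```

Up to rounding, `t_z ** 2` matches `t2_from_fit` and `r2_partial` matches `r2_from_t`.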

For a general set of $J$ restrictions, with $R_{*}^{2}$ denoting the fit of the restricted regression,

F [J, n - K] = \frac{(R^{2} - R_{*}^{2}) / J}{(1 - R^{2}) / (n - K)}
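The fit-based form can be compared with the quadratic-form (Wald) version for the same hypothesis. The sketch assumes simulated data, a model with a constant term, and hypothetical exclusion restrictions $β_3 = β_4 = 0$, so that the restricted regression drops two columns and keeps the same dependent variable.

```python
import numpy as np

def fit(X, y):
    """Return OLS coefficients, SSE and R² (X is assumed to contain a constant)."""
    b, *_ = np.linalg.lstsq(X, y, rcond=None)
    e = y - X @ b
    d = y - y.mean()
    return b, e @ e, 1.0 - (e @ e) / (d @ d)

rng = np.random.default_rng(5)
n = 45
X = np.column_stack([np.ones(n), rng.normal(size=(n, 3))])
y = X @ np.array([1.0, 0.5, 0.2, 0.0]) + rng.normal(size=n)
K = X.shape[1]

# Hypothetical exclusion restrictions β3 = β4 = 0, so J = 2.
J = 2
b, sse, R2 = fit(X, y)
_, sse_star, R2_star = fit(X[:, :2], y)     # restricted model drops the last two columns

F_fit = ((R2 - R2_star) / J) / ((1.0 - R2) / (n - K))

# Same hypothesis in the form Rβ = q, evaluated with the Wald quadratic form.
R = np.array([[0.0, 0.0, 1.0, 0.0],
              [0.0, 0.0, 0.0, 1.0]])
q = np.zeros(J)
s2 = sse / (n - K)
m = R @ b - q
V = R @ (s2 * np.linalg.inv(X.T @ X)) @ R.T
F_wald = m @ np.linalg.solve(V, m) / J
```

`F_fit` and `F_wald` coincide up to rounding here because both regressions share the same dependent variable and therefore the same total sum of squares.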

Setting $R_{*}^{2} = 0$ and $J = K - 1$ in the general statistic gives the test that all slope coefficients are jointly zero, in which case the restricted model contains only the constant term:

F [K - 1, n - K] = \frac{R^{2} / (K - 1)}{(1 - R^{2}) / (n - K)} .
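As a quick arithmetic check of this last expression, the sketch below evaluates it for made-up values ($R^{2} = 0.5$, $n = 25$, $K = 5$). The function name `overall_f` is hypothetical.

```python
def overall_f(r2, n, K):
    """F statistic for H0: all slope coefficients are zero (constant term kept)."""
    return (r2 / (K - 1)) / ((1.0 - r2) / (n - K))

# Made-up numbers: R² = 0.5 with n = 25 observations and K = 5 coefficients.
F = overall_f(0.5, 25, 5)    # (0.5 / 4) / (0.5 / 20)
```

With these inputs the numerator is 0.125 and the denominator is 0.025, so the statistic is 5 with $K - 1 = 4$ and $n - K = 20$ degrees of freedom.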