# Discrete Choices

# Models for Binary Outcomes

# Random Utility Models

Let 𝑈𝑎 and 𝑈𝑏 represents an individual’s utility of two choices:

𝑈𝑎=𝑊β𝑎+𝑍𝑎γ𝑎+ε𝑎,𝑈𝑏=𝑊β𝑏+𝑍𝑏γ𝑏+ε𝑏.

If we denote by Y=1 the consumer’s choice of alternative a , we infer from Y=1 that 𝑈𝑎>𝑈𝑏.

Prob[𝑌=1|𝑊,𝑍𝑎,𝑍𝑏]=Prob[𝑈𝑎>𝑈𝑏]=Prob[𝑋β+ε>0|𝑋].W(βaβb)+ZaγaZbγb+εaεb>0.

# A Latent Regression Model

We model the net benefit of a choice as an variable y such that

y=𝑋β+ε,

where ε has mean zero and has either a standardized logistic or normal distribution.

We do not observe y, instead, our observation is

y=1 if y>0,y=0 if y0.

Then we have

Prob[y>0|𝑋]=P𝑟𝑜𝑏[ε<𝑋β|𝑋]=𝐹(𝑋β).
  • Note that the assumptions of known variance and zero cutoff are innocent normalization.

# Why not Linear Probability Model?

Linear Probability Model:

y=𝑋β+ε.

Shortcomings:

  • We cannot constrain 𝑋β to the 0-1 interval.
  • Heterogeneity: 𝑉𝑎𝑟[ε𝑖|𝑋]=𝑥𝑖β(1𝑥𝑖β).

# Models

We want to construct a model produce predictions consistent with the underlying theory

Prob[y>0|𝑋]=P𝑟𝑜𝑏[ε<𝑋β|𝑋]=𝐹(𝑋β),

and we expect that

lim𝑋β+Prob[𝑌=1|𝑋]=1,limXβProb[𝑌=1|𝑋]=0.

The normal distribution has been commonly used, denoted as the probit model,

Prob(𝑌=1|𝑋)=Xβϕ(t)dt=Φ(Xβ).

Another commonly used model is the logit model, assuming logistic distribution,

Prob(𝑌=1|𝑋)=exp(𝑋β)1+exp(𝑋β)=Λ(Xβ).

# Marginal Effect

The probability model is

𝐸[𝑦|𝑋]=𝐹(𝑋β).

The parameters of the model are not necessarily the marginal effects

𝐸[𝑦|𝑋]𝑋=[𝑑𝐹(𝑋β)𝑑(𝑋β)]×β=𝑓(𝑋β)×β.

# Estimation and Hypothesis Test

# Estimation

Likelihood equations:

ln𝐿β=𝑖=1𝑛[𝑦𝑖𝑓𝑖𝐹𝑖+(1𝑦𝑖)𝑓𝑖(1𝐹𝑖)]𝑥𝑖=0.

Logit model

ln𝐿β=𝑖=1𝑛(yiΛ𝑖)𝑥𝑖=0.

Probit model

ln𝐿β=𝑖=1𝑛(q𝑖ϕ(𝑞𝑖𝑥𝑖β)Φ(𝑞𝑖𝑥𝑖β))𝑥𝑖=0,

where 𝑞𝑖=2𝑦𝑖1.

Goodness of Fit

Likelihood Ratio Index:

𝐿𝑅𝐼=1ln𝐿ln𝐿0

# Hypothesis Test

For a single estimator, use the t test.

For more involved restrictions

  • Wald test (unrestricted).
  • Likelihood ratio test (restricted and unrestricted).
  • Lagrange multiplier (restricted).