Multivariate Ordinary Least Squares Regression

Y_{i} = β_{0} + β_{1} X_{1 i} + β_{2} X_{2 i} + β_{3} X_{3 i} + ϵ_{i}

Omitted Variables in the regression. Refer to the document in that case.

Variance of Parameter Estimators

In this multivariate model the variance of $\hat{β_{1}}$ is:

Var (\hat{β_{1}}) = \frac{σ ^ ^{2}}{N \cdot Var ( X _{1} ) \cdot ( 1 - R _{1}^{2} )}

where

$\overset{σ}{^}^{2}$ is the variance of the regression
$R_{1}$ is the coefficient of determination in the auxiliary regression $X_{1 i} = γ_{0} + γ_{1} X_{2 i} + γ_{2} X_{3 i} + ϵ_{i}$ . It measures multicolinearity
- & It measures how much of $X_{1}$ can be explained by $X_{2}, X_{3}$
- In perfect multicolinearity $R_{1} = 1$ then $X_{2}, X_{3}$ will perfectly determine $X_{1}$ , thus $X_{1}$ is not relevant anymore.
- & Equivalently $R_{2}$ is the coefficient of determination from $X_{2 i} = γ_{0} + γ_{1} X_{1 i} + γ_{2} X_{3 i} + ε_{i}$ , etc. etc.
$\frac{1}{1 - R _{1}^{2}}$ is known as the variance inflation factor. The higher the multicollinearity, the more inflated is the variance of parameter estimators.
Multicollinearity is not a problem iff:
- it occurs only between control variables
- $Var (\hat{β_{1}})$ is small enough; i.e. $\hat{β_{1}}$ is statistically significant.
$N$ is the sample size

Coefficient of Determination

In multivariate situations (as opposed to bivariate) the coefficient of determination of the whole regression ( $R^{2}$ ) is more troublesome because

Adding more variables will only increase $R^{2}$
Thus one is incentivized to increase the number of (possibly irrelevant) independent variables.
To mitigate this, we report the adjusted $R^{2}$ values instead, with a penalty for each additional independent variables used.

Hypothesis Testing regarding Coefficients

Standardizing Coeffiefficients

Example. In the model $GDP = β_{0} + β_{1} Life Expectancy + β_{2} Literacy Rate$ , if we want to compare the effects of life expectancy and literacy, we cannot simply compare the values of $β_{1}, β_{2}$ . This is because their units are different, i.e. $β_{1}$ is in $\frac{Dollars}{Year}$ and literacy is in $\frac{Dollars}{Percentage Point}$ . Thus we need to standardize them:

\hat{β_{i}^{std}} = \frac{β _{i} ^ - E ( β _{i} ^ )}{Var ( β _{i} ^ )}

which shows “how much $Y$ increase in units of $σ_{Y}$ does one unit $σ_{X_{i}}$ increase in $X$ cause?”

Remark. Only $X_{1}$ and $X_{2}$ need be standardized to compare $\hat{β_{1}}$ and $\hat{β_{2}}$ ’s effects. $Y$ need not be standardized.

Hypothesis Testing about Coefficients

Let the model $Y_{i} = β_{0} + β_{1} X_{1, i} + β_{2} X_{2, i} + β_{3} X_{3, i} + ϵ_{i}$ . Sometimes we may want to check if $\hat{β_{1}} =^{?} \hat{β_{2}}$ , or $\hat{β_{1}} = \hat{β_{2}} =^{?} 0$ . In these cases we use a hypothesis test. Let $R_{unrestricted}^{2}$ be the $R^{2}$ of this regression. Now, before we do anything we need to…

$X_{1},X_{2}$ .

Case 1: $H_{0} : \hat{β_{1}} = \hat{β_{2}} = 0$ . Then the model under null would change to:

Y_{i} = β_{0} + β_{3} X_{3, i} + ϵ_{i}

We run regression on this new model and get $R_{restricted}^{2}$ Remark. This is not equivalent to running a t-test on $H_{A} : \hat{β_{1}} \neq = 0 \hat{\land} β_{2} \neq = 0$ because $X_{1}, X_{2}$ may be multicollinear.

Case 2: $H_{0} : \hat{β_{1}} = \hat{β_{2}}$ . Then the model under null would change to:

Y_{i} = β_{0} + β_{1} (X_{1, i} + X_{2, i}) + β_{3} X_{3, i}

We run regression on this to also get $R_{restricted}^{2}$ .

We can observe that in both cases, $R_{unrestricted}^{2} > R_{restricted}^{2}$ , always, because “restricting” the model will lead only to less (coefficient of-) determination. Now, the bigger this difference is, the more likely that the null is false. We formalize this using the $F$ -test:

def. F-Test. For the F-statistic defined as:

F_{q, N - k} : = \frac{( R _{unres.}^{2} - R _{restr.}^{2} ) / q}{( 1 - R _{unres}^{2} ) / ( N - k )}

where

$q$ is “how many equal signs in null hypothesis”
$k$ is the degrees of freedom (=number of coefficients in the _un_restricted model) Then:

{H_{0} H_{1} else if F > K

where $K$ is the critical value. The critical values are:

$K = 3.00$ in case 1 ( $H_{0} : \hat{β_{1}} = \hat{β_{2}} = 0$ )
$K = 3.84$ in case 2 ( $H_{0} : \hat{β_{1}} \neq = \hat{β_{2}}$ )

PK's Notes

Explorer

Multivariate Ordinary Least Squares Regression

Variance of Parameter Estimators

Coefficient of Determination

Hypothesis Testing regarding Coefficients

Standardizing Coeffiefficients

Hypothesis Testing about Coefficients

Graph View

Table of Contents

Backlinks

PK's Notes

Explorer

Multivariate Ordinary Least Squares Regression

Variance of Parameter Estimators §

Coefficient of Determination §

Hypothesis Testing regarding Coefficients §

Standardizing Coeffiefficients §

Hypothesis Testing about Coefficients §

Graph View

Table of Contents

Backlinks

Variance of Parameter Estimators

Coefficient of Determination

Hypothesis Testing regarding Coefficients

Standardizing Coeffiefficients

Hypothesis Testing about Coefficients