This course is concerned with theory and application of linear regression methods, including an examination of the classical regression model and the statistical properties of the estimator. The effect of violations of the classical assumptions are considered, and appropriate estimation methods are introduced. This course is the first of a two-course sequence. At course completion, a successful student will:
How do we handle random processes? We can define a random variable as a measurable function defined on a probability space. Bievens(2004)
See set theory. Kolmorgorov gives us the axioms of probability.
For example, in a coin toss, the probability space is heads and tails. If x is a function over the probability space, we can say that x(heads) equals one and x(tails) equals zero. So the probability that x is one equals the probability of heads, and the probability that x is zero equals the probability of tails, and they are disjoint probabilities. So if the probability of heads is Ph, the probability of tails is 1-Ph. (Note that this example does not assume that you have a fair coin, though you could.)
positive semidefinite (PSD), positive definite (PD), quadratic forms, symettric, idempotent, diagonal, block diagonal,
Matix derivatives
The Probability Distribution Function is where
By having a set of ys, assumed to be dependent on a set of xs, we solve for some constant . (Where k is the number of observations, and n is the number of predictor or independent variables, y is a kx1 vector and x is a kxn matrix multiplied by an nx1 vector which relates y and x.)
To solve for , which will give us an expected value of that we will call beta hat we use this linear operation:
This can be derived from OLS, ordinary least squares, which minimizes the squared residuals. Since , then . The residuals are then the estimated errors , found from the estimated model: The squared residuals are then . By deriving this expression with respect to and solving for gives the OLS estimator.
Then or x times beta hat equals y hat, where y hat is our expected value for y given x and our calculated value of beta, beta hat.
Now the difference between the predicted values of y and the actual values of y are given by: where is residual values, sometimes called error.
Matrix algebra,
Properties of Ordinary Least Squares (OLS estimator beta hat is unbiased for beta, and the Covariance Matrix for beta hat is ), Classical Normal, The Information Matrix, Chi Square Distribution, The relationship between e and epsilon, The Maximum Likelihood Estimator of the variance is biased, Distribution of the variance under normality,
regression,
principles of estimation and testing,
stationary time series models,
limited dependent variable models,
longitudinal (panel) data models,
generalized methods of moments,
instrumental variable models,
non-stationarity, stochastic trends,
co-integration,
Omission of relevant explanatory variables
Inclusion of irrelevant explanatory variables
Squared explanatory variables
Cubed explanatory variables
Rarely is perfect multicollinearity a problem, but multicollinearity, nearly linear relationships between explanatory variables, often is. Multicollinearity causes an increase in the variance of ß estimations, which in turn decreases the probability of rejecting the null hypotheses that such ßs are insignificant.
How to find out if two or more variables are correlated:
- Correlation matrix
- Variance inflation factor (VIF)
Recommended texts