Syllabus.
Statistics 201b: Regression Analysis.
Prof. Rick Paik Schoenberg.
Lectures: MW 3:00-4:20pm, MS 5203.
Office hours: Mondays, 1:00-2:20pm, MS 8965. Or by email appointment.
email: frederic@stat.ucla.edu
Course webpage:
http://www.stat.ucla.edu/~frederic/201b/W07
Text: "Data Analysis and Graphics Using R" by John Maindonald and John Braun.
Description: Applied regression analysis, with emphasis on general
linear model (e.g., multiple regression) and generalized linear model
(e.g., logistic regression). Special attention to modern extensions
of regression, including regression diagnostics,
graphical procedures, and bootstrapping for statistical inference.
Grading:
Class participation (5%).
Homework (20%).
Written Project (25%).
Midterm. (20%).
Final exam. (30%).
The midterm is Monday, Feb 26, 2007, 3:00-4:20pm (in class).
The final exam is Wednesday, March 21, 2007, 3:00pm-6:00pm.
Project due dates will be announced in class.
Homework will be due each Wednesday, beginning with HW1 which is due
Wednesday Jan 17.
Homeworks must be handed in at the beginning of class, or may be slipped under
my office door (MS 8965) any time before class. Each homework assignment
is graded out of 10 points.
Homeworks handed in between 3:05 and 3:10 will be given a one-point deduction.
Those handed in between 3:10 and 3:20 will be given a two-point deduction.
Homeworks handed in between 3:20 and 4:20 will be given a three-point deduction.
Homeworks submitted after 4:20pm will not be accepted. Homeworks and projects
must be submitted in hard copy, rather than by email or fax.
Rough Outline:
Week 1: Introductory material. Chapters 1-2.
Week 2: Univariate modeling and inference. Chapters 3-4.
Week 3: Regression with one predictor. Chapter 5.
Week 4: Multiple linear regression. Chapter 6.
Week 5: Linear model. Chapter 7.
Week 6: Logistic regression and generalized linear models. Chapter 8.
Week 7: Review and midterm.
Week 8: Multi-level models, time series, and repeated measures. Chapter 9.
Week 9: Tree-based classification and regression. Chapter 10.
Week 10: Review.
Homework 1 (due Wednesday Jan 17):
1st edition: p26 #2, p27 #4, p50 #1, p51 #2, p51 #3, p51 #5.
Homework 2 (due Wednesday Jan 24):
1st edition: p103 #2, p103 #3, p104 #6, p105 #11, p132 #1, p132 #2
Homework 3 (due Wednesday Feb 7):
2nd edition: p243 #2, p244 #7, p244 #8, p245 #9,
p245 #14 (use kernel regression).
In the 1st edition, these are: the problem written below, and p195 #6, p195 #7,
p195 #8, p195 #12 (use kernel regression).
p243 #2 from the 2nd edition is the following:
Use anova() to compare the two models:
roller.lm = lm(depression~weight, data=roller)
roller.lm2 = lm(depression~weight+I(weight^2), data=roller)
Is there any justification for including the squared term?
Homework 4 (due Wednesday Feb 14):
2nd edition: p284 #1, #3.
On #1:
(a) Print output (coefficients, SEs, etc.)
(b) Interpret coefficients in 1-2 sentences.
(c) Plot fitted curve: logit(p) vs conc.
1st edition: p223 #1, #3(a), #3(c). See instructions above for #1.