STats 110A  Homework 7
Due Friday, May 28

1.  Let X = the number of tosses of a coin until a Head first appears.  For example, you toss a coin and get (in this order) TTH, then X=3.
a) Find the pdf of X if the probability of a head on any given toss is p and assume p < 1.  (Hint, P(X=0)=0, P(X=1) = p).  This is called the geometric distribution.
b) What's the largest possible value of X?
c) What's the expected number of tosses before a H appears?  (Hint: Look up geometric sums in your calculus book.  If it only gives you the formula for finite sums (up to n), then take the limit as n goes to infinity.  (What is the limit of p^n as n goes to infinity if 0<p <1?)

2. For sufficiently large values of n, the binomial pdf can be approximated by a normal distribution with mean np and variance np(1-p).  For p = .5, an n of 100 is more than enough to make this a valid approximation.  Psally the Psychic claims to be able to guess whether a coin comes up heads or tails.  She does not claim perfect accuracy, but still claims to have a "sixth sense" that helps her do this.  If she is a fraud, then she just guesses like all of us do.
a) What's the approximate probability that Psally will get 60 or more coin flips correct in 100 tosses if she is just guessing?
b) Suppose its National Psychic Day and 10,000 people all participate in this experiment in which they guess coin flips in 100 flips of a fair coin.  About how many people do we expect to get 60 or more correct?

NOTE: THERE WERE CORRECTIONS MADE TO NUMBER 3 ON WED, JUNE 26.
3. In the last lab, you learned that the command (sample weight 6 t) would produce a random sample,WITH replacement, of size 6 from the weight variable in class. (Assuming you had downloaded this data.)  (If you omit the "t", then it samples without replacement.)
a) What's the average and SD of the weight data?
b) Assume for the moment that the weights follow a normal distribution? Imagine selecting a person at random. What would be the probability of being 12 pounds heavier OR MORE than the mean?  What is the actual probability for our data?
c) Suppose you select 5 people at random (with replacement).  What's the sampling distribution of their average WEIGHT?
d) What's the expected value for the average WEIGHT of these 5 people?
e) What's the Standard Error (SE) for the average?
f) What's the approximate probability that the average will be 12 pounds heavier OR MORE than the mean?
g) Select 5 people with replacement and calculate the average weight.  Repeat this 10 times.  What's the closest you get to the mean? What's the farthest?  What's the average of your averages?  what's the standard deviation of your averages?  How do these compare to the theoretical values you calculated above?

4. Select 10 people with replacement from the classdata and find their height.  Assuming heights follow a normal distribution with a standard deviation of 3 inches,  what's a 95% confidence interval for the mean height in the class?  Does it cover the true mean?  In this problem, we're treating the class as the population, and because we sample with replacement its as if this population were infinitely large with the probability distribution given exactly by the distribution of values in this class.  So the distribution of the population is not normal.  Explain why this might not matter.

5. Same question as 4, but now we make the more realistic assumption that the standard deviation is unknown.
 

6. Download the ARC data set rat.lsp from the data page.  The data represent a sample of rats, which we can assume were randomly sampled from a particular species of rat.
a) What's your estimate of the mean weight of  this species of rat?
b) What's your estimate of the SD of weights for this species of rat?
c) Calculate a 90% confidence interval for the mean weight.
d) Do the data look normal?  If not, how does this affect your confidence interval?

7. Download the blood pressure data from the data page.  These data represent systolic blood pressure of 15 patients before and after receiving medication designed to lower blood pressure.  Calculate the difference (after minus before) for all 15 patients.  (You do this by choosing "Add a variate" under the "Dataset" menu, and then typing "diff = after - before".  You don't have to call it "diff", of course.)
Is there evidence that the drug lowers blood pressure?  Or could these changes be explained just by chance variation?  State your assumptions, give evidence to support them (and if you can't, then state the possible effect on your conclusion should the assumptions not hold), and explain why you reached the conclusion you did.