Number of applications received using the variables in the College data set.

1. Predict the number of applications received using the other variables in the College data set. a. Split the data set into a training set and a test set. b. Fit a linear model using least squares on the training set, and report the test error obtained. c. Fit a ridge regression model on the training set, with A chosen by cross-validation. d.Report the test error obtained. 2. Consider the Boston housing data set, from the MASS library. a. Based on this data set, provide an estimate for the population mean of medv. Call this estimate ft . b. Provide an estimate of the standard error of 1.1 . Interpret this result. Hint: We can compute the standard C7707* of the sample mean by dividing the sample standard deviation by the square root of the number of observations. c. Now estimate the standard error of 12 using the bootstrap. How does this compare to your answer from (b)? d. Based on your bootstrap estimate from (c), provide a 95% confidence interval for the mean of medv. Compare it to the results obtained using ttest(BostonSmedv). Hint: You can approximate a 95% confidence interval using the formula D.2— 25E0.0 + 25E0.01                                                                                                                                                                                          

Sample Solution