Analyze the dataset and answer the following questions to help understand the effects of different weather variables on the demand of rental bikes in the city. These are the variables that the file contains:
Variable: Type of data: Units:
Date Date
Rented Bike Count Integer
Hour Integer
Temperature Continuous C
Humidity Integer %
Wind speed Continuous m/s
Visibility Integer 10m
Dew point temperature Continuous C
Solar Radiation Continuous Mj/m2
Rainfall Integer mm
Snowfall Integer cm
Seasons Categorical
Holiday Binary
Functioning Day Binary
Questions:
- Descriptive Statistics and Distributions:
§ Describe the following variables: Temperature, Humidity, Wind speed. [10%]
§ Represent these variables using at least two types of charts and discuss their distributions/frequencies. [15%]
- Linear Regression:
§ Perform a linear regression between Rented Bike Count and another quantitative variable of your choice. [10%]
§ Discuss the significance and the strength of the relationship between them. Interpret the results. [10%]
§ Represent it using a chart. [5%]
- Multiple Regression:
§ Perform a multiple regression analysis to identify the relationships between Rented Bike Count and all the other quantitative variables of the dataset. Discuss the results at a level of significance of α=5%. [15%]
§ What are the coefficients for each variable? Interpret the results. [10%]
- Predictive Modeling:
§ Create a linear regression equation to predict Rented Bike Count. [15%]
§ Using the equation, predict the number of Rented Bike Count on the 2/12/2017 at 17:00. [5%]