Data validation

An artificial data set consistingof 1,000 0bservations and one target variable Y together with other 10 variables namely X1 to X10 was partitioned into two data sets first. That is training set (50%) and temp set (50%). The temp set was then further split into two namely validation ste (30%) and test set (20%). Now we got our three segments namely training set (trset), validation set (vset) and test set (ttset). Develop a multiple regression model based on training data set then test the model using validation data to test how good or bad the model is.

Sample Solution