Why do I need to split my data set?

The data set of molecules and their experimentally determined property values is split into three subsets: a training set, used to train individual models with each modeling technique; a validation set, used to compare the performance of those models and select the best one; and an independent test set, used to confirm the predictive power of the final model. Because the test set plays no part in training or model selection, its results give an unbiased estimate of how the final model will perform on new molecules, which supports the development of a robust model.
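
As an illustration only, a three-way split could be sketched in Python as follows. The function name `three_way_split`, the 70/15/15 proportions, the random shuffle, and the sample records are all assumptions made for this example; the text above does not say how the subsets are actually chosen or how large they are.

```python
import random


def three_way_split(records, train_frac=0.7, val_frac=0.15, seed=0):
    """Shuffle records and partition them into training, validation and test sets.

    The fractions are illustrative defaults (an assumption), not values from the source.
    Whatever is left after the training and validation sets becomes the test set.
    """
    if not 0 < train_frac < 1 or not 0 < val_frac < 1 or train_frac + val_frac >= 1:
        raise ValueError("fractions must be positive and leave room for a test set")

    shuffled = list(records)               # copy so the caller's data is untouched
    random.Random(seed).shuffle(shuffled)  # seeded so the split is reproducible

    n_train = int(len(shuffled) * train_frac)
    n_val = int(len(shuffled) * val_frac)

    train = shuffled[:n_train]
    validation = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]
    return train, validation, test


# Hypothetical data: (molecule identifier, measured property value) pairs.
molecules = [(f"mol_{i}", float(i)) for i in range(20)]
train, validation, test = three_way_split(molecules)
print(len(train), len(validation), len(test))
```

In this sketch every molecule lands in exactly one subset, so the test set stays independent of both training and model selection. A simple random shuffle is only one option; other selection schemes may suit a given data set better.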
