1

I divided my dataset into Test and Validation (50-50 split).

I ran glm function (link=binomial) on Test dataset and got the parameter estimates.

How do I score the Validation dataset based on these parameter estimates (beta) that I got from Test dataset. I know it has something to do with apply () but I am not sure. please advise.

bison2178
  • 487

1 Answers1

1

Split sample validation is highly unstable if $n < 20000$ and is not recommended. If you do have an enormous test sample you can use the R rms package val.prob to do the validation and its help file will give you some pointers. But better to look at the bootstrap and the rms validate and calibrate functions.

Frank Harrell
  • 91,879
  • 6
  • 178
  • 397