Work with predictor values not present in train dataset

Question

I have a scenario where some of the values for predictors are not present in the train dataset, but are present in the test dataset. How do we handle this in regression?

Here is what I have tried:

f1: Test dataset s1: Train dataset

f1[,c(328:1364)] <- sapply(f1[,c(328:1364)],as.character) 

lapply(c(328:1364),function(x) {
y <- x+14
f1[,x][which(!(f1[,x] %in% s1[,y]))] <- ""
})

While this executes successfully, the values in Test are still retained. Please advise.

Thanks, Sam

Any idea why this is not working? – Sam Mar 23 '15 at 05:28 — Sam, Mar 23 '15 at 05:28

score 0 · Answer 1 · answered Mar 24 '15 at 04:07

0

I used for loop and it worked.

for(x in 328:1364)
{
y <- x+14
f1[,x][which(!(f1[,x] %in% s1[,y]))] <- ""
}

answered Mar 24 '15 at 04:07

Sam

1

Work with predictor values not present in train dataset

f1: Test dataset s1: Train dataset

1 Answers1