10

I have a whole column of numbers that include comma separators at the thousands. When I try to create a numeric column out of them, anything over 999 becomes NA.

I used cbind:

df <- cbind(df, var2 = as.numeric(as.character(df$var1)))

and wound up with:

        var1  var2
1   2,518.50    NA
2   2,518.50    NA
3   5,018.50    NA
4   4,018.50    NA
5  10,018.50    NA
6     318.50 318.5
7   2,518.50    NA
8   3,518.50    NA
9   7,518.50    NA
10  1,018.50    NA

Is there a way to strip the commas or tell as.numeric how to handle them?

Amanda
  • 10,799
  • 17
  • 59
  • 87

2 Answers2

15

If you are trying to add a new column var2 to df, you can use the following

  df$var2 <- as.numeric(gsub(",", "", as.character(df$var1)))
Ricardo Saporta
  • 52,793
  • 14
  • 136
  • 168
4

Use as.numeric(gsub(",", "", df$var1)).

You want to use gsub as sub will only replace the first comma.

Erik Shilts
  • 4,231
  • 2
  • 24
  • 47