Assigning unique variable from a data.frame

Question

This is a similiar question to this but my output results are different.

Take the data:

example <- data.frame(var1 = c(2,3,3,2,4,5), 
                  var2 = c(2,3,5,4,2,5), 
                  var3 = c(3,3,4,3,4,5))

Now I want to create example$Identity which take a value from 1:x for each unique var1 value

I have used

example$Identity <- apply(example[,1], 2, function(x)(unique(x)))

But I am not familiar with correct formatting function()

The output of example$Identity should be 1,2,2,1,3,4

Do you need `1:n` groups based on `var1` only? Does this work for you: `as.numeric(as.factor(example$var1))`? — zx8754, May 19 '15 at 11:04
Yes, you answered just before the proposed answer, do you want to write and i will select and close the question — lukeg, May 19 '15 at 11:07

Jaap · Accepted Answer · 2015-05-19T11:11:07.610

2

This:

example$Identity <- as.numeric(as.factor(example$var1))

will give you the desired result:

> example$Identity
[1] 1 2 2 1 3 4

By wrapping the as.factor in as.numeric it starts counting the factor levels with 1 and so on.

edited May 19 '15 at 11:11

answered May 19 '15 at 11:04

Jaap

score 2 · Answer 2 · answered May 19 '15 at 13:48

2

Or you can use match

example$Identity <- with(example, match(var1, unique(var1)))

If the values are sorted as in the vector, findInterval can be also used

findInterval(example$var1, unique(example$var1))
#[1] 1 2 2 1 3 4

answered May 19 '15 at 13:48

akrun

nice to see the flexibility and diversity of the R language +1 – Jaap May 19 '15 at 13:53
@Jaap Thanks, my first option would be also `factor(` – akrun May 19 '15 at 13:54

2 Answers2