andrew has a point which makes my solution wrong. you'd have to change the factors to numerics and I'm not sure what would happen when you did that. if you want to send a sample file of your data, that would be best but andrew's suggestion may
work right off the bat.


On Sun, Dec 14, 2008 at  9:19 PM, andrew wrote:

if the records are in the file dupIDs.txt, then when you read them in,
the IDs become factors.  Coercing them to numeric gets them to assign
a unique number to each factor.

So, you could try the following:

dupIDs <- read.table("dupIDs.txt", header = T)
dupIDs$ID2 <- cummax(as.numeric(dupIDs$ID)-1)

dupIDs
  ID record ID2
1  1     20   1
2  .     30   1
3  .     25   1
4  2     26   2
5  .     15   2
6  3     21   3

HTH,

Andrew.


On Dec 15, 12:56 pm, "Zhixin Liu" <z...@efs.mq.edu.au> wrote:
Hi R helpers,

If I have a dataset looks like:
ID   record
1        20
.         30
.         25
2         26
.         15
3         21
4.....................

And I want it becomes
ID   record
1        20
1        30
1        25
2         26
2        15
3         21
4.....................

That is, I have to duplicate IDs for those with multiple records. I am wondering it is possible to be done in R, and I am grateful if you would like to show me the direction.

Many thanks!

Zhixin

______________________________________________
r-h...@r-project.org mailing listhttps://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guidehttp://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to