On Aug 24, 2011, at 5:15 PM, Kathleen Rollet wrote:
Dear R users,

I am encoutering the following problem: I have a dataset with a 'unique_id' and different 'visit_date' (formatted as.Date, "%d/%m/ %Y") per unique_id. I would like to create a new variable with the most recent date of visit per unique_id as shown below.
That should not result in what is below unless you have changes  
something in options() forcing a different data output format. (Is  
that even possible?)
unique_id visit_date last_visit_date
1  01/06/2010  01/06/2011
1  01/01/2011  01/06/2011
1  01/06/2011  01/06/2011
2  01/01/2009  01/07/2011
2  01/06/2009  01/07/2011
2  01/06/2010  01/07/2011
2  01/01/2011  01/07/2011
2  01/07/2011  01/07/2011
3  01/01/2008  01/01/2008
4  01/01/2009  01/01/2010
4  01/01/2010  01/01/2010

Read it in as dfrm named "dat" with:
colClasses=c("numeric", "character", "character")

Then:

dat$visit_date <-as.Date(dat$visit_date, format="%d/%m/%Y", origin="1970-01-01") dat$last_visit_date <-as.Date(dat$last_visit_date, format="%d/%m/%Y", origin="1970-01-01")
I know the coding to easily do this in Stata, SAS, and Excel but I cannot find how to do it in R. I try multiple function such as tapply( ), ave( ), ddply ( ), and transform ( ) after looking into previous postings. The codes are running but only NA values are generated or I get error messages that the replacement has less row than the data has (there are about 1000 unique_id and over 4000 rows in my dataset presently).
The 'ave' function should be able to do it. It returns a vector as  
long as the dataframe has rows.
You are asked to post your failures as well as reproducible code which  
is best produced with dput(). (This apples doubly so when you choose  
non-standard formats for Date objects.)
Please read:
?dput
?ave

Worked example:

dat$most_recent<- format(ave(dat$visit_date, dat$unique_id, FUN=max), format="%d/%m/%Y")
dat

NOTE: that last column is not an R date but rather a character vector.



I would greatly appreciate if someone could help me.

Thank you!

Kathleen R.
Epidemiologist
Montreal, QC, Canada                                    
        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
David Winsemius, MD
West Hartford, CT

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to