Thank you Phil. The only problem with the code is when the 2 y values are 
equal, 
it should default to "right". Your code would default to the first maximum. But 
it's ok, I see the logic here and can figure it out. I wondered whether there 
is 
some one-line thing that can do this, I guess not possible.

Thank you

John




________________________________
From: Phil Spector <spec...@stat.berkeley.edu>

Cc: r-help@r-project.org
Sent: Fri, October 1, 2010 10:58:55 AM
Subject: Re: [R] add a new column to data frame

Here are two ways:

dat0 = subset(dat,time==0)

one = as.data.frame(as.table(by(dat0,dat0$id,
                     function(x)as.character(x$mode)[which.max(x$y)])))
names(one) = c('id','type')
merge(dat,one)

two = sapply(split(dat0,dat0$id),
              function(x)as.character(x$mode)[which.max(x$y)])
merge(dat,data.frame(id=names(two),type=two))


                    - Phil Spector
                     Statistical Computing Facility
                     Department of Statistics
                     UC Berkeley
                    spec...@stat.berkeley.edu




On Fri, 1 Oct 2010, array chip wrote:

> Hi, I am wondering if anyone can propose a simple/best way to do the 
following:
>
> Let's say I have a data frame
>
> dat <-
>cbind(expand.grid(mode=c('right','left'),time=0:3,id=c('p1','p2','p3')),y=c(3,5,rep(4,6),6,2,rep(3,6),4,4,rep(2,6)))
>)
>
>
> dat
>    mode time id y
> 1  right    0 p1 3
> 2   left    0 p1 5
> 3  right    1 p1 4
> 4   left    1 p1 4
> 5  right    2 p1 4
> 6   left    2 p1 4
> 7  right    0 p2 6
> 8   left    0 p2 2
> 9  right    1 p2 3
> 10  left    1 p2 3
> 11 right    2 p2 3
> 12  left    2 p2 3
> 13 right    0 p3 4
> 14  left    0 p3 4
> 15 right    1 p3 2
> 16  left    1 p3 2
> 17 right    2 p3 2
> 18  left    2 p3 2
>
> Now I want to add a new column "type" to this data frame with values of either
> "left" or "right" for each "id" based on the following logic:
>
> For each "id", the value of "type" column is the value of "mode" for which the
> value of "y" is the maximum of "y" based on time=0. For example for id=p1, the
> value of "type" is "left" because for the 2 "y" values (3 & 5) based on 
time=0,
> mode=left has the bigger "y" value (5). But if the 2 "y" values are the same 
>for
> "mode", then let type=right.
>
> In the end the new data frame is:
>
>    mode time id y  type
> 1  right    0 p1 3  left
> 2   left    0 p1 5  left
> 3  right    1 p1 4  left
> 4   left    1 p1 4  left
> 5  right    2 p1 4  left
> 6   left    2 p1 4  left
> 7  right    0 p2 6 right
> 8   left    0 p2 2 right
> 9  right    1 p2 3 right
> 10  left    1 p2 3 right
> 11 right    2 p2 3 right
> 12  left    2 p2 3 right
> 13 right    0 p3 4 right
> 14  left    0 p3 4 right
> 15 right    1 p3 2 right
> 16  left    1 p3 2 right
> 17 right    2 p3 2 right
> 18  left    2 p3 2 right
>
> Any suggestions would be appreciated.
>
> John
>
>
>
>     [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help@r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>



      
        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to