Hello,

I believe the following solves your problem. It's a bit more complicated but with the sample dataset you've provided the result is as wished.



tmp <- lapply(split(subz, subz$time), function(x) {
        i1 <- which(as.character(x$fix) == "noon")[1]
        i2 <- which(as.character(x$fix) == "midnight")
        x$day[i2] <- x$day[i1]
        x})
names(tmp) <- NULL
result <- do.call(rbind, tmp)


Hope this helps,

Rui Barradas

Em 25-03-2013 14:15, zuzana zajkova escreveu:
Hi Rui,

thank you for your code, but unfortunately it doesn't work correctly. What
I got is this:


subz
          jul                time    dtime      fix    ddawn    ddusk day
101608 15006 2011-02-01 19:14:49 19.24694     noon 7.916667 19.88333   1
101609 15006 2011-02-01 19:24:49 19.41361 midnight 7.916667 19.56667   1
101610 15006 2011-02-01 19:24:49 19.41361     noon 7.916667 19.88333   1
101611 15006 2011-02-01 19:34:49 19.58028 midnight 7.916667 19.56667   1

101612 15006 2011-02-01 19:34:49 19.58028     noon 7.916667 19.88333   1
101613 15006 2011-02-01 19:44:49 19.74694 midnight 7.916667 19.56667   1
101614 15006 2011-02-01 19:44:49 19.74694     noon 7.916667 19.88333   1
  101615 15006 2011-02-01 19:54:49 19.91361 midnight 7.916667 19.56667   1

101616 15006 2011-02-01 19:54:49 19.91361     noon 7.916667 19.88333   0
101617 15006 2011-02-01 20:04:49 20.08028 midnight 7.916667 19.56667   0
101618 15006 2011-02-01 20:04:49 20.08028     noon 7.916667 19.88333   0

Where "day" for "time" 19:54:49 for midnight is 1 and for noon is 0. There
are supposed to be 0 both (as "dtime" 19.91361 > "ddusk" for noon 19.88333)
Probably the problem would be adding 1 to he index in
subz$day[idx + 1] <- subz$day[idx]
So far, I haven't found solution...

Zuzana



On 22 March 2013 20:01, Rui Barradas <ruipbarra...@sapo.pt> wrote:

Hello,

Try the following.


idx <- which(subz$fix == "noon")
if(idx[length(idx)] == nrow(subz)) idx <- idx[-length(idx)]
subz$day[idx + 1] <- subz$day[idx]


Hope this helps,

Rui Barradas

Em 22-03-2013 18:18, zuzana zajkova escreveu:

Hi,

I would appreciate if somebody could help me with this small issue...
I have a dataframe like this (originaly has more than 100 000 rows):

  subz

           jul                time    dtime      fix    ddawn    ddusk day
101608 15006 2011-02-01 19:14:49 19.24694     noon 7.916667 19.88333   1
101609 15006 2011-02-01 19:24:49 19.41361 midnight 7.916667 19.56667   1
101610 15006 2011-02-01 19:24:49 19.41361     noon 7.916667 19.88333   1
101611 15006 2011-02-01 19:34:49 19.58028 midnight 7.916667 19.56667   0
101612 15006 2011-02-01 19:34:49 19.58028     noon 7.916667 19.88333   1
101613 15006 2011-02-01 19:44:49 19.74694 midnight 7.916667 19.56667   0
101614 15006 2011-02-01 19:44:49 19.74694     noon 7.916667 19.88333   1
101615 15006 2011-02-01 19:54:49 19.91361 midnight 7.916667 19.56667   0
101616 15006 2011-02-01 19:54:49 19.91361     noon 7.916667 19.88333   0
101617 15006 2011-02-01 20:04:49 20.08028 midnight 7.916667 19.56667   0
101618 15006 2011-02-01 20:04:49 20.08028     noon 7.916667 19.88333   0

  dput(subz)

structure(list(jul = c(15006, 15006, 15006, 15006, 15006, 15006,
15006, 15006, 15006, 15006, 15006), time = structure(c(1296587689,
1296588289, 1296588289, 1296588889, 1296588889, 1296589489, 1296589489,
1296590089, 1296590089, 1296590689, 1296590689), class = c("POSIXct",
"POSIXt"), tzone = "GMT"), dtime = c(19.2469444444444, 19.4136111111111,
19.4136111111111, 19.5802777777778, 19.5802777777778, 19.7469444444444,
19.7469444444444, 19.9136111111111, 19.9136111111111, 20.0802777777778,
20.0802777777778), fix = structure(c(2L, 1L, 2L, 1L, 2L, 1L,
2L, 1L, 2L, 1L, 2L), .Label = c("midnight", "noon"), class = "factor"),
      ddawn = c(7.91666666666667, 7.91666666666667, 7.91666666666667,
      7.91666666666667, 7.91666666666667, 7.91666666666667,
7.91666666666667,
      7.91666666666667, 7.91666666666667, 7.91666666666667,
7.91666666666667
      ), ddusk = c(19.8833333333333, 19.5666666666667, 19.8833333333333,
      19.5666666666667, 19.8833333333333, 19.5666666666667,
19.8833333333333,
      19.5666666666667, 19.8833333333333, 19.5666666666667,
19.8833333333333
      ), day = c(1, 1, 1, 0, 1, 0, 1, 0, 0, 0, 0)), .Names = c("jul",
"time", "dtime", "fix", "ddawn", "ddusk", "day"), row.names =
101608:101618, class = "data.frame")

where "day" is calculated as

subz$day <- ifelse( subz$dtime > subz$ddusk | subz$dtime < subz$ddawn, 0,
1
)

The way I would like to calculate "day" is this
- for the same "time",  the "day" is calculated for "noon" as mentioned
above but  for "midnight" is just copying the same value as for "noon".
So for the same "time" the "day" value should be the same for "noon" and
"midnight".
Something like this:

    jul                time    dtime      fix    ddawn    ddusk day
101608 15006 2011-02-01 19:14:49 19.24694     noon 7.916667 19.88333   1
101609 15006 2011-02-01 19:24:49 19.41361 midnight 7.916667 19.56667   1
101610 15006 2011-02-01 19:24:49 19.41361     noon 7.916667 19.88333   1
101611 15006 2011-02-01 19:34:49 19.58028 midnight 7.916667 19.56667   1
101612 15006 2011-02-01 19:34:49 19.58028     noon 7.916667 19.88333   1
101613 15006 2011-02-01 19:44:49 19.74694 midnight 7.916667 19.56667   1
101614 15006 2011-02-01 19:44:49 19.74694     noon 7.916667 19.88333   1
101615 15006 2011-02-01 19:54:49 19.91361 midnight 7.916667 19.56667   0
101616 15006 2011-02-01 19:54:49 19.91361     noon 7.916667 19.88333   0
101617 15006 2011-02-01 20:04:49 20.08028 midnight 7.916667 19.56667   0
101618 15006 2011-02-01 20:04:49 20.08028     noon 7.916667 19.88333   0

Where I get stuck, is I don't know how to get the value for "midnight".

Any suggestion is welcome. Thanks

Zuzana

         [[alternative HTML version deleted]]

______________________________**________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/**listinfo/r-help<https://stat.ethz.ch/mailman/listinfo/r-help>
PLEASE do read the posting guide http://www.R-project.org/**
posting-guide.html <http://www.R-project.org/posting-guide.html>
and provide commented, minimal, self-contained, reproducible code.




______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to