Hi,

I need a bit of guidance with the sapply function. I've read the help page, but am still a bit unsure how to use it.

I have a large data frame with about 100 columns and 30,000 rows. One of the columns is "group" of which there are about 2,000 distinct "groups".

I want to normalize (sum to 1) one of my variables per-group.

Normally, I would just write a huge "for each" loop, but have read that is hugely inefficient with R.

The old way would be (just an example, syntax might not be perfect):

for (group in data$group){
    for (score in data[data$group == group]){
        new_score <- score / sum(data$score[data$group==group])
    }
}

How would I simplify this with sapply?

Thanks!

--
Noah

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to