On 11-01-27 14:58, Louise Mair wrote:
Hello,
I am working with a dataset which essentially has only one column - a list of
distances in metres, accurate to several decimal places. eg
distance
1000
6403.124
1000
1414.214
1414.214
1000
I want to organise this into a frequency table, grouping into categories of 0
- 999, 1000 - 1999, 2000-2999 etc. I'd also like the rows where there are no
data points in that category to contain 0, in order to be able to plot a
histrogram with a linear x axis, and to statistically analyse differences
between datasets.
I have tried table() which doesn't group the data the way I'd like it, I've
also tried cut() but couldn't make it work. Ideally I'd like the output to
look something like this...
distance frequency
0-999 0
1000-1999 3
2000-2999 0
...
Could be a starting point for testing:
x <- abs(rnorm(500, 5000, 3000)
br <- seq(0, 20000, 1000)
summary(cut(x, br, labels=br[-1], include.lowest=T,
ordered_result=T))
Look at ?cut to find out more. Good luck, *S*
Any suggestions that are an improvement on doing it manually please?
Thanks in advance!
Louise
______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.
--
Sascha Vieweg, saschav...@gmail.com
______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.