Hello,

I have a bunch of files containing 300 data points each with values from 0
to 1 which also sum to 1 (I don't think  the last element is relevant
though). In addition, each data point is annotated as an "a" or a "b".

I would like to know in which files (if any) the data is uniformly
distributed.

I used Google and found out that a Kolmogorov-Smirnov or a Chi-square
goodness-of-fit test could be used. Then I looked up ?kolmogorov and found
"ks.test", but the example there is for the normal distribution and I am not
sure how to adapt it for the uniform distribution. I did ?runif and read
about the uniform distribution but it doesn't say what the "cumulative
distribution" is. Is it "punif", like "pnorm"? I thought of that because I
found a message on this list where someone was told to use "pnorm" instead
of "dnorm". But the help page on the uniform distribution says punif is the
"distribution function". Are the "cumulative distribution" and the
"distribution function" the same thing? Having several names for the same
thing has always confused me very much in statistics.

Also, I am not sure whether I need to specify any parameters for the
distribution and which. I thought maybe I should specify "min=0" and "max=1"
but those appear to be the defaults. Do I need to specify q, the vector of
quantiles?

So is
> ks.test(x, punif)
correct or not for what I am attempting to do?

After this I will also need to find out whether the a's and b's are
distributed randomly in each file. I would be greatful for any pointers
although I have not researched this issue yet.

Kairavi.

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to