Thanks, Greg.

I also considered the clusters. The difficulty is those objects not only
enter the system at different time, but may have different duration in the
system. Once they have a time overlap in the system, impacts may exist. If
splitting  into two clusters by setting a time threshold t, I need to drop
all objects that enter before time t and leave after time t. The more
clusters, the more objects to be dropped that I don't prefer. But two or
three clusters may be too small as a sample size. My purpose is to test the
difference between two systems.

Back to the R function question. When sample size is large, the full
permutation test is infeasible and we have to use randomization test by
selecting permutation order randomly. One factor I know that impacts the
randomness is the random number generator. I am not sure how well the
function "sample" is in randomness.

Thanks,
Wenjin

On Tue, May 24, 2011 at 4:45 PM, Greg Snow <greg.s...@imail.org> wrote:

> If the x's that don't enter at the same time can be considered independent
> of each other, and only clusters that enter at the same time are dependent,
> then you can still do a permutation test by creating clusters with dependent
> values within each cluster, but independent between clusters, then permute
> the clusters rather than the individual data points.  This maintains the
> dependency.
>
> I don't know of any existing functions that will do the whole thing for
> you, but this would only be a few lines of R code to do this type of
> permutation test.  The split function can help with separating the clusters,
> sample can do the permutations, and unlist or sapply can be used in
> calculating the statistic of interest.
>
> -----Original Message-----
> From: r-help-boun...@r-project.org [mailto:r-help-boun...@r-project.org]
> On Behalf Of Wenjin Mao
> Sent: Tuesday, May 24, 2011 11:22 AM
> To: Meyners, Michael
> Cc: r-help@r-project.org
> Subject: Re: [R] help on permutation/randomization test
>
> Thank you, Michael.
>
> I don't think those data for the same group can be treated as repeated
> measurements. Let's say I have 1000 observations from group 1 and 1500 obs
> from group 2. Some of the 1000 objects of group 1 entered the system at the
> same time and may effect each other; same for the other group. It's hard to
> measure the heaviness of the dependency.
>
> Even after some twist or transformation, the correlation can be reduced,
> the
> R function "permtest" cannot handle such high sample size. Is there any
> other R function I can use?
>
> Thanks,
> Wenjin
>
> On Tue, May 24, 2011 at 1:37 AM, Meyners, Michael <meyner...@pg.com>
> wrote:
>
> > I suspect you need to give more information/background on the data
> (though
> > this is not primarily an R-related question; you might want to try other
> > resources instead). Unless I'm missing something here, I cannot think of
> ANY
> > reasonable test: A permutation (using permtest or anything else) would
> > destroy the correlation structure and hence give invalid results, and the
> > assumptions of parametric tests are violated as well. Basically, you only
> > have two observations, one for each group; with some good will you might
> > consider these as repeated measurements, but still on the same subject or
> > whatsoever. Hence no way to discriminate the subject from a treatment
> > effect. There is not enough data to permute or to rely a statistical test
> > on. So unless you can get rid of the dependency within groups (or at
> least
> > reasonably assume observations to be independent), I'm not very
> > optimistic...
> > HTH, Michael
> >
> > > -----Original Message-----
> > > From: r-help-boun...@r-project.org [mailto:r-help-bounces@r-
> > > project.org] On Behalf Of Wenjin Mao
> > > Sent: Monday, May 23, 2011 20:56
> > > To: r-help@r-project.org
> > > Subject: [R] help on permutation/randomization test
> > >
> > > Hi,
> > >
> > > I have two groups of data of different size:
> > >    group A: x1, x2, ...., x_n;
> > >    group B: y1, y2, ...., y_m; (m is not equal to n)
> > >
> > > The two groups are independent but observations within each group are
> > > not independent,
> > >  i.e., x1, x2, ..., x_n are not independent; but x's are independent
> > > from y's
> > >
> > > I wonder if randomization test is still applicable to this case. Does
> > > R have any function that can do this test for large m and n? I notice
> > > that "permtest" can only handle small (m+n<22) samples.
> > >
> > > Thank you very much,
> > > Wenjin
> > >
> > > ______________________________________________
> > > R-help@r-project.org mailing list
> > > https://stat.ethz.ch/mailman/listinfo/r-help
> > > PLEASE do read the posting guide http://www.R-project.org/posting-
> > > guide.html
> > > and provide commented, minimal, self-contained, reproducible code.
> >
>
>         [[alternative HTML version deleted]]
>
> ______________________________________________
> R-help@r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide
> http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
>

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to