Hi all

newbie question.

I've generated a very simple script to process gene expression data  
from mysql output.

One of the things I'm trying to do is feed about 17000 vectors that I  
generate in this script into a wilcoxon test through rpy2, and  
collecting the p value as output.

Some of the vectors are quite long.  I begin to see truncated returns  
e.g.:
        
'Wilcoxon rank sum test with'

or

'data:  (3, 3, 3, 3, 3, 3, 3, 3,'

once the vector length hits 1500, and things like the examples above  
are uniformly returned once the vector length hits around 4000.  These  
are of course the beginning of the R output, which looks like this:

        Wilcoxon rank sum test with continuity correction

data:  t(a[1]) and t(a[3])
W = 3052972, p-value < 2.2e-16
alternative hypothesis: true location shift is not equal to 0

which makes me believe that the test has actually been run.

I've checked all my inputs and everything is peachy on the Python  
side.  Given the high correlation of the bug with the vector length, I  
think the most likely explanations are the following:

(1)  vector is too long and this causes some breakdown when R and  
Python are talking to each other
(2)  test takes too long and i need to somehow slow the script down to  
wait for the full output.

Any ideas or help would be appreciated.

Joel

------------------------------------------------------------------------------
_______________________________________________
rpy-list mailing list
rpy-list@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rpy-list

Reply via email to