Re: How can I do pair-wise computation between RDD feature columns?

2015-05-16 Thread Debasish Das
I opened it up today but it should help you: https://github.com/apache/spark/pull/6213 On Sat, May 16, 2015 at 6:18 PM, Chunnan Yao wrote: > Hi all, > Recently I've ran into a scenario to conduct two sample tests between all > paired combination of columns of an RDD. But the networking load and

How can I do pair-wise computation between RDD feature columns?

2015-05-16 Thread Chunnan Yao
Hi all, Recently I've ran into a scenario to conduct two sample tests between all paired combination of columns of an RDD. But the networking load and generation of pair-wise computation is too time consuming. That has puzzled me for a long time. I want to conduct Wilcoxon rank-sum test (http://en