[GitHub] spark pull request: SPARK-1242 Add aggregate to python rdd

2014-03-13 Thread holdenk
GitHub user holdenk opened a pull request: https://github.com/apache/spark/pull/139 SPARK-1242 Add aggregate to python rdd You can merge this pull request into a Git repository by running: $ git pull https://github.com/holdenk/spark add_aggregate_to_python_api Alternatively

[GitHub] spark pull request: Spark 615 map partitions with index callable f...

2014-03-08 Thread holdenk
Github user holdenk commented on the pull request: https://github.com/apache/spark/pull/16#issuecomment-37107763 Sure, I'll give this a shot today :) On Sat, Mar 8, 2014 at 11:24 AM, Patrick Wendell wrote: > @holdenk <https://github.com/holdenk>

[GitHub] spark pull request: GRAPH-1: Map side distinct in collect vertex i...

2014-03-08 Thread holdenk
Github user holdenk commented on the pull request: https://github.com/apache/spark/pull/21#issuecomment-37094612 Jenkins, retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have

[GitHub] spark pull request: GRAPH-1: Map side distinct in collect vertex i...

2014-03-08 Thread holdenk
Github user holdenk commented on the pull request: https://github.com/apache/spark/pull/21#issuecomment-37093324 retest this please --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your project does not have this

[GitHub] spark pull request: MLI-2: Start adding k-fold cross validation to...

2014-03-07 Thread holdenk
Github user holdenk commented on the pull request: https://github.com/apache/spark/pull/18#issuecomment-36977010 Is MLI-2 not a good JIRA issue to use for this? --- If your project is set up for it, you can reply to this email and have your reply appear on GitHub as well. If your

[GitHub] spark pull request: GRAPH-1: Map side distinct in collect vertex i...

2014-03-06 Thread holdenk
Github user holdenk commented on the pull request: https://github.com/apache/spark/pull/21#issuecomment-36951277 Ok I'll switch it tonight. On Thu, Mar 6, 2014 at 3:09 PM, Reynold Xin wrote: > We should use the primitive hashmap - otherwise it is pre

[GitHub] spark pull request: MLI-2: Start adding k-fold cross validation to...

2014-03-03 Thread holdenk
Github user holdenk commented on a diff in the pull request: https://github.com/apache/spark/pull/18#discussion_r10241454 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/util/MLUtils.scala --- @@ -62,6 +67,20 @@ object MLUtils { } /** + * Return a k

[GitHub] spark pull request: MLI-2: Start adding k-fold cross validation to...

2014-03-03 Thread holdenk
Github user holdenk commented on a diff in the pull request: https://github.com/apache/spark/pull/18#discussion_r10203849 --- Diff: mllib/src/main/scala/org/apache/spark/mllib/util/MLUtils.scala --- @@ -62,6 +67,20 @@ object MLUtils { } /** + * Return a k