Re: Using mllib-1.1.0-SNAPSHOT on Spark 1.0.1

2014-08-09 Thread Matt Forbes
I was having this same problem early this week and had to include my changes in the assembly.
On Sat, Aug 9, 2014 at 9:59 AM, Debasish Das wrote:
> I validated that I can reproduce this problem with master as well (without
> adding any of my mllib changes)...
>
> I separated mllib jar from asse

Documentation confusing or incorrect for decision trees?

2014-08-06 Thread Matt Forbes
I found the section on ordering categorical features really interesting, but the A, B, C example seemed inconsistent. Am I interpreting this passage wrong, or are there typos? Aren't the split candidates A | C, B and A, C | B ? For example, for a binary classification problem with one categorical
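A minimal sketch of the ordering idea behind the question above, in case it helps to check the A, B, C example by hand. It assumes the approach the passage describes: for binary classification, categories are sorted by the proportion of label 1 in each, and only cuts between neighbours in that ordering are considered as split candidates. The proportions below (0.2, 0.6, 0.4) and the function name are illustrative assumptions, since the quoted passage is cut off here; this is not MLlib's actual code.

```python
from typing import Dict, List, Tuple


def ordered_split_candidates(
    positive_rate: Dict[str, float],
) -> List[Tuple[List[str], List[str]]]:
    """Return the split candidates for one ordered categorical feature.

    positive_rate maps each category to the proportion of label 1 seen
    in that category (hypothetical input for this sketch).
    """
    # Sort categories by their proportion of label 1, lowest first.
    ordered = sorted(positive_rate, key=lambda c: positive_rate[c])
    # Each cut between neighbours in the ordering gives one candidate,
    # so k categories yield k - 1 candidates.
    return [(ordered[:i], ordered[i:]) for i in range(1, len(ordered))]


# Assumed proportions of label 1 for categories A, B, C (illustrative only).
rates = {"A": 0.2, "B": 0.6, "C": 0.4}
candidates = ordered_split_candidates(rates)

for left, right in candidates:
    print(", ".join(left), "|", ", ".join(right))
```

With these assumed proportions the categories sort as A, C, B, so the two candidates are A | C, B and A, C | B, which is the reading proposed in the question.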

Re: Problems running modified spark version on ec2 cluster

2014-08-04 Thread Matt Forbes
ress ~/src/forestry-main/spark-defaults.conf $MASTER:spark/conf
ssh $MASTER "spark-ec2/copy-dir --delete /root/spark/lib"
ssh $MASTER "spark/bin/spark-submit $PRIMARY_JAR --class com.ttforbes.TreeTest --verbose"
On Mon, Aug 4, 2014 at 10:23 AM, Matt Forbes wrote:
> I'm t

Problems running modified spark version on ec2 cluster

2014-08-04 Thread Matt Forbes
I'm trying to run a forked version of mllib where I am experimenting with a boosted trees implementation. Here is what I've tried, but I can't seem to get it working properly:
*Directory layout:*
src/spark-dev (spark github fork)
pom.xml - I've tried changing the version to 1.2 arbitrarily in core