Re: Connect to remote YARN cluster

2015-04-09 Thread Steve Loughran
> On 9 Apr 2015, at 17:42, Marcelo Vanzin wrote: > > If YARN is authenticating users it's probably running on kerberos, so > you need to log in with your kerberos credentials (kinit) before > submitting an application. also: make sure that you have the full JCE and not the crippled crypto; ever

Re: finding free ports for tests

2015-04-09 Thread Steve Loughran
On 8 Apr 2015, at 20:19, Hari Shreedharan mailto:hshreedha...@cloudera.com>> wrote: One good way to guarantee your tests will work is to have your server bind to an ephemeral port and then query it to find the port it is running on. This ensures that race conditions don’t cause test failures.

Re: enum-like types in Spark

2015-04-09 Thread Imran Rashid
any update here? This is relevant for a currently open PR of mine -- I've got a bunch of new public constants defined w/ format #4, but I'd gladly switch to java enums. (Even if we are just going to postpone this decision, I'm still inclined to switch to java enums ...) just to be clear about th

Re: extended jenkins downtime, thursday april 9th 7am-noon PDT (moving to anaconda python & more)

2015-04-09 Thread shane knapp
ok, we're looking good. i'll keep an eye on this for the rest of the day, and if you happen to notice any infrastructure failures before i do (i updated a LOT), please let me know immediately! :) On Thu, Apr 9, 2015 at 8:38 AM, shane knapp wrote: > things are looking pretty good and i expect t

Re: Spark remote communication pattern

2015-04-09 Thread Reynold Xin
For torrent broadcast, data are read directly through the block manager: https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/broadcast/TorrentBroadcast.scala#L167 On Thu, Apr 9, 2015 at 7:27 AM, Zoltán Zvara wrote: > Thanks! I've found the fetcher! Is there any ot

Re: Connect to remote YARN cluster

2015-04-09 Thread Marcelo Vanzin
If YARN is authenticating users it's probably running on kerberos, so you need to log in with your kerberos credentials (kinit) before submitting an application. On Thu, Apr 9, 2015 at 4:57 AM, Zoltán Zvara wrote: > I'm trying to debug Spark in yarn-client mode. On my local, single node > cluster

Re: extended jenkins downtime, thursday april 9th 7am-noon PDT (moving to anaconda python & more)

2015-04-09 Thread shane knapp
things are looking pretty good and i expect to be done within an hour. i've got some test builds running right now, and will give the green light when they successfully complete. On Thu, Apr 9, 2015 at 7:29 AM, shane knapp wrote: > and this is now happening. > > On Tue, Apr 7, 2015 at 4:38 PM,

Re: [VOTE] Release Apache Spark 1.3.1 (RC2)

2015-04-09 Thread Sean McNamara
+1 tested on OS X Sean > On Apr 7, 2015, at 11:46 PM, Patrick Wendell wrote: > > Please vote on releasing the following candidate as Apache Spark version > 1.3.1! > > The tag to be voted on is v1.3.1-rc2 (commit 7c4473a): > https://git-wip-us.apache.org/repos/asf?p=spark.git;a=commit;h=7c4473

Re: extended jenkins downtime, thursday april 9th 7am-noon PDT (moving to anaconda python & more)

2015-04-09 Thread shane knapp
and this is now happening. On Tue, Apr 7, 2015 at 4:38 PM, shane knapp wrote: > reminder! this is happening thurday morning. > > On Fri, Apr 3, 2015 at 9:59 AM, shane knapp wrote: > >> welcome to python2.7+, java 8 and more! :) >> >> i'll be doing a major upgrade to our build system next thur

Re: Spark remote communication pattern

2015-04-09 Thread Zoltán Zvara
Thanks! I've found the fetcher! Is there any other places and cases where blocks are traveled through network? Zvara Zoltán mail, hangout, skype: zoltan.zv...@gmail.com mobile, viber: +36203129543 bank: 10918001-0021-50480008 address: Hungary, 2475 Kápolnásnyék, Kossuth 6/a elte: HSKSJZ

Connect to remote YARN cluster

2015-04-09 Thread Zoltán Zvara
I'm trying to debug Spark in yarn-client mode. On my local, single node cluster everything works fine, but the remote YARN resource manager throws away my request because of authentication error. I'm running IntelliJ 14 on Ubuntu and the driver tries to connect to YARN with my local user name. How

Re: Spark remote communication pattern

2015-04-09 Thread Reynold Xin
Take a look at the following two files: https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/shuffle/hash/BlockStoreShuffleFetcher.scala and https://github.com/apache/spark/blob/master/core/src/main/scala/org/apache/spark/storage/ShuffleBlockFetcherIterator.scala On

Spark remote communication pattern

2015-04-09 Thread Zoltán Zvara
Dear Developers, I'm trying to investigate the communication pattern regarding data-flow during execution of a Spark program defined by an RDD chain. I'm investigating from the Task point of view, and found out that the task type ResultTask (as retrieving the iterator for its RDD for a given parti