Re: adding new elements to batch RDD from DStream RDD

2015-04-15 Thread Sean Owen
the discussion (re-loading batch RDD from file within spark > streaming context) - let's leave that since we are not getting anywhere > > -Original Message- > From: Sean Owen [mailto:so...@cloudera.com] > Sent: Wednesday, April 15, 2015 8:30 PM > To: Evo Eftimov > Cc: user@

RE: adding new elements to batch RDD from DStream RDD

2015-04-15 Thread Evo Eftimov
e not getting anywhere -Original Message- From: Sean Owen [mailto:so...@cloudera.com] Sent: Wednesday, April 15, 2015 8:30 PM To: Evo Eftimov Cc: user@spark.apache.org Subject: Re: adding new elements to batch RDD from DStream RDD What API differences are you talking about? a DStream gives

Re: adding new elements to batch RDD from DStream RDD

2015-04-15 Thread Sean Owen
only way to keep reloading a batch RDD from file is from a >> separate thread still using the same Spark context. The thread will >> reload the batch RDD with the same reference, i.e. reassign the reference >> to the newly instantiated/loaded batch RDD - is that what you mean by

RE: adding new elements to batch RDD from DStream RDD

2015-04-15 Thread Evo Eftimov
h RDDs from file for e.g. a second time, and moreover after a specific period of time -Original Message- From: Sean Owen [mailto:so...@cloudera.com] Sent: Wednesday, April 15, 2015 8:14 PM To: Evo Eftimov Cc: user@spark.apache.org Subject: Re: adding new elements to batch RDD from DStream RDD

Re: adding new elements to batch RDD from DStream RDD

2015-04-15 Thread Sean Owen
loading batch RDD from file > > -Original Message- > From: Sean Owen [mailto:so...@cloudera.com] > Sent: Wednesday, April 15, 2015 7:43 PM > To: Evo Eftimov > Cc: user@spark.apache.org > Subject: Re: adding new elements to batch RDD from DStream RDD > > What do you m

RE: adding new elements to batch RDD from DStream RDD

2015-04-15 Thread Evo Eftimov
iginal Message- From: Sean Owen [mailto:so...@cloudera.com] Sent: Wednesday, April 15, 2015 7:43 PM To: Evo Eftimov Cc: user@spark.apache.org Subject: Re: adding new elements to batch RDD from DStream RDD What do you mean by "batch RDD"? They're just RDDs, though they store their d

Re: adding new elements to batch RDD from DStream RDD

2015-04-15 Thread Sean Owen
What do you mean by "batch RDD"? They're just RDDs, though they store their data in different ways and come from different sources. You can union an RDD from an HDFS file with one from a DStream. It sounds like you want streaming data to live longer than its batch interval, but that's not something you
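
For readers following the thread, the union mentioned above might look roughly like the Scala sketch below. It is not code from the thread: the HDFS path, the socket host and port, the 10-second batch interval and the object name are placeholder assumptions chosen only for illustration. It relies on DStream.transform, which hands each interval's RDD to a function that can combine it with any other RDD created from the same SparkContext.

```scala
import org.apache.spark.SparkConf
import org.apache.spark.rdd.RDD
import org.apache.spark.streaming.{Seconds, StreamingContext}

// Illustrative sketch only; names, paths and ports are assumptions.
object UnionBatchWithStream {
  def main(args: Array[String]): Unit = {
    // local[2]: one thread for the socket receiver, one for processing.
    val conf = new SparkConf()
      .setAppName("UnionBatchWithStream")
      .setMaster("local[2]")
    val ssc = new StreamingContext(conf, Seconds(10))

    // An ordinary RDD loaded once from a file (hypothetical path).
    val batchRDD: RDD[String] =
      ssc.sparkContext.textFile("hdfs:///data/reference.txt").cache()

    // A DStream of text lines from a socket (hypothetical host and port).
    val lines = ssc.socketTextStream("localhost", 9999)

    // For each batch interval, union that interval's RDD with batchRDD.
    // RDDs are immutable: this produces a new RDD per interval and
    // leaves batchRDD itself unchanged.
    val combined = lines.transform(rdd => rdd.union(batchRDD))

    // Print the number of elements in each combined RDD.
    combined.count().print()

    ssc.start()
    ssc.awaitTermination()
  }
}
```

Because the union yields a fresh RDD in every interval, nothing is appended to batchRDD; elements that arrive on the stream are only present in the combined RDD for the interval in which they arrived.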