Hi,

Here's my situation, I have some kind of offline dataset, but I want to
form a virtual data stream feeding to Spark Streaming, my code looks like
this


   // sort offline data by time
 1)  JavaRDD sortedByTime = offlineDataRDD.sortBy( );

   // compute a list of JavaRDD,  each element JavaRDD is hosting the data
in the same time
   // bucket.
  2) List<JavaRDD> virtualStreamRdd = ?

    Queue<JavaRDD<Row>> queue = Queues.newLinkedBlockingQueue();
    queue.addAll(virtualStreamRdd);

    /*
     * Create DStream from the queue
     */

    3) final JavaDStream<Row> rowDStream =
streamingContext.queueStream(queue);


Currently I'm stucking in 2), any suggestion is appreciated.

Thanks

-- 
--Anfernee

Reply via email to