If you want to share RDDs, it might be worth checking out Tachyon / Alluxio.
For the Thrift server, I believe the datasets live in your Spark cluster as RDDs, and you just query them through the Thrift JDBC distributed query engine connector.

2016-05-17 5:12 GMT+10:00 Michael Segel <[email protected]>:

> For one use case.. we were considering using the thrift server as a way to
> allow multiple clients access shared RDDs.
>
> Within the Thrift Context, we create an RDD and expose it as a hive table.
>
> The question is… where does the RDD exist. On the Thrift service node
> itself, or is that just a reference to the RDD which is contained with
> contexts on the cluster?
>
> Thx
>
> -Mike
