Hi Mohammed, Not sure if you have tried this or not. You could try using the below api to start the thriftserver with an existing context.
https://github.com/apache/spark/blob/master/sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/HiveThriftServer2.scala#L42 The one thing that Michael Ambrust @ databrick recommended was this: > You can start a JDBC server with an existing context. See my answer here: > http://apache-spark-user-list.1001560.n3.nabble.com/Standard-SQL-tool-access-to-SchemaRDD-td20197.html So something like this based on example from Cheng Lian: *Server* import org.apache.spark.sql.hive.HiveContext import org.apache.spark.sql.catalyst.types._ val sparkContext = sc import sparkContext._ val sqlContext = new HiveContext(sparkContext) import sqlContext._ makeRDD((1,"hello") :: (2,"world") ::Nil).toSchemaRDD.cache().registerTempTable("t") // replace the above with the C* + spark-casandra-connectore to generate SchemaRDD and registerTempTable import org.apache.spark.sql.hive.thriftserver._ HiveThriftServer2.startWithContext(sqlContext) Then Startup ./bin/beeline -u jdbc:hive2://localhost:10000/default 0: jdbc:hive2://localhost:10000/default> select * from t; I have not tried this yet from Tableau. My understanding is that the tempTable is only valid as long as the sqlContext is, so if one terminates the code representing the *Server*, and then restarts the standard thrift server, sbin/start-thriftserver ..., the table won't be available. Another possibility is to perhaps use the tuplejump cash project, https://github.com/tuplejump/cash. HTH. -Todd On Fri, Apr 3, 2015 at 11:11 AM, pawan kumar <[email protected]> wrote: > Thanks mohammed. Will give it a try today. We would also need the > sparksSQL piece as we are migrating our data store from oracle to C* and it > would be easier to maintain all the reports rather recreating each one from > scratch. > > Thanks, > Pawan Venugopal. > On Apr 3, 2015 7:59 AM, "Mohammed Guller" <[email protected]> wrote: > >> Hi Todd, >> >> >> >> We are using Apache C* 2.1.3, not DSE. We got Tableau to work directly >> with C* using the ODBC driver, but now would like to add Spark SQL to the >> mix. I haven’t been able to find any documentation for how to make this >> combination work. >> >> >> >> We are using the Spark-Cassandra-Connector in our applications, but >> haven’t been able to figure out how to get the Spark SQL Thrift Server to >> use it and connect to C*. That is the missing piece. Once we solve that >> piece of the puzzle then Tableau should be able to see the tables in C*. >> >> >> >> Hi Pawan, >> >> Tableau + C* is pretty straight forward, especially if you are using DSE. >> Create a new DSN in Tableau using the ODBC driver that comes with DSE. Once >> you connect, Tableau allows to use C* keyspace as schema and column >> families as tables. >> >> >> >> Mohammed >> >> >> >> *From:* pawan kumar [mailto:[email protected]] >> *Sent:* Friday, April 3, 2015 7:41 AM >> *To:* Todd Nist >> *Cc:* [email protected]; Mohammed Guller >> *Subject:* Re: Tableau + Spark SQL Thrift Server + Cassandra >> >> >> >> Hi Todd, >> >> Thanks for the link. I would be interested in this solution. I am using >> DSE for cassandra. Would you provide me with info on connecting with DSE >> either through Tableau or zeppelin. The goal here is query cassandra >> through spark sql so that I could perform joins and groupby on my queries. >> Are you able to perform spark sql queries with tableau? >> >> Thanks, >> Pawan Venugopal >> >> On Apr 3, 2015 5:03 AM, "Todd Nist" <[email protected]> wrote: >> >> What version of Cassandra are you using? Are you using DSE or the stock >> Apache Cassandra version? I have connected it with DSE, but have not >> attempted it with the standard Apache Cassandra version. >> >> >> >> FWIW, >> http://www.datastax.com/dev/blog/datastax-odbc-cql-connector-apache-cassandra-datastax-enterprise, >> provides an ODBC driver tor accessing C* from Tableau. Granted it does not >> provide all the goodness of Spark. Are you attempting to leverage the >> spark-cassandra-connector for this? >> >> >> >> >> >> >> >> On Thu, Apr 2, 2015 at 10:20 PM, Mohammed Guller <[email protected]> >> wrote: >> >> Hi – >> >> >> >> Is anybody using Tableau to analyze data in Cassandra through the Spark >> SQL Thrift Server? >> >> >> >> Thanks! >> >> >> >> Mohammed >> >> >> >> >> >
