Hi Mohammed,

Not sure whether you have tried this already, but you could use the API
below to start the Thrift server with an existing context.

https://github.com/apache/spark/blob/master/sql/hive-thriftserver/src/main/scala/org/apache/spark/sql/hive/thriftserver/HiveThriftServer2.scala#L42

The one thing that Michael Armbrust @ Databricks recommended was this:

> You can start a JDBC server with an existing context.  See my answer here:
> http://apache-spark-user-list.1001560.n3.nabble.com/Standard-SQL-tool-access-to-SchemaRDD-td20197.html

So something like this, based on an example from Cheng Lian:

*Server*

import org.apache.spark.sql.hive.HiveContext
import org.apache.spark.sql.catalyst.types._

val sparkContext = sc
import sparkContext._
val sqlContext = new HiveContext(sparkContext)
import sqlContext._

makeRDD((1, "hello") :: (2, "world") :: Nil).toSchemaRDD.cache().registerTempTable("t")
// Replace the line above with C* + spark-cassandra-connector code that
// generates a SchemaRDD and calls registerTempTable.

import org.apache.spark.sql.hive.thriftserver._
HiveThriftServer2.startWithContext(sqlContext)
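
For the C* case, a rough and untested sketch of what that replacement might
look like is below. It assumes the spark-cassandra-connector jar is on the
classpath, that spark.cassandra.connection.host is set in the Spark conf,
and that the implicit RDD-to-SchemaRDD conversion from "import sqlContext._"
applies to the connector's RDD. The keyspace "my_keyspace", the table
"reports", and the Report case class are made-up placeholders, not names
from this thread.

```scala
import org.apache.spark.sql.hive.HiveContext
import org.apache.spark.sql.hive.thriftserver._
import com.datastax.spark.connector._ // adds cassandraTable to SparkContext

// Hypothetical row type: field names are assumed to match the C* columns.
case class Report(id: Int, name: String)

val sqlContext = new HiveContext(sc)
import sqlContext._ // implicit conversion from RDD[case class] to SchemaRDD

// Read the (placeholder) C* table as an RDD of Report and expose it as a
// temp table in the same context that the Thrift server will use.
sc.cassandraTable[Report]("my_keyspace", "reports")
  .registerTempTable("reports")

// Start the JDBC/ODBC server on top of this context.
HiveThriftServer2.startWithContext(sqlContext)
```

The point of doing it in one shell session is that the temp table and the
Thrift server share the same context, so JDBC clients can see the table.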

Then start up beeline and query the table:

./bin/beeline -u jdbc:hive2://localhost:10000/default
0: jdbc:hive2://localhost:10000/default> select * from t;
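
Any JDBC client should be able to run the same query that beeline does. A
minimal sketch in Scala, assuming the hive-jdbc driver jar is on the
classpath and the server is listening on the default localhost:10000 (the
empty user name and password are an assumption for an unsecured setup):

```scala
import java.sql.DriverManager

// Load the HiveServer2 JDBC driver (needs the hive-jdbc jar).
Class.forName("org.apache.hive.jdbc.HiveDriver")

// Same URL as the beeline command; credentials are assumed to be unused.
val conn = DriverManager.getConnection(
  "jdbc:hive2://localhost:10000/default", "", "")
try {
  val rs = conn.createStatement().executeQuery("select * from t")
  // Table t from the server example has an Int column and a String column.
  while (rs.next()) {
    println(rs.getInt(1) + "\t" + rs.getString(2))
  }
} finally {
  conn.close()
}
```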


I have not tried this yet from Tableau. My understanding is that the
tempTable is only valid as long as the sqlContext is alive, so if one
terminates the code representing the *Server* and then restarts the standard
thrift server, sbin/start-thriftserver ..., the table won't be available.

Another possibility is to use the tuplejump cash project,
https://github.com/tuplejump/cash.

HTH.

-Todd

On Fri, Apr 3, 2015 at 11:11 AM, pawan kumar <[email protected]> wrote:

> Thanks, Mohammed. Will give it a try today. We would also need the
> Spark SQL piece, as we are migrating our data store from Oracle to C* and it
> would be easier to maintain all the reports rather than recreating each one
> from scratch.
>
> Thanks,
> Pawan Venugopal.
> On Apr 3, 2015 7:59 AM, "Mohammed Guller" <[email protected]> wrote:
>
>>  Hi Todd,
>>
>>
>>
>> We are using Apache C* 2.1.3, not DSE. We got Tableau to work directly
>> with C* using the ODBC driver, but now would like to add Spark SQL to the
>> mix. I haven’t been able to find any documentation for how to make this
>> combination work.
>>
>>
>>
>> We are using the Spark-Cassandra-Connector in our applications, but
>> haven’t been able to figure out how to get the Spark SQL Thrift Server to
>> use it and connect to C*. That is the missing piece. Once we solve that
>> piece of the puzzle then Tableau should be able to see the tables in C*.
>>
>>
>>
>> Hi Pawan,
>>
>> Tableau + C* is pretty straightforward, especially if you are using DSE.
>> Create a new DSN in Tableau using the ODBC driver that comes with DSE. Once
>> you connect, Tableau allows you to use a C* keyspace as a schema and column
>> families as tables.
>>
>>
>>
>> Mohammed
>>
>>
>>
>> *From:* pawan kumar [mailto:[email protected]]
>> *Sent:* Friday, April 3, 2015 7:41 AM
>> *To:* Todd Nist
>> *Cc:* [email protected]; Mohammed Guller
>> *Subject:* Re: Tableau + Spark SQL Thrift Server + Cassandra
>>
>>
>>
>> Hi Todd,
>>
>> Thanks for the link. I would be interested in this solution. I am using
>> DSE for Cassandra. Could you provide me with info on connecting with DSE
>> through either Tableau or Zeppelin? The goal here is to query Cassandra
>> through Spark SQL so that I can perform joins and group-bys in my queries.
>> Are you able to perform Spark SQL queries with Tableau?
>>
>> Thanks,
>> Pawan Venugopal
>>
>> On Apr 3, 2015 5:03 AM, "Todd Nist" <[email protected]> wrote:
>>
>> What version of Cassandra are you using?  Are you using DSE or the stock
>> Apache Cassandra version?  I have connected it with DSE, but have not
>> attempted it with the standard Apache Cassandra version.
>>
>>
>>
>> FWIW,
>> http://www.datastax.com/dev/blog/datastax-odbc-cql-connector-apache-cassandra-datastax-enterprise
>> provides an ODBC driver for accessing C* from Tableau.  Granted, it does not
>> provide all the goodness of Spark.  Are you attempting to leverage the
>> spark-cassandra-connector for this?
>>
>>
>>
>>
>>
>>
>>
>> On Thu, Apr 2, 2015 at 10:20 PM, Mohammed Guller <[email protected]>
>> wrote:
>>
>> Hi –
>>
>>
>>
>> Is anybody using Tableau to analyze data in Cassandra through the Spark
>> SQL Thrift Server?
>>
>>
>>
>> Thanks!
>>
>>
>>
>> Mohammed
>>
>>
>>
>>
>>
>
