Re: Nested DataFrames

2015-06-25 Thread pawan kumar
Maybe you could try something like this using Spark SQL 1.4 and DataFrames: student.join(Grade, Grade("student_id") === student("student_id"), "left").groupBy("id").agg(sum(grade("Marks")), avg(grade("Marks"))) You could refer to the following document: https://spark.apache.o
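
For readers who want to try the suggestion above, here is a minimal sketch of the same left join followed by a grouped sum and average. It is not the poster's code: the `Student` and `Grade` case classes, their column names, the sample rows, and the `JoinAggSketch` object are all hypothetical, since the thread does not show the real schemas. It assumes Spark 1.4-style `SQLContext` and a local master; on Spark 2.0 or later a `SparkSession` would normally take its place.

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.SQLContext
import org.apache.spark.sql.functions.{avg, sum}

// Hypothetical row types; the real schemas are not shown in the thread.
case class Student(student_id: Int, name: String)
case class Grade(student_id: Int, marks: Double)

object JoinAggSketch {
  def main(args: Array[String]): Unit = {
    // Local master is an assumption made so the sketch can run on one machine.
    val conf = new SparkConf().setAppName("join-agg-sketch").setMaster("local[*]")
    val sc = new SparkContext(conf)
    val sqlContext = new SQLContext(sc)
    import sqlContext.implicits._

    // Made-up sample data: student 3 has no grades.
    val students = sc.parallelize(Seq(
      Student(1, "a"), Student(2, "b"), Student(3, "c"))).toDF()
    val grades = sc.parallelize(Seq(
      Grade(1, 80.0), Grade(1, 90.0), Grade(2, 70.0))).toDF()

    // Left join keeps every student, then aggregate the marks per student.
    val summary = students
      .join(grades, students("student_id") === grades("student_id"), "left")
      .groupBy(students("student_id"))
      .agg(sum(grades("marks")).as("total_marks"),
           avg(grades("marks")).as("avg_marks"))

    summary.show()
    sc.stop()
  }
}
```

Because the join type is "left", a student with no matching grade rows still appears in the result, with null for both aggregates. Grouping by `students("student_id")` rather than a bare column name avoids the ambiguity that arises when both sides of the join carry a column with the same name.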

Re: Code review - Spark SQL command-line client for Cassandra

2015-06-22 Thread pawan kumar
for the connector for 1.4.0 (as found here > http://search.maven.org/#artifactdetails%7Ccom.datastax.spark%7Cspark-cassandra-connector_2.11%7C1.4.0-M1%7Cjar)? > What exactly is it that does not currently exist for Spark 1.4? > > > > Thanks, > > Matthew > > > > *Fr

Re: Code review - Spark SQL command-line client for Cassandra

2015-06-22 Thread pawan kumar
Hi, Zeppelin has a cassandra-spark-connector built into the build. I have not tried it yet; maybe you could let us know. https://github.com/apache/incubator-zeppelin/pull/79 To build a Zeppelin version with the *Datastax Spark/Cassandra connector

Re: Spark 1.4.0 build Error on Windows

2015-06-03 Thread pawan kumar
I got the same error message when using Maven 3.3. On Jun 3, 2015 8:58 AM, "Ted Yu" wrote: > I used the same command on Linux but didn't reproduce the error. > > Can you include -X switch on your command line ? > > Also consider upgrading maven to 3.3.x > > Cheers > > On Wed, Jun 3, 2015 at 2:36

Re: Tableau + Spark SQL Thrift Server + Cassandra

2015-04-03 Thread pawan kumar
e I have had a chance to do anything > with Zeppelin, but here is a link to a post on what I did to get it working > https://groups.google.com/forum/#!topic/zeppelin-developers/mCNdyOXNikI. > This may or may not work with the newer releases from Zeppelin. > > -Todd > > On Fri,

Re: Tableau + Spark SQL Thrift Server + Cassandra

2015-04-03 Thread pawan kumar
st:1/default >> 0: jdbc:hive2://localhost:1/default> select * from t; >> >> >> I have not tried this yet from Tableau. My understanding is that the >> tempTable is only valid as long as the sqlContext is, so if one terminates >> the code representing t

RE: Tableau + Spark SQL Thrift Server + Cassandra

2015-04-03 Thread pawan kumar
to see the tables in C*. > > > > Hi Pawan, > > Tableau + C* is pretty straight forward, especially if you are using DSE. > Create a new DSN in Tableau using the ODBC driver that comes with DSE. Once > you connect, Tableau allows to use C* keyspace as schema and column

Re: Tableau + Spark SQL Thrift Server + Cassandra

2015-04-03 Thread pawan kumar
Hi Todd, Thanks for the link. I would be interested in this solution. I am using DSE for Cassandra. Could you provide me with info on connecting to DSE through either Tableau or Zeppelin? The goal here is to query Cassandra through Spark SQL so that I can perform joins and group-bys in my queries.

Re: Creating a SparkR standalone job

2014-04-07 Thread pawan kumar
ssible > to do that in your code if the classpath etc. is setup correctly. I haven't > tried it out though, but do let us know if you get it to work. > > Thanks > Shivaram > > > On Mon, Apr 7, 2014 at 2:21 PM, pawan kumar wrote: > >> Hi, >> >>

Creating a SparkR standalone job

2014-04-07 Thread pawan kumar
Hi, Is it possible to create a standalone job in Scala using SparkR? If so, could you provide me with information on the setup process (such as the dependencies in SBT and where to include the JAR files)? This is my use-case: 1. I have a Spark Streaming standalone Job running in local machin