There is a newly introduced JDBC data source in Spark 1.3.0 (not the
JdbcRDD in Spark core), which may be useful. However, there is currently
no SQL Server-specific logic implemented, so I'd assume standard SQL
queries should work.
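As a minimal sketch of how the 1.3.0 JDBC data source could be pointed at a SQL Server table (the connection URL, table name, and partitioning column below are hypothetical placeholders; the Microsoft JDBC driver jar would need to be on the classpath of the driver and executors):

```scala
import org.apache.spark.{SparkConf, SparkContext}
import org.apache.spark.sql.SQLContext

object SqlServerJdbcExample {
  def main(args: Array[String]): Unit = {
    val sc = new SparkContext(new SparkConf().setAppName("sqlserver-jdbc"))
    val sqlContext = new SQLContext(sc)

    // Load a SQL Server table through the generic JDBC data source.
    // partitionColumn/lowerBound/upperBound/numPartitions split the table
    // scan into parallel JDBC reads, one per partition.
    val activity = sqlContext.load("jdbc", Map(
      "url"             -> "jdbc:sqlserver://dbhost:1433;databaseName=analytics",
      "dbtable"         -> "dbo.Activity",
      "partitionColumn" -> "ActivityId",
      "lowerBound"      -> "1",
      "upperBound"      -> "10000000000",
      "numPartitions"   -> "64"
    ))

    // Register the DataFrame so standard SQL can be run against it.
    activity.registerTempTable("activity")
    sqlContext.sql("SELECT COUNT(*) FROM activity").show()
  }
}
```

Since the queries are pushed through plain JDBC, the database still does the row reads; the parallelism only helps if SQL Server can serve the partitioned range scans concurrently.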
Cheng
On 2/24/15 7:02 PM, Suhel M wrote:
Hey,
I am trying to work out what is the best way we can leverage Spark for
crunching data that is sitting in SQL Server databases.
The ideal scenario is being able to work efficiently with big data
(10 billion+ rows of activity data). We need to shape this data for
machine learning problems and want to run ad-hoc and complex queries and
get results in a timely manner.
All our data crunching is done via SQL/MDX queries, but these
obviously take a very long time to run over data of this size. Also, we
currently don't have Hadoop or any other distributed storage.
Keen to hear feedback/thoughts/war stories from the Spark community on
the best way to approach this situation.
Thanks
Suhel