Re: Any ways to connect BI tool to Spark without Hive

ayan guha Thu, 07 Jul 2016 21:22:11 -0700

Hi

Spark Thrift does not need Hive/hadoop. STS should be your first choice if
you are planning to integrate BI tools with Spark. It works with Zeppelin
as well. We do all our development using Zeppelin and STS.


One thing to note: many BI tools like Qliksense, Tablaue (not sure of
oracle Bi Tool) queires and the caches data on client side. This works
really well in real life.


On Fri, Jul 8, 2016 at 1:58 PM, Chanh Le <giaosu...@gmail.com> wrote:

> Hi Mich,
> Thanks for replying. Currently we think we need to separate 2 groups of
> user.
> 1. Technical: Can write SQL
> 2. Business: Can drag and drop fields or metrics and see the result.
> Our stack using Zeppeline, Spark SQL to query data from Alluxio. Our data
> current store in parquet files.* Zeppelin is using HiveContext but we
> haven’t set up Hive and Hadoop yet*.
>
> I am little bit confuse in Spark Thift Server because Thift Server in
> Spark can allow external tools connect but *is that require to set up
> Hive and Hadoop*?
>
> Thanks and regards,
> Chanh
>
>
>
> On Jul 8, 2016, at 10:49 AM, Mich Talebzadeh <mich.talebza...@gmail.com>
> wrote:
>
> hi,
>
> I have not used Alluxio but it is a distributed file system much like an
> IMDB say Oracle TimesTen. Spark is your query tool and Zeppelin is the GUI
> interface to your Spark which basically allows you graphs with Spark
> queries.
>
> You mentioned Hive so I assume your persistent storage is Hive?
>
> Your business are using Oracle BI tool. It is like Tableau. I assume
> Oracle BI tool accesses a database of some sort say Oracle DW using native
> connectivity and it may also have ODBC and JDBC connections to Hive etc.
>
> The issue I see here is your GUI tool Zeppelin which does the same thing
> as Oracle BI tool. Can you please clarify below:
>
>
>    1. you use Hive as your database/persistent storage and use Alluxio on
>    top of Hive?
>    2. are users accessing Hive or a Data Warehouse like Oracle
>    3. Oracle BI tools are pretty mature. Zeppelin is not in the same
>    league so you have to decide which technology stack to follow
>    4. Spark should work with Oracle BI tool as well (need to check this)
>    as a fast query tool. In that case the users can use Oracle BI tool with
>    Spark as well.
>
> It seems to me that the issue is that users don't want to move from Oracle
> BI tool. We had the same issue with Tableau. So you really need to make
> that Oracle BI tool use Spark and Alluxio and leave Zeppelin at one side.
>
> Zeppelin as I used it a while back may not do what Oracle BI tool does. So
> the presentation layer has to be Oracle BI tool.
>
> HTH
>
>
>
> Dr Mich Talebzadeh
>
>
> LinkedIn * 
> https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw
> <https://www.linkedin.com/profile/view?id=AAEAAAAWh2gBxianrbJd6zP6AcPCCdOABUrV8Pw>*
>
>
> http://talebzadehmich.wordpress.com
>
> *Disclaimer:* Use it at your own risk. Any and all responsibility for any
> loss, damage or destruction of data or any other property which may arise
> from relying on this email's technical content is explicitly disclaimed.
> The author will in no case be liable for any monetary damages arising from
> such loss, damage or destruction.
>
>
>
> On 8 July 2016 at 04:19, Chanh Le <giaosu...@gmail.com> wrote:
>
>> Hi everyone,
>> Currently we use Zeppelin to analytics our data and because of using SQL
>> it’s hard to distribute for users use. But users are using some kind of
>> Oracle BI tools to analytic because it support some kinds of drag and drop
>> and we can do some kind of permitted for each user.
>> Our architecture is Spark, Alluxio, Zeppelin. Because We want to share
>> what we have done in Zeppelin to business users.
>>
>> Is there any way to do that?
>>
>> Thanks.
>> ---------------------------------------------------------------------
>> To unsubscribe e-mail: user-unsubscr...@spark.apache.org
>>
>>
>
>


-- 
Best Regards,
Ayan Guha

Re: Any ways to connect BI tool to Spark without Hive

Reply via email to