Hi Wes,

Here is some background on these two use cases.

1. The first use case is about calling Watson services in a scalable manner
using a custom REST data source:
https://developer.ibm.com/dwblog/2018/distributed-rest-calls-to-watson-services-using-rest-data-source-on-apache-spark/
The idea is to have this custom REST data source
(https://github.com/sourav-mazumder/Data-Science-Extensions/tree/master/spark-datasource-rest)
store its data using Arrow and so gain the performance advantage. The REST
data source can be used with any REST-based data service, not just Watson
services.

2. NetCDF is a popular format in the scientific world. I have worked on some
climate analytics use cases that use it. Here is a list of organizations that
use it today: https://www.unidata.ucar.edu/software/netcdf/usage.html
I did some work mapping the NetCDF format to Spark RDDs, and with Arrow I am
trying to make that mapping more generic. So this talk would focus on the
conceptual aspects of mapping NetCDF to Arrow.

Regards,
Sourav Mazumder
Data Science Center of Competency
IBM Analytics

On Tue, Mar 27, 2018 at 11:44 AM, Wes McKinney <wesmck...@gmail.com> wrote:
> hi Sourav,
>
> Do you have prior references for either of these topics / use cases? I
> had not heard about them before.
>
> Thanks,
> Wes
>
> On Tue, Mar 27, 2018 at 2:12 PM, Sourav Mazumder
> <sourav.mazumde...@gmail.com> wrote:
> > Hi Jacques,
> >
> > I can talk about on either of these 2 topics -
> >
> > 1. Using Arrow with IBM Watson Studio for vectorized query processing on
> > large volume of data
> >
> > 2. Using Arrow for NetCDF data format for supporting scientific data
> > processing
> >
> > Regards,
> > Sourav Mazumder
> > Data Science Center of Competency
> > IBM Analytics
> >
> > On Tue, Mar 27, 2018 at 10:03 AM, Bryan Cutler <cutl...@gmail.com> wrote:
> >
> >> Hi Jacques,
> >>
> >> I could talk about some of the integration with Spark, if you have room for
> >> another.
> >>
> >> Thanks,
> >> Bryan
> >>
> >> On Tue, Mar 27, 2018 at 9:51 AM, Atul Dambalkar <atul.dambal...@xoriant.com> wrote:
> >>
> >> > Hi Jacques,
> >> >
> >> > If it makes sense, I can certainly talk about the JDBC Adapter work I have
> >> > been doing and also some of the future enhancements/related bits of work
> >> > that could happen surrounding that. Please let me know if this would be
> >> > useful for the audience.
> >> >
> >> > Regards,
> >> > -Atul
> >> >
> >> > -----Original Message-----
> >> > From: Jacques Nadeau [mailto:jacq...@apache.org]
> >> > Sent: Tuesday, March 27, 2018 9:12 AM
> >> > To: dev@arrow.apache.org
> >> > Subject: Meetup in SF, Additional Speakers?
> >> >
> >> > Hey All,
> >> >
> >> > It looks like Thumbtack crew offered to host an Arrow meetup for the Arrow
> >> > community in San Francisco and one of my colleagues set it up on Meetup.
> >> > Sidd and I have been volunteered to do talks but I would really love to
> >> > have one or two other speakers as well. Anyone going to be in San Francisco
> >> > on April 17 and interested in proposing a talk about Arrow related things?
> >> >
> >> > Meetup Link: https://www.meetup.com/Apache-Arrow-Meetup/events/249150105/
> >> >
> >> > thanks,
> >> > Jacques