spark over c* can pushdown lots of things (basic filter or where clause to
more advanced semi join)

2016-04-09 3:54 GMT+02:00 kurt Greaves <k...@instaclustr.com>:

> If you're using C* 3.0 you can probably achieve this with UDFs.
> http://www.planetcassandra.org/blog/user-defined-functions-in-cassandra-3-0/
>
> On 9 April 2016 at 00:22, Kevin Burton <bur...@spinn3r.com> wrote:
>
>> Ha..  Yes... C*...  I guess I need something like coprocessors in
>> bigtable.
>>
>> On Fri, Apr 8, 2016 at 1:49 AM, vincent gromakowski <
>> vincent.gromakow...@gmail.com> wrote:
>>
>>> c* I suppose
>>>
>>> 2016-04-07 19:30 GMT+02:00 Jonathan Haddad <j...@jonhaddad.com>:
>>>
>>>> What is CS?
>>>>
>>>> On Thu, Apr 7, 2016 at 10:03 AM Kevin Burton <bur...@spinn3r.com>
>>>> wrote:
>>>>
>>>>> I have a paging model whereby we stream data from CS by fetching
>>>>> 'pages' thereby reading (sequentially) entire datasets.
>>>>>
>>>>> We're using the bucket approach where we write data for 5 minutes,
>>>>> then we can just fetch the bucket for that range.
>>>>>
>>>>> Our app now has TONS of data and we have a piece of middleware that
>>>>> filters it based on the client requests.
>>>>>
>>>>> So if they only want english they just get english and filter away
>>>>> about 60% of our data.
>>>>>
>>>>> but it doesn't support condition pushdown.  So ALL this data has to be
>>>>> sent from our CS boxes to our middleware and filtered there (wasting a lot
>>>>> of network IO).
>>>>>
>>>>> Is there away (including refactoring the code) that I could push this
>>>>> this into CS?  Maybe some way I could discovery the CS topology and put
>>>>> daemons on each of our CS boxes and fetch from CS directly (doing the
>>>>> filtering there).
>>>>>
>>>>> Thoughts?
>>>>>
>>>>> --
>>>>>
>>>>> We’re hiring if you know of any awesome Java Devops or Linux
>>>>> Operations Engineers!
>>>>>
>>>>> Founder/CEO Spinn3r.com
>>>>> Location: *San Francisco, CA*
>>>>> blog: http://burtonator.wordpress.com
>>>>> … or check out my Google+ profile
>>>>> <https://plus.google.com/102718274791889610666/posts>
>>>>>
>>>>>
>>>
>>
>>
>> --
>>
>> We’re hiring if you know of any awesome Java Devops or Linux Operations
>> Engineers!
>>
>> Founder/CEO Spinn3r.com
>> Location: *San Francisco, CA*
>> blog: http://burtonator.wordpress.com
>> … or check out my Google+ profile
>> <https://plus.google.com/102718274791889610666/posts>
>>
>>
>
>
> --
> Kurt Greaves
> k...@instaclustr.com
> www.instaclustr.com
>

Reply via email to