Re: Efficient datatypes?

Stephan Ewen Thu, 19 Feb 2015 02:15:05 -0800

Hey!

All data types are always kept serialized for caching/hashing/sorting.
Deserialization is sometimes needed in the internal algorithms (on hash
collisions and sort-prefix collisions). The most efficient data types for
that are actually Tuples. POJOs and other data types are a little less
efficient for that, but we are trying to improve that.


Stephan


On Thu, Feb 19, 2015 at 10:23 AM, Kruse, Sebastian <sebastian.kr...@hpi.de>
wrote:

>  Hi everyone,
>
>
>
> I think that during one of the meetups, it was mentioned that Flink can in
> some cases operate on serialized data. Given I understood that correctly,
> which cases that would be, i.e, which data types and operators support such
> a feature?
>
>
>
> Cheers,
>
> Sebastian
>
>
>
>
>
> ---
>
> Sebastian Kruse
>
> Doktorand am Fachbereich Information Systems Group
>
> Hasso-Plattner-Institut an der Universität Potsdam
> Prof.-Dr.-Helmert-Str. 2-3, D-14482 Potsdam
> Tel +49 331 5509 240
>
> Amtsgericht Potsdam, HRB 12184
> Geschäftsführung: Prof. Dr. Christoph Meinel
>
>
>
>
>

Re: Efficient datatypes?

Reply via email to