Re: Thrift object serialization

2017-05-16 Thread Flavio Pompermaier
Ok thanks Gordon! It would be nice to have a benchmark also on this ;) Thanks a lot for the support, Flavio On Tue, May 16, 2017 at 9:41 AM, Tzu-Li (Gordon) Tai wrote: > If you don’t register the TBaseSerializer for your MyThriftObj (or in > general don’t register any serializer for the Thrift

Re: Thrift object serialization

2017-05-16 Thread Tzu-Li (Gordon) Tai
If you don’t register the TBaseSerializer for your MyThriftObj (or in general don’t register any serializer for the Thrift class), I think Kryo’s default FieldSerializer will be used for it. The TBaseSerializer basically just uses TBase for de-/serialization as you normally would for the Thrift

Re: Thrift object serialization

2017-05-16 Thread Flavio Pompermaier
Hi Gordon, thanks for the link. Will the usage ofTBaseSerializer wrt Kryo lead to a performance gain? On Tue, May 16, 2017 at 7:32 AM, Tzu-Li (Gordon) Tai wrote: > Hi Flavio! > > I believe [1] has what you are looking for. Have you taken a look at that? > > Cheers, > Gordon > > [1] https://ci.ap

Re: Thrift object serialization

2017-05-15 Thread Tzu-Li (Gordon) Tai
Hi Flavio! I believe [1] has what you are looking for. Have you taken a look at that? Cheers, Gordon [1]  https://ci.apache.org/projects/flink/flink-docs-release-1.3/dev/custom_serializers.html On 15 May 2017 at 9:08:33 PM, Flavio Pompermaier (pomperma...@okkam.it) wrote: Hi to all, in my Flin

Thrift object serialization

2017-05-15 Thread Flavio Pompermaier
Hi to all, in my Flink job I create a Dataset using HadoopInputFormat in this way: HadoopInputFormat inputFormat = new HadoopInputFormat<>( new ParquetThriftInputFormat(), Void.class, MyThriftObj.class, job); FileInputFormat.addInputPath(job, new org.apache.hadoop.fs.Path(inputPath); *Dat