Hi, Thanks for the confirmation. In this case should we just leave things as they are - and if anyone is interested in having consistent processing each can build the wrapper you mention?
-----Original Message----- From: Fabian Hueske [mailto:fhue...@gmail.com] Sent: Monday, February 26, 2018 9:34 AM To: dev@flink.apache.org Subject: Re: RowSerializer Hi Radu, none of Flink's serializers adds checksums to ensure data integrity. It would be possible to implement a wrapping serializer that adds a checksum to each record, but that would be at the cost of performance. Not sure if this is done at some point in Flink, maybe for savepoints. Best, Fabian 2018-02-23 14:44 GMT+01:00 Radu Tudoran <radu.tudo...@huawei.com>: > Hi, > > > > I was using the RowSerializer (package org.apache.flink.api.java. > typeutils.runtime;) recently to serialize Rows to file (for reading > them back in the future). > > I observed a strange behavior that I would like to double check with > you in case this is a serious problem to be addressed: > > When the rowserializer is used to convert data back, there is no check > for the consistency of the data (e.g., size of the object that was > serialized, checksum…). This leads to situation that for random reads > of bytes it can happen that inconsistent objects are deserialized, > which of course can lead to inconsistent data. > > > > For example if we serialize object of the form (Int, Long, Double, > String > String) – if we have available only 1/3 of the bytes, we can end up > reading back objects as (0,0,0,null, null) – this is not the only > example of how the object can be incorrectly deserialized …rather than > having an error for this process. > > > > Hence, I wanted to double check if this is an intended behavior for > some reason and if we should consider fixing the rowserializer to > guarantee integrity of the objects that are deserialized. > > > > Best regards, > > > > Dr. Radu Tudoran > > Staff Research Engineer - Big Data Expert > > IT R&D Division > > > > [image: cid:image007.jpg@01CD52EB.AD060EE0] > > HUAWEI TECHNOLOGIES Duesseldorf GmbH > > German Research Center > > Munich Office > > Riesstrasse 25, 80992 > <https://maps.google.com/?q=Riesstrasse+25,+80992&entry=gmail&source=g > > > München > > > > E-mail: *radu.tudo...@huawei.com <radu.tudo...@huawei.com>* > > Mobile: +49 15209084330 <+49%201520%209084330> > > Telephone: +49 891588344173 <+49%2089%201588344173> > > > > HUAWEI TECHNOLOGIES Duesseldorf GmbH > Hansaallee 205, 40549 Düsseldorf, Germany > <https://maps.google.com/?q=Hansaallee+205,+40549+D%C3%BCsseldorf,+Ger > many&entry=gmail&source=g>, > www.huawei.com > Registered Office: Düsseldorf, Register Court Düsseldorf, HRB 56063, > Managing Director: Bo PENG, Qiuen Peng, Shengli Wang Sitz der > Gesellschaft: Düsseldorf, Amtsgericht Düsseldorf, HRB 56063, > Geschäftsführer: Bo PENG, Qiuen Peng, Shengli Wang > > This e-mail and its attachments contain confidential information from > HUAWEI, which is intended only for the person or entity whose address > is listed above. Any use of the information contained herein in any > way (including, but not limited to, total or partial disclosure, > reproduction, or dissemination) by persons other than the intended > recipient(s) is prohibited. If you receive this e-mail in error, > please notify the sender by phone or email immediately and delete it! > > >