[ 
https://issues.apache.org/jira/browse/FLINK-10525?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rinat Sharipov updated FLINK-10525:
-----------------------------------
    Description: 
Hi mates, according to the contract of 
*org.apache.flink.api.common.serialization.DeserializationSchema*, it should 
return *null* when the content can’t be deserialized.

But in most cases (e.g. 
*org.apache.flink.formats.avro.AvroDeserializationSchema*) the method fails if 
the data is corrupted. 
  
 We’ve implemented our own SerDe class that returns *null* if the data doesn’t 
satisfy the Avro schema, but it’s rather hard to maintain this functionality 
when migrating to the latest Flink version. 

I think it would be a useful feature if Flink supported optionally skipping 
failed records in the Avro and other deserializers.
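
A minimal sketch of the behavior requested above, not Flink code: the 
*Deserializer* interface is a hypothetical stand-in that only mirrors the shape 
of *DeserializationSchema#deserialize(byte[])*, and the class names and the toy 
integer format are assumptions made for illustration. The wrapper catches any 
failure of the inner deserializer and returns *null*, so the caller can skip 
the record.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical stand-in for the deserialize(byte[]) method of Flink's
// DeserializationSchema; it is NOT the Flink interface itself.
interface Deserializer<T> {
    T deserialize(byte[] message) throws Exception;
}

// Wraps any deserializer and returns null instead of failing on bad input.
final class SkipOnFailureDeserializer<T> implements Deserializer<T> {
    private final Deserializer<T> inner;

    SkipOnFailureDeserializer(Deserializer<T> inner) {
        this.inner = inner;
    }

    @Override
    public T deserialize(byte[] message) {
        try {
            return inner.deserialize(message);
        } catch (Exception e) {
            // Corrupted record: signal "skip" with null instead of rethrowing.
            return null;
        }
    }
}

final class SkipOnFailureDemo {
    // Deserializes every record and drops the ones that came back as null.
    static <T> List<T> readAll(Deserializer<T> schema, List<byte[]> records)
            throws Exception {
        List<T> out = new ArrayList<>();
        for (byte[] record : records) {
            T value = schema.deserialize(record);
            if (value != null) {
                out.add(value);
            }
        }
        return out;
    }

    public static void main(String[] args) throws Exception {
        // Toy format (an assumption): each record is a decimal integer in UTF-8.
        Deserializer<Integer> strict =
            bytes -> Integer.parseInt(new String(bytes, StandardCharsets.UTF_8));
        Deserializer<Integer> lenient = new SkipOnFailureDeserializer<>(strict);

        List<byte[]> records = Arrays.asList(
            "1".getBytes(StandardCharsets.UTF_8),
            "oops".getBytes(StandardCharsets.UTF_8),
            "3".getBytes(StandardCharsets.UTF_8));

        System.out.println(readAll(lenient, records)); // prints [1, 3]
    }
}
```

Swallowing every exception hides real bugs as well as corrupted data, so an 
actual implementation would probably make the skipping optional and at least 
count or log the dropped records.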

  was:
Hi mates, according to the contract of 
*org.apache.flink.api.common.serialization.DeserializationSchema*, it should 
return *null* when the content can’t be deserialized.

But in most cases (e.g. 
*org.apache.flink.formats.avro.AvroDeserializationSchema*) the method fails if 
the data is corrupted. 
  
 We’ve implemented our own SerDe class that returns *null* if the data doesn’t 
satisfy the Avro schema, but it’s rather hard to maintain this functionality 
when migrating to the latest Flink version. 

I think it would be useful if Flink supported optionally skipping failed 
records in the Avro and other deserializers in the source code


> Deserialization schema, skip data, that couldn't be properly deserialized
> -------------------------------------------------------------------------
>
>                 Key: FLINK-10525
>                 URL: https://issues.apache.org/jira/browse/FLINK-10525
>             Project: Flink
>          Issue Type: Improvement
>          Components: Core
>            Reporter: Rinat Sharipov
>            Priority: Minor
>
> Hi mates, according to the contract of 
> *org.apache.flink.api.common.serialization.DeserializationSchema*, it should 
> return *null* when the content can’t be deserialized.
> But in most cases (e.g. 
> *org.apache.flink.formats.avro.AvroDeserializationSchema*) the method fails if 
> the data is corrupted. 
>   
>  We’ve implemented our own SerDe class that returns *null* if the data doesn’t 
> satisfy the Avro schema, but it’s rather hard to maintain this functionality 
> when migrating to the latest Flink version. 
> I think it would be a useful feature if Flink supported optionally skipping 
> failed records in the Avro and other deserializers.



