Jacek Wislicki created FLINK-28609:
--------------------------------------

             Summary: Flink-Pulsar connector fails on larger schemas
                 Key: FLINK-28609
                 URL: https://issues.apache.org/jira/browse/FLINK-28609
             Project: Flink
          Issue Type: Bug
          Components: Connectors / Pulsar
    Affects Versions: 1.15.1, 1.14.5, 1.14.4, 1.14.3
            Reporter: Jacek Wislicki
         Attachments: exception.txt

When a model results in a larger schema (this seems to be related to the size 
of its byte array representation), the number of bytes expected to be read 
differs from the number of bytes actually read: [^exception.txt]. In such a 
case, "read" is always 1018, while the expected "byteLen" is greater. For 
smaller schemata, the two numbers are equal (less than 1018) and no issue occurs.
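
A mismatch of this shape can be illustrated outside Flink. The sketch below is 
an assumption about the kind of mechanism involved, not the connector's actual 
code: a single call to ObjectInputStream.read(byte[]) may return fewer bytes 
than requested, while readFully keeps reading until the array is filled. The 
class and method names (ShortReadSketch, writeLengthPrefixed, readOnce, 
readAll) are hypothetical and only java.io is used.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.util.Arrays;

// Hypothetical illustration; not taken from the Flink-Pulsar connector.
class ShortReadSketch {

    // Writes an int length followed by the payload bytes.
    static byte[] writeLengthPrefixed(byte[] payload) throws IOException {
        ByteArrayOutputStream bos = new ByteArrayOutputStream();
        try (ObjectOutputStream oos = new ObjectOutputStream(bos)) {
            oos.writeInt(payload.length);
            oos.write(payload);
        }
        return bos.toByteArray();
    }

    // Reads the length, then issues a single read() call.
    // Returns the number of bytes that call actually delivered,
    // which is allowed to be smaller than the requested length.
    static int readOnce(byte[] serialized) throws IOException {
        try (ObjectInputStream ois =
                new ObjectInputStream(new ByteArrayInputStream(serialized))) {
            int byteLen = ois.readInt();
            byte[] buf = new byte[byteLen];
            return ois.read(buf);
        }
    }

    // Reads the length, then uses readFully(), which either fills
    // the whole array or throws EOFException.
    static byte[] readAll(byte[] serialized) throws IOException {
        try (ObjectInputStream ois =
                new ObjectInputStream(new ByteArrayInputStream(serialized))) {
            int byteLen = ois.readInt();
            byte[] buf = new byte[byteLen];
            ois.readFully(buf);
            return buf;
        }
    }

    public static void main(String[] args) throws IOException {
        byte[] small = new byte[100];
        byte[] large = new byte[2000];
        Arrays.fill(large, (byte) 7);

        System.out.println("small: expected=" + small.length
                + " read=" + readOnce(writeLengthPrefixed(small)));
        System.out.println("large: expected=" + large.length
                + " read=" + readOnce(writeLengthPrefixed(large)));
        System.out.println("large via readFully matches: "
                + Arrays.equals(large, readAll(writeLengthPrefixed(large))));
    }
}
```

If the deserialisation path compares the result of a single read() against 
"byteLen", small payloads would pass and larger ones would fail, which matches 
the symptom described above. Whether this is the actual cause in the connector 
remains to be verified against its source.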

A reproduction of the problem is on 
[GitHub|https://github.com/JacekWislicki/vp-test2]. There are 2 simple jobs 
(SimpleJob1 and SimpleJob2) using basic models for the Pulsar source definition 
(PulsarMessage1 and PulsarMessage2, respectively). Each of the corresponding 
schemata is properly serialised and deserialised until its effective byte 
array length becomes excessive (marked with "the problem begins" in the model 
classes). The failure can be triggered by a large number of fields 
(PulsarMessage1) or just by longer field names (PulsarMessage2). The problem 
occurs with either an Avro or a JSON schema set in the Pulsar source definition.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
