Hi all, The current interfaces for sources in Pulsar IO are geared towards streaming sources where data is available on a continuous basis. There exist a whole bunch of data sources where data is not available on a continuous/streaming fashion, but rather arrives periodically/in spurts. These set of 'Batch Sources' have a set of common characteristics that might warrant framework level support in Pulsar IO.
Jerry and myself have jotted down the ideas around this in PIP-65. Please review it and let us know what you think. https://github.com/apache/pulsar/wiki/PIP-65:-Adapting-Pulsar-IO-Sources-to-support-Batch-Sources Thanks!