Hi all,

The current interfaces for sources in Pulsar IO are geared towards
streaming sources, where data is available on a continuous basis. There
are many data sources where data is not available in a continuous,
streaming fashion, but instead arrives periodically or in spurts.
These 'Batch Sources' share a set of common characteristics that
might warrant framework-level support in Pulsar IO.

Jerry and I have written up our ideas on this in PIP-65. Please
review it and let us know what you think.

https://github.com/apache/pulsar/wiki/PIP-65:-Adapting-Pulsar-IO-Sources-to-support-Batch-Sources

Thanks!
