Re: S3 bucket as a source

2023-12-05 Thread Feng Jin
Hi Matwey, I think you can write a custom inputFormat to meet your needs and use the FileSource::forBulkFileFormat interface to create a FileSource. In the custom inputFormat, you can choose to read only the file's metadata without reading its content. https://github.com/apache/flink/blob/1
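
To illustrate the "metadata only" idea independently of Flink, here is a minimal sketch in Python. It is not the Flink API and it does not talk to S3: a local directory stands in for the bucket, and the names ObjectMeta and iter_object_meta are hypothetical, made up for this example. The point is only that each emitted record is built from a stat call, so the file content is never opened or read.

```python
import os
from dataclasses import dataclass
from typing import Iterator


@dataclass(frozen=True)
class ObjectMeta:
    """Hypothetical record type: one element of the metadata stream."""
    key: str              # path relative to the root, with "/" separators
    size: int             # size in bytes, taken from stat
    last_modified: float  # modification time (seconds since the epoch)


def iter_object_meta(root: str) -> Iterator[ObjectMeta]:
    """Lazily yield one ObjectMeta per file under root.

    Assumption: a local directory stands in for the bucket. Only os.stat
    is called on each file, so no file content is read.
    """
    for dirpath, dirnames, filenames in os.walk(root):
        dirnames.sort()  # make the traversal order deterministic
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            st = os.stat(path)
            key = os.path.relpath(path, root).replace(os.sep, "/")
            yield ObjectMeta(key=key, size=st.st_size, last_modified=st.st_mtime)


if __name__ == "__main__":
    # Print the metadata of every file under the current directory.
    for meta in iter_object_meta("."):
        print(meta)
```

In a real Flink job the same role would be played by the custom format handed to FileSource, with the reader emitting a metadata record per file split instead of decoding the file body.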

S3 bucket as a source

2023-12-05 Thread Matwey V. Kornilov
Hello, I have an S3 bucket and I would like to process the objects' metainfo (such as keys (filenames), metainfo, tags, etc.). I don't care about the objects' content since it is irrelevant to my task. What I want is to construct a data stream where each instance is a metainfo attached to some