Re: Streaming multiple csv files

2020-05-29 Thread Robert Metzger
Hi Nikola, you could implement a custom SourceFunction that handles this, for example: if the files are small (< 10 MB), send each file as a single record and then process it in a subsequent flatMap operation. If the files are large, split the work across the parallel sources and read them serially in the
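
To illustrate the small-file path suggested above (one whole file per record, expanded by a later flatMap step), here is a minimal plain-JDK sketch. It deliberately uses no Flink classes: FileRecord, toRows and flatMapFiles are hypothetical names, the sample file contents are made up, and in a real job the expansion logic would sit inside a Flink flatMap operator fed by the custom source.

```java
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

// Plain-JDK sketch of the "small files" path: one whole file per record,
// expanded into rows by a flatMap step. No Flink classes are used here.
class WholeFileRecords {

    // Hypothetical record type: the file name plus its full text content.
    static final class FileRecord {
        final String name;
        final String content;

        FileRecord(String name, String content) {
            this.name = name;
            this.content = content;
        }
    }

    // Splits one file record into its data rows, skipping the header line
    // and blank lines. Each row is tagged with the file it came from.
    static List<String> toRows(FileRecord file) {
        String[] lines = file.content.split("\\R");
        return Arrays.stream(lines)
                .skip(1)
                .filter(line -> !line.trim().isEmpty())
                .map(line -> file.name + ":" + line)
                .collect(Collectors.toList());
    }

    // The flatMap step: many file records in, one flat list of rows out.
    static List<String> flatMapFiles(List<FileRecord> files) {
        return files.stream()
                .flatMap(f -> toRows(f).stream())
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        // Made-up sample data, for illustration only.
        List<FileRecord> files = Arrays.asList(
                new FileRecord("file1", "id,name\n1,a\n2,b\n"),
                new FileRecord("file2", "sku,price\nx,9.5\n"));
        flatMapFiles(files).forEach(System.out::println);
    }
}
```

Tagging each row with its file name keeps the origin available downstream, so a later step can still choose the right parser per file.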

Streaming multiple csv files

2020-05-28 Thread Nikola Hrusov
Hello, I have multiple files (file1, file2, file3), each a CSV file with different columns and data. The set of column headers is finite and we know their format. I would like to take the files and parse them based on their column structure. I already have the parsers, e.g.: file1 has columns (id, firstna
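
One way to picture the "parse based on the column structure" idea from the question is a lookup from header line to parser. The sketch below is plain JDK and only an assumption about how such a dispatch could look: the header strings, the HeaderDispatch class and the string-producing parsers are hypothetical stand-ins for the parsers the poster already has, and the naive split on commas does not handle quoted fields.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.function.Function;

// Sketch: choose a row parser by looking at the file's header line.
class HeaderDispatch {

    // Known header line -> parser for rows with that layout.
    // Both headers and both parsers are hypothetical examples.
    static final Map<String, Function<String, String>> PARSERS = new HashMap<>();

    static {
        PARSERS.put("user_id,email", row -> {
            String[] cols = row.split(",", -1); // naive: no quoted-field support
            return "User(" + cols[0] + ", " + cols[1] + ")";
        });
        PARSERS.put("order_id,amount", row -> {
            String[] cols = row.split(",", -1);
            return "Order(" + cols[0] + ", " + Double.parseDouble(cols[1]) + ")";
        });
    }

    // Returns the parser registered for this header, or fails loudly
    // when the header is not one of the known layouts.
    static Function<String, String> parserFor(String headerLine) {
        Function<String, String> parser = PARSERS.get(headerLine.trim());
        if (parser == null) {
            throw new IllegalArgumentException("Unknown CSV header: " + headerLine);
        }
        return parser;
    }

    public static void main(String[] args) {
        System.out.println(parserFor("user_id,email").apply("7,a@b.c"));
        System.out.println(parserFor("order_id,amount").apply("42,9.5"));
    }
}
```

Failing on an unknown header, rather than guessing a layout, makes a file with an unexpected format visible immediately.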