Hi Nick,

On a project I worked on, we simply made the file accessible on a shared
NFS drive.
Our source was custom, and we forced it to parallelism 1 inside the job, so
the file wouldn't be read multiple times. The rest of the job was
distributed.
This was also on a standalone cluster. On a resource managed cluster I
guess the resource manager could take care of copying the file for us.

Hope this can help. If there would have been a better solution, I'm also
happy to hear it :).

Regards,

Laurent.

On Tue, Jun 23, 2020, 20:51 Nick Bendtner <buggi...@gmail.com> wrote:

> Hi guys,
> What is the best way to process a file from a unix file system since there
> is no guarantee as to which task manager will be assigned to process the
> file. We run flink in standalone mode. We currently follow the brute force
> way in which we copy the file to every task manager, is there a better way
> to do this ?
>
>
> Best,
> Nick.
>

-- 
♻ Be green, keep it on the screen

Reply via email to