Hi Flavio,

it is not possible to split by line count because that would mean to read
and parse the file just for splitting.

Parallel processing of data sources depends on the input splits created by
the InputFormat. Local files can be split just like files in HDFS. Usually,
each file corresponds to at least one split but multiple files could also
be put into a single split if necessary.The logic for that would go into to
the InputFormat.createInputSplits() method.

Cheers, Fabian

2015-10-07 14:47 GMT+02:00 Flavio Pompermaier <pomperma...@okkam.it>:

> Hi to all,
>
> is there a way to split a single local file by line count (e.g. a split
> every 100 lines) in a LocalEnvironment to speed up a simple map function?
> For me it is not very clear how the local files (files into directory if
> recursive=true) are managed by Flink..is there any ref to this internals?
>
> Best,
> Flavio
>

Reply via email to