Re: Distributing Tasks over Task manager

Jürgen Thomann Wed, 12 Oct 2016 12:13:37 -0700

Hi Robert,

Thanks for your suggestions. We are using the DataStream API and I triedit with disabling it completely, but that didn't help.

I attached the plan and to add some context, it starts with a Kafkasource followed by a map operation ( parallelism 4). The next map is theexpensive part with a parallelism of 18 which produces a Tuple2 which isused for splitting. Starting here the parallelism is always 2 except thesink with 1. Both resulting streams have two maps, a filter, one moremap and are ending with an assignTimestampsAndWatermarks. If there isnow a small box in the picture it is a filter operation and otherwise itgoes directly to a keyBy, timewindow and apply operation followed by a sink.

If one task manager contains more sub tasks of the expensive map thanany other task manager, everything later in the stream is running on thesame task manager. If two task manager have the same amount of subtasks, the following tasks with a parallelism of 2 are distributed overthe two task manager.

Interesting is also that the task manager have 6 task slots configuredand the expensive part has 6 sub tasks on one task manager but stilleverything later in the flow is running on this task manager. This alsohappens if operator chaining is disabled.


Best,
Jürgen


On 12.10.2016 17:43, Robert Metzger wrote:

Hi Jürgen,

Are you using the DataStream or the DataSet API?
Maybe the operator chaining is causing too many operations to be"packed" into one task. Check out this documentation page:https://ci.apache.org/projects/flink/flink-docs-master/dev/datastream_api.html#task-chaining-and-resource-groupsYou could try to disable chaining completely to see if that resolvesthe issue (you'll probably pay for this by having more serializationoverhead and network traffic).
If my suggestions don't help, can you post a screenshot of your jobplan (from the web interface) here, so that we see what operations youare performing?
Regards,
Robert
On Wed, Oct 12, 2016 at 12:52 PM, Jürgen Thomann<[email protected] <mailto:[email protected]>>wrote:
    Hi,

    we currently have an issue with Flink, as it allocates many tasks
    to the same task manager and as a result it overloads it. I
    reduced the amount of task slots per task manager (keeping the CPU
    count) and added some more servers but that did not help to
    distribute the load.

    Is there some way to force Flink to distribute the load/tasks on a
    standalone cluster? I saw that
    https://issues.apache.org/jira/browse/FLINK-1003
    <https://issues.apache.org/jira/browse/FLINK-1003> would maybe
    provide what we need, but that is currently not worked on as it seems.

    Cheers,
    Jürgen

Re: Distributing Tasks over Task manager

Reply via email to