2018-02-20, 14:31:58 2018-02-20, 14:52:28 20m 30s Map (Map at com.rfk.dataplatform.batch.jobs.topk.TopkOperations$$anonfun$4.apply(TopkOperations.scala:128)) 10.8 GB 130,639,359 10.8 GB 130,639,359 16 00016000 FINISHED Start Time End Time Duration Bytes received Records received Bytes sent Records sent Attempt Host Status 2018-02-20, 14:43:05 2018-02-20, 14:52:28 9m 22s 693 MB 8,169,369 693 MB 8,169,369 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:31:58 2018-02-20, 14:32:35 37s 692 MB 8,164,898 692 MB 8,164,898 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:45:52 2018-02-20, 14:52:25 6m 32s 692 MB 8,160,648 692 MB 8,160,648 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:32:53 2018-02-20, 14:33:30 36s 692 MB 8,164,117 692 MB 8,164,117 1 ip-10-17-11-156:53921 FINISHED 2018-02-20, 14:39:05 2018-02-20, 14:39:43 37s 692 MB 8,168,042 692 MB 8,168,042 1 ip-10-17-11-156:53921 FINISHED 2018-02-20, 14:42:12 2018-02-20, 14:46:57 4m 45s 692 MB 8,161,923 692 MB 8,161,923 1 ip-10-17-11-156:53921 FINISHED 2018-02-20, 14:38:13 2018-02-20, 14:38:47 34s 692 MB 8,163,351 692 MB 8,163,351 1 ip-10-17-8-168:54366 FINISHED 2018-02-20, 14:39:34 2018-02-20, 14:40:08 33s 692 MB 8,163,694 692 MB 8,163,694 1 ip-10-17-8-168:54366 FINISHED 2018-02-20, 14:32:09 2018-02-20, 14:32:42 33s 692 MB 8,165,675 692 MB 8,165,675 1 ip-10-17-8-168:54366 FINISHED 2018-02-20, 14:41:34 2018-02-20, 14:46:52 5m 17s 692 MB 8,165,679 692 MB 8,165,679 1 ip-10-17-8-193:33639 FINISHED 2018-02-20, 14:44:03 2018-02-20, 14:47:10 3m 6s 692 MB 8,165,245 692 MB 8,165,245 1 ip-10-17-8-193:33639 FINISHED 2018-02-20, 14:41:20 2018-02-20, 14:41:54 34s 692 MB 8,168,041 692 MB 8,168,041 1 ip-10-17-8-193:33639 FINISHED 2018-02-20, 14:40:55 2018-02-20, 14:41:32 36s 692 MB 8,167,142 692 MB 8,167,142 1 ip-10-17-9-52:36094 FINISHED 2018-02-20, 14:41:35 2018-02-20, 14:46:54 5m 18s 692 MB 8,161,355 692 MB 8,161,355 1 ip-10-17-9-52:36094 FINISHED 2018-02-20, 14:40:08 2018-02-20, 14:40:52 44s 692 MB 8,166,737 692 MB 8,166,737 1 ip-10-17-9-52:36094 FINISHED 2018-02-20, 14:44:23 2018-02-20, 14:47:12 2m 48s 692 MB 8,163,443 692 MB 8,163,443 1 ip-10-17-9-52:36094 FINISHED
2018-02-20, 14:31:58 2018-02-20, 14:59:18 27m 19s GroupReduce (topk.IntermsToTopkEntityOp.reduceGroup) 10.8 GB 130,639,359 3.53 GB 5,163,805 16 00016000 FINISHED Start Time End Time Duration Bytes received Records received Bytes sent Records sent Attempt Host Status 2018-02-20, 14:31:58 2018-02-20, 14:58:49 26m 51s 684 MB 8,098,138 226 MB 323,203 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:31:58 2018-02-20, 14:59:01 27m 3s 690 MB 8,210,429 226 MB 322,178 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:31:58 2018-02-20, 14:59:06 27m 8s 714 MB 8,483,239 226 MB 322,797 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:31:58 2018-02-20, 14:58:57 26m 58s 694 MB 8,176,076 226 MB 322,600 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:31:58 2018-02-20, 14:59:02 27m 4s 680 MB 8,005,934 226 MB 323,506 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:31:58 2018-02-20, 14:59:15 27m 16s 739 MB 8,708,468 227 MB 323,087 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:31:58 2018-02-20, 14:58:39 26m 41s 682 MB 8,015,473 225 MB 322,401 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:31:58 2018-02-20, 14:58:51 26m 53s 674 MB 7,994,360 226 MB 323,354 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:31:58 2018-02-20, 14:59:18 27m 19s 715 MB 8,581,459 226 MB 322,303 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:31:58 2018-02-20, 14:58:44 26m 45s 682 MB 7,912,704 228 MB 322,915 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:31:58 2018-02-20, 14:59:07 27m 8s 706 MB 8,288,227 226 MB 322,480 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:31:58 2018-02-20, 14:58:59 27m 1s 698 MB 8,152,011 225 MB 322,836 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:31:58 2018-02-20, 14:58:04 26m 5s 646 MB 7,598,798 226 MB 322,270 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:31:58 2018-02-20, 14:58:22 26m 24s 656 MB 7,769,116 225 MB 321,911 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:31:58 2018-02-20, 14:59:12 27m 14s 719 MB 8,440,687 226 MB 322,699 1 ip-10-17-10-20:46079 FINISHED 2018-02-20, 14:31:58 2018-02-20, 14:58:48 26m 50s 693 MB 8,204,240 227 MB 323,265 1 ip-10-17-10-20:46079 FINISHED > On 20-Feb-2018, at 3:42 PM, Aljoscha Krettek <aljos...@apache.org> wrote: > > Could you please send a screenshot? > >> On 20. Feb 2018, at 11:09, Aneesha Kaushal <aneesha.kaus...@reflektion.com >> <mailto:aneesha.kaus...@reflektion.com>> wrote: >> >> Hello Aljoscha >> >> I looked into the Subtasks session on Flink Dashboard, for the about two >> tasks. >> >> Thanks >> Aneesha >> >>> On 20-Feb-2018, at 3:32 PM, Aljoscha Krettek <aljos...@apache.org >>> <mailto:aljos...@apache.org>> wrote: >>> >>> Hi, >>> >>> Could you please also post where/how you see which tasks are mapped to >>> which slots/TaskManagers? >>> >>> Best, >>> Aljoscha >>> >>>> On 20. Feb 2018, at 10:50, Aneesha Kaushal <aneesha.kaus...@reflektion.com >>>> <mailto:aneesha.kaus...@reflektion.com>> wrote: >>>> >>>> Hello, >>>> >>>> I have a fink batch job, where I am grouping dataset on some keys, and >>>> then using group reduce. Parallelism is set to 16. >>>> The slots for the Map task is distributed across all the machines, but for >>>> GroupReduce all the slots are being assigned to the same machine. Can you >>>> help me understand why/when this can happen? >>>> Code looks something like: >>>> dataset.map(MapFunction()) >>>> .groupBy(<keys to groupon>) >>>> .sortGroup(<key to sort on>, Order.DESCENDING) >>>> .reduceGroup(GroupReduceFunction()).name("Group reduce") >>>> From flink dashboard: >>>> >>>> <Screen Shot 2018-02-20 at 2.39.35 PM.png> >>>> >>>> >>>> Thanks in advance >>>> Aneesha >>>> >>>> >>>> >>>> >>> >> >