Hey, The thread you are referring to is about DataStream API job and long checkpointing issue. While from your message it seems like you are using Table API (SQL) to process a batch data? Or what exactly do you mean by:
> i notice that there are one or two subtasks that take too long to finish Aside from that, don’t you have just a problem with a data skew, where some subset of keys are more heavily used than others? Piotrek > On 31 Mar 2020, at 01:43, Fanbin Bu <fanbin...@coinbase.com> wrote: > > Hi, > > I m running flink 1.9 on EMR using flink sql blink planner reading and > writing to JDBC input/output. my sql is just a listagg over window for the > last 7 days. However, i notice that there are one or two subtasks that take > too long to finish. In this thread > http://mail-archives.apache.org/mod_mbox/flink-user/201901.mbox/%3CCAEv5b0yD+0WBXgAnfT0b=ZqLC8rPE9_izzE3g+9Vxw8oK9w2=a...@mail.gmail.com%3E > > <http://mail-archives.apache.org/mod_mbox/flink-user/201901.mbox/%3CCAEv5b0yD+0WBXgAnfT0b=ZqLC8rPE9_izzE3g+9Vxw8oK9w2=a...@mail.gmail.com%3E>, > that is a similar issue. > > Any idea on how to debug this? > > Thanks > Fanbin >