Re: Batch jobs stalling after initial progress

2019-03-13 Thread Ken Krugler
Hi Marko, Some things that have caused my jobs to run very slowly (though not completely stall) 1. Cross-joins generating huge result sets. 2. Joins causing very large spills to disk. 3. Slow external API access With streaming, iterations can cause stalls, but I don’t think that’s true for

Batch jobs stalling after initial progress

2019-03-13 Thread Marko Mušnjak
Hi, I'm running flink batch jobs on EMR 5.21, and I'm seeing many (>50%) jobs stall and make no progress after some initial period. I've seen the behaviour earlier (5.17), but not nearly as much as now. The job is a fairly simple enrichment job, loading an avro metadata file, creating several dat