Hi Federico,
1) Which version of Flink are you using?
2) Can you also share the JobManager log?
3) Why do you think, Flink is stuck at the BlobCache? Is it really blocked, or 
do you still have CPU load? Can you post stack traces of the TaskManager (TM) 
and JobManager processes when you think they are stuck (using jstack)?
4) These PUT requests in the TM logs are strange, unless you showed the TM 
logs in the web interface - did you?


Nico


On Thursday, 31 August 2017 09:45:59 CEST Fabian Hueske wrote:
> Hi Federico,
> 
> Not sure what's going on there but Nico (in CC) is more familiar with the
> blob cache and might be able to help.
> 
> Best, Fabian
> 
> 2017-08-30 15:35 GMT+02:00 Federico D'Ambrosio <fedex...@gmail.com>:
> > Hi,
> > 
> > I have a rather simple Flink job which has a KinesisConsumer as a source
> > and an HBase table as sink, in which I write using writeOutputFormat. I'm
> > running it on a local machine with a single taskmanager (2 slots, 2G). The
> > KinesisConsumer works fine and the connection to the HBase table gets
> > opened fine (i.e. the open method of the class implementing OutputFormat
> > gets actually called).
> > 
> > I'm running the job at a parallelism of 2, while the sink has a
> > parallelism of 1. The
> > 
> > Still, looking at the log I see that after opening the connection, the job
> > gets stuck at lines like this one:
> > 
> > INFO  org.apache.flink.runtime.blob.BlobCache                       -
> > Downloading 8638bdf78b0e540786de6c291f710a8db447a2b4 from
> > localhost/127.0.0.1:43268
> > 
> > Each following one another, like this:
> > 
> > 2017-08-30 14:17:21,318 INFO  org.apache.flink.runtime.blob.BlobCache     
> >                  - Created BLOB cache storage directory
> > /tmp/blobStore-8a2a96af-b836-4c95-b79a-a4b80929126f 2017-08-30
> > 14:17:21,321 DEBUG org.apache.flink.runtime.blob.BlobClient              
> >        - PUT content addressable BLOB stream to /127.0.0.1:59937
> > 2017-08-30 14:17:21,323 DEBUG
> > org.apache.flink.runtime.blob.BlobServerConnection            - Received
> > PUT request for content addressable BLOB 2017-08-30 14:17:21,324 INFO 
> > org.apache.flink.runtime.blob.BlobCache                       -
> > Downloading 3ff486dff4c4eaafdab42b30a877326e62bfca82 from
> > localhost/127.0.0.1:43268 2017-08-30 14:17:21,324 DEBUG
> > org.apache.flink.runtime.blob.BlobClient                      - GET
> > content addressable BLOB 3ff486dff4c4eaafdab42b30a877326e62bfca82 from
> > /127.0.0.1:59938 2017-08-30 14:18:13,708 DEBUG
> > org.apache.flink.runtime.blob.BlobClient                      - PUT
> > content addressable BLOB stream to /127.0.0.1:59976 2017-08-30
> > 14:18:13,708 DEBUG org.apache.flink.runtime.blob.BlobServerConnection    
> >        - Received PUT request for content addressable BLOB 2017-08-30
> > 14:18:13,710 INFO  org.apache.flink.runtime.blob.BlobCache               
> >        - Downloading 2f5283326aab77faa047b705cd1d6470035b3b7d from
> > localhost/127.0.0.1:43268 2017-08-30 14:18:13,710 DEBUG
> > org.apache.flink.runtime.blob.BlobClient                      - GET
> > content addressable BLOB 2f5283326aab77faa047b705cd1d6470035b3b7d from
> > /127.0.0.1:59978 2017-08-30 14:19:29,811 DEBUG
> > org.apache.flink.runtime.blob.BlobClient                      - PUT
> > content addressable BLOB stream to /127.0.0.1:60022 2017-08-30
> > 14:19:29,812 DEBUG org.apache.flink.runtime.blob.BlobServerConnection    
> >        - Received PUT request for content addressable BLOB 2017-08-30
> > 14:19:29,814 INFO  org.apache.flink.runtime.blob.BlobCache               
> >        - Downloading f91fd7ecec6f90809f52ee189cb48aa1e30b04f6 from
> > localhost/127.0.0.1:43268 2017-08-30 14:19:29,814 DEBUG
> > org.apache.flink.runtime.blob.BlobClient                      - GET
> > content addressable BLOB f91fd7ecec6f90809f52ee189cb48aa1e30b04f6 from
> > /127.0.0.1:60024 2017-08-30 14:21:42,856 DEBUG
> > org.apache.flink.runtime.blob.BlobClient                      - PUT
> > content addressable BLOB stream to /127.0.0.1:60110 2017-08-30
> > 14:21:42,856 DEBUG org.apache.flink.runtime.blob.BlobServerConnection    
> >        - Received PUT request for content addressable BLOB 2017-08-30
> > 14:21:42,858 INFO  org.apache.flink.runtime.blob.BlobCache               
> >        - Downloading 8638bdf78b0e540786de6c291f710a8db447a2b4 from
> > localhost/127.0.0.1:43268 2017-08-30 14:21:42,859 DEBUG
> > org.apache.flink.runtime.blob.BlobClient                      - GET
> > content addressable BLOB 8638bdf78b0e540786de6c291f710a8db447a2b4 from
> > /127.0.0.1:60112 2017-08-30 14:26:11,242 DEBUG
> > org.apache.flink.runtime.blob.BlobClient                      - PUT
> > content addressable BLOB stream to /127.0.0.1:60295 2017-08-30
> > 14:26:11,243 DEBUG org.apache.flink.runtime.blob.BlobServerConnection    
> >        - Received PUT request for content addressable BLOB 2017-08-30
> > 14:26:11,247 INFO  org.apache.flink.runtime.blob.BlobCache               
> >        - Downloading 6d30c88539d511bb9acc13b53bb2a128614f5621 from
> > localhost/127.0.0.1:43268 2017-08-30 14:26:11,247 DEBUG
> > org.apache.flink.runtime.blob.BlobClient                      - GET
> > content addressable BLOB 6d30c88539d511bb9acc13b53bb2a128614f5621 from
> > /127.0.0.1:60297 2017-08-30 14:29:20,942 DEBUG
> > org.apache.flink.runtime.blob.BlobClient                      - PUT
> > content addressable BLOB stream to /127.0.0.1:60410
> > 
> > 
> > My questions are: what is the jobmanager doing here? Why is he taking ages
> > to do this? How do i speed up this behaviour?
> > 
> > Thank you very much for your attention,
> > 
> > Federico D'Ambrosio

Attachment: signature.asc
Description: This is a digitally signed message part.

Reply via email to