Till Rohrmann created FLINK-6525:
------------------------------------

             Summary: Transferred TM log/stdout files are never removed from BlobStore
                 Key: FLINK-6525
                 URL: https://issues.apache.org/jira/browse/FLINK-6525
             Project: Flink
          Issue Type: Bug
          Components: Distributed Coordination
    Affects Versions: 1.3.0, 1.4.0
            Reporter: Till Rohrmann


The {{TaskManager}} uses the {{BlobClient}} to upload its stdout/log file to 
the {{BlobServer}}. If HA mode is enabled, these files are also uploaded to 
the {{BlobStore}}. The {{TaskManagerLogHandler}} only cleans up a TM's file 
once it has received another file from the same TM, and it does so in a 
non-thread-safe manner, so files can easily be left behind in the 
{{BlobStore}}.

I think we should not upload these kinds of files to the persistent/HA 
{{BlobStore}}. We could do this by introducing a storage mode when uploading 
files to the {{BlobServer}} (e.g. {{HA_STORAGE}} vs. {{LOCAL_STORAGE}}). 
Additionally, we should register a timeout for files that are only stored 
locally, or at least store them under their {{JobID}} so that they are also 
cleaned up once the job is cleaned up.
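
A rough sketch of what this could look like, to make the proposal concrete. 
Everything here is hypothetical: {{StorageMode}}, {{SketchBlobServer}} and its 
methods are illustrative names, not existing Flink classes, and the in-memory 
maps only stand in for the local blob directory and the HA {{BlobStore}}. The 
caller passes the current time explicitly so the timeout logic stays easy to 
follow.

```java
import java.util.Map;
import java.util.Set;
import java.util.concurrent.ConcurrentHashMap;

/** Hypothetical storage mode as proposed above; not an existing Flink type. */
enum StorageMode { HA_STORAGE, LOCAL_STORAGE }

/** Toy stand-in for a blob server; names and behavior are assumptions. */
class SketchBlobServer {

    /** One stored file together with its owning job and expiry time. */
    private static final class Entry {
        final byte[] data;
        final String jobId;
        final long expiresAtMillis; // Long.MAX_VALUE means "no timeout"

        Entry(byte[] data, String jobId, long expiresAtMillis) {
            this.data = data;
            this.jobId = jobId;
            this.expiresAtMillis = expiresAtMillis;
        }
    }

    // Stand-in for the files kept on the server's local disk.
    private final Map<String, Entry> localFiles = new ConcurrentHashMap<>();
    // Stand-in for the persistent/HA BlobStore (keys only).
    private final Set<String> haStore = ConcurrentHashMap.newKeySet();
    private final long localTtlMillis;

    SketchBlobServer(long localTtlMillis) {
        this.localTtlMillis = localTtlMillis;
    }

    /** Stores a file; only HA_STORAGE uploads are mirrored to the HA store. */
    void put(String key, String jobId, byte[] data, StorageMode mode, long nowMillis) {
        boolean ha = mode == StorageMode.HA_STORAGE;
        // Only locally stored files get a timeout.
        long expiry = ha ? Long.MAX_VALUE : nowMillis + localTtlMillis;
        localFiles.put(key, new Entry(data, jobId, expiry));
        if (ha) {
            haStore.add(key);
        }
    }

    /** Drops locally stored files whose timeout has elapsed. */
    void expire(long nowMillis) {
        localFiles.entrySet().removeIf(e -> e.getValue().expiresAtMillis <= nowMillis);
    }

    /** Drops every file stored under the given job, in both stores. */
    void cleanupJob(String jobId) {
        localFiles.entrySet().removeIf(e -> {
            if (!e.getValue().jobId.equals(jobId)) {
                return false;
            }
            haStore.remove(e.getKey());
            return true;
        });
    }

    boolean isLocal(String key) {
        return localFiles.containsKey(key);
    }

    boolean isInHaStore(String key) {
        return haStore.contains(key);
    }
}
```

With this shape, a TM log upload would use {{LOCAL_STORAGE}} and therefore 
never reach the HA store, and it would disappear either when its timeout 
elapses or when its job is cleaned up, whichever happens first.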



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)
