Juan Miguel Cejuela created FLINK-8046:
------------------------------------------
Summary: ContinuousFileMonitoringFunction wrongly ignores files with exact same timestamp
Key: FLINK-8046
URL: https://issues.apache.org/jira/browse/FLINK-8046
Project: Flink
Issue Type: Bug
Components: Streaming
Affects Versions: 1.3.2
Reporter: Juan Miguel Cejuela
Fix For: 1.5.0

The current file monitoring uses the internal variable `globalModificationTime` to filter out files that are "older". However, the current check for "older" is

`boolean shouldIgnore = modificationTime <= globalModificationTime;` (from `shouldIgnore`)

The comparison should be strictly SMALLER (NOT smaller or equal). The method documentation also states "This happens if the modification time of the file is _smaller_ than...".

Treating equality as "older" causes some files with the exact same timestamp to be ignored. The behavior is also non-deterministic: the first file to be accepted ("first" being pretty much random) causes the remaining files with the same exact timestamp to be ignored.
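To illustrate the reported behavior, here is a minimal, self-contained sketch. It is not Flink's actual code: the class `TimestampFilterSketch`, the `FileEntry` type, and the `accept` method are hypothetical, and the model assumes that files are examined one at a time with the watermark updated after each accepted file, as the description above implies. Only the names `globalModificationTime`, `modificationTime`, and `shouldIgnore` are taken from the report.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class TimestampFilterSketch {

    /** Hypothetical stand-in for a monitored file: a name plus its modification time. */
    static final class FileEntry {
        final String name;
        final long modificationTime;

        FileEntry(String name, long modificationTime) {
            this.name = name;
            this.modificationTime = modificationTime;
        }
    }

    /**
     * Simplified model of the filter (an assumption, not the real monitoring loop).
     * With strict == false it uses the reported "<=" check; with strict == true
     * it uses the proposed "<" check.
     */
    static List<String> accept(List<FileEntry> files, boolean strict) {
        long globalModificationTime = Long.MIN_VALUE;
        List<String> accepted = new ArrayList<>();
        for (FileEntry file : files) {
            boolean shouldIgnore = strict
                    ? file.modificationTime < globalModificationTime
                    : file.modificationTime <= globalModificationTime;
            if (shouldIgnore) {
                continue;
            }
            accepted.add(file.name);
            // Advance the watermark to the newest modification time seen so far.
            globalModificationTime = Math.max(globalModificationTime, file.modificationTime);
        }
        return accepted;
    }

    public static void main(String[] args) {
        // Three files that share the exact same modification timestamp.
        List<FileEntry> files = Arrays.asList(
                new FileEntry("a", 1000L),
                new FileEntry("b", 1000L),
                new FileEntry("c", 1000L));

        System.out.println(accept(files, false)); // prints [a]
        System.out.println(accept(files, true));  // prints [a, b, c]
    }
}
```

In this model, which file survives the `<=` check depends only on iteration order, which matches the non-determinism described above. Note that the sketch covers a single pass: with a strict comparison alone, a file seen again in a later pass with an unchanged timestamp would also be accepted again, so a real fix would probably need to track already-processed files as well.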