Hi Dawid, My team's Flink application's primary purpose is not to count the number of SQS messages received or the number of successful or failed S3 downloads. The application's primary purpose is to process events and the corresponding data, and for each event, create or update a new entry in our database with the result of this processing. (I can't really go into too much detail on what is processed and what results are produced because that is proprietary information.)
As you suggest, the counters for SQS messages and successful or unsuccessful downloads from S3 are just for the purposes of monitoring. We use these metrics to have an idea of how healthy our application is, and to help indicate which components may have faults. We are not using these counters to calculate results. My team is trying to understand why we see inaccurate values for our metrics, with the intention of fixing the inaccuracies so we can better monitor our application. I hope this helps to clarify the context of my inquiry. Best, Shane