Hi all, we often need to track the number of rows in a dataset. To avoid adding to the job's complexity, we use accumulators to track this information. The problem is that we have to extend every InputFormat we use so that it properly handles such a row-count accumulator. My question is: what about introducing it as a first-class citizen (forcing all input formats to handle a rowCount accumulator when required)?
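To make the idea concrete, here is a minimal sketch of a counting wrapper around an input format. It is only an illustration under stated assumptions: `SimpleInputFormat`, `ListInputFormat` and `RowCountingInputFormat` are hypothetical, simplified names and not the real InputFormat API, and a plain `AtomicLong` stands in for the accumulator.

```java
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.concurrent.atomic.AtomicLong;

// Hypothetical, simplified stand-in for an input format (not the real API).
interface SimpleInputFormat<T> {
    boolean reachedEnd();
    T nextRecord();
}

// Hypothetical in-memory format, used only to have something to read from.
class ListInputFormat<T> implements SimpleInputFormat<T> {
    private final Iterator<T> rows;

    ListInputFormat(List<T> rows) {
        this.rows = rows.iterator();
    }

    @Override
    public boolean reachedEnd() {
        return !rows.hasNext();
    }

    @Override
    public T nextRecord() {
        return rows.next();
    }
}

// Generic wrapper: counts every record handed out by the wrapped format,
// so that individual formats do not have to be extended one by one.
class RowCountingInputFormat<T> implements SimpleInputFormat<T> {
    private final SimpleInputFormat<T> delegate;
    // Assumption: an AtomicLong plays the role of the row-count accumulator.
    private final AtomicLong rowCount = new AtomicLong();

    RowCountingInputFormat(SimpleInputFormat<T> delegate) {
        this.delegate = delegate;
    }

    @Override
    public boolean reachedEnd() {
        return delegate.reachedEnd();
    }

    @Override
    public T nextRecord() {
        T record = delegate.nextRecord();
        rowCount.incrementAndGet();
        return record;
    }

    long getRowCount() {
        return rowCount.get();
    }
}

class RowCountDemo {
    public static void main(String[] args) {
        RowCountingInputFormat<String> format =
                new RowCountingInputFormat<>(new ListInputFormat<>(Arrays.asList("a", "b", "c")));
        // Drain the format, as a reader loop would.
        while (!format.reachedEnd()) {
            format.nextRecord();
        }
        System.out.println("rows read: " + format.getRowCount());
    }
}
```

With something like this built in, the count would be maintained in one place instead of in each extended format.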
What do you think? Would it be useful in general?

Best,
Flavio