Hello,

In Reduce, can I get the number of values for the current key without
iterating over them? Does Hadoop have this number?

Or, at least, can I get the total number of pairs that will be
processed by the current Reduce instance? I am pretty sure that Hadoop
already knows this number because it has sorted them.

BTW, the iterators given to Reduce are one-time use iterators, right?

Thanks!
Rares
