Hi, Before I send out the keys for network shuffle, in reduceByKey after map + combine are done, I would like to filter the keys based on some threshold...
Is there a way to get the key, value after map+combine stages so that I can run a filter on the keys ? Thanks. Deb