I see there's a comment in the TaskInfo class that the index may not be the same as the ID of the RDD partition the task is computing. Under what circumstances *will* the ID by the same? If there are zero guarantees, any suggestions on how to grab this info from the scheduler to populate a new field inside TaskInfo?
Cheers, -- Michael Mior mm...@apache.org