I would like to get the file name along with the objects read from it, so
that I can do further mapping on them.

My code below gives me an RDD of (AvroKey[myObject], NullWritable) pairs, but
I don't know how to find out which file each object came from.

 sc.newAPIHadoopRDD(job.getConfiguration,
        classOf[AvroKeyInputFormat[myObject]],
        classOf[AvroKey[myObject]],
        classOf[NullWritable])

Basically, I would like to end up with a tuple of (FileName,
(AvroKey[myObject], NullWritable)).
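
To make the target shape concrete, here is an untested sketch of one possible
direction, not a confirmed solution. It assumes the RDD returned by
newAPIHadoopRDD is a NewHadoopRDD at runtime (an implementation detail), that
every split is a FileSplit (which should hold for file-based input formats
such as AvroKeyInputFormat), and it uses GenericRecord as a stand-in for
myObject. The helper name withFileNames is hypothetical.

```scala
import org.apache.avro.generic.GenericRecord
import org.apache.avro.mapred.AvroKey
import org.apache.avro.mapreduce.AvroKeyInputFormat
import org.apache.hadoop.io.NullWritable
import org.apache.hadoop.mapreduce.{InputSplit, Job}
import org.apache.hadoop.mapreduce.lib.input.FileSplit
import org.apache.spark.SparkContext
import org.apache.spark.rdd.{NewHadoopRDD, RDD}

// Hypothetical helper: pairs every record with the path of the file it was read from.
// GenericRecord stands in for myObject (assumption).
def withFileNames(sc: SparkContext, job: Job)
    : RDD[(String, (AvroKey[GenericRecord], NullWritable))] = {
  val rdd = sc.newAPIHadoopRDD(job.getConfiguration,
    classOf[AvroKeyInputFormat[GenericRecord]],
    classOf[AvroKey[GenericRecord]],
    classOf[NullWritable])

  // Assumption: the concrete class behind the returned RDD is NewHadoopRDD,
  // which exposes the input split of each partition (a developer API).
  rdd.asInstanceOf[NewHadoopRDD[AvroKey[GenericRecord], NullWritable]]
    .mapPartitionsWithInputSplit {
      (split: InputSplit, records: Iterator[(AvroKey[GenericRecord], NullWritable)]) =>
        // Assumption: the split is a FileSplit, so it carries the file path.
        val fileName = split.asInstanceOf[FileSplit].getPath.toString
        records.map(record => (fileName, record))
    }
}
```

Hadoop record readers may reuse the same key object across records, so the
datum would probably need to be copied or converted before caching or
collecting the result.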

Any help is appreciated.

.Manas
