I would like to get the file name along with the objects read from that file, so that I can do further mapping on them.
My code below gives me (AvroKey[myObject], NullWritable) pairs, but I don't know how to get the name of the file each object came from.

    sc.newAPIHadoopRDD(
      job.getConfiguration,
      classOf[AvroKeyInputFormat[myObject]],
      classOf[AvroKey[myObject]],
      classOf[NullWritable])

Basically, I would like to end up with tuples of (FileName, (AvroKey[myObject], NullWritable)).

Any help is appreciated.

Manas