You can use RDD.setName to give it a name. There’s also a creationSite field that is private[spark] — we may want to add a public setter for that later. If the name isn’t enough and you’d like this, please open a JIRA issue for it.
Matei On Jun 3, 2014, at 5:22 PM, John Salvatier <jsalvat...@gmail.com> wrote: > I have created some extension methods for RDDs in RichRecordRDD and these are > working exceptionally well for me. > > However, when looking at the logs, its impossible to tell what's going on > because all the line number hints point to RichRecordRDD.scala rather than > the code that uses it. For example: > > INFO scheduler.DAGScheduler: Submitting Stage 122 (MappedRDD[1223] at map at > RichRecordRDD.scala:633), which is now runnable > Is there any way set up my extension methods class so that the logs will > print a more useful line number? >