Re: Wrong runtime type when using newAPIHadoopFile in Java

2017-03-06 Thread Steve Loughran
On 6 Mar 2017, at 12:30, Nira Amit <amitn...@gmail.com> wrote:
> And it's very difficult if it's doing unexpected things.
All serialisations do unexpected things. Nobody understands them. Sorry

Re: Wrong runtime type when using newAPIHadoopFile in Java

2017-03-06 Thread Nira Amit
And by the way - I don't want the Avro details to be hidden away from me. The whole purpose of the work I'm doing is to benchmark different serialization tools and strategies. If I want to use Kryo serialization for example, then I need to understand how the API works. And it's very difficult if it

Re: Wrong runtime type when using newAPIHadoopFile in Java

2017-03-06 Thread Nira Amit
Hi Sean, Yes, we discussed this in Jira and you suggested I take this discussion to the mailing list, so I did. I don't have the option to migrate the code I'm working on to Datasets at the moment (or to Scala, as another developer suggested in the Jira discussion), so I have to work with the J

Re: Wrong runtime type when using newAPIHadoopFile in Java

2017-03-06 Thread Sean Owen
I think this is the same thing we already discussed extensively on your JIRA. The key/value class arguments to newAPIHadoopFile are not the type of your custom class, but the types of the Writables that describe the encoding of keys and values in the file. I think that's the start of part of the problem.
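
To make the point about class arguments concrete, here is a minimal plain-Java sketch. It uses no Spark or Hadoop. KeyWrapper, MyRecord and readKeys are hypothetical stand-ins invented for illustration, not part of any real API. It assumes the "wrong runtime type" comes from Java type erasure: a class token passed to a generic method does not convert the objects the reader actually produces, so a mismatch only shows up when an element is used.

```java
import java.util.ArrayList;
import java.util.List;

public class WrapperTypeSketch {

    // Hypothetical stand-in for the wrapper type that a file format hands back.
    static final class KeyWrapper<T> {
        private final T datum;
        KeyWrapper(T datum) { this.datum = datum; }
        T datum() { return datum; }
    }

    // Hypothetical custom class the caller would like to receive directly.
    static final class MyRecord {
        final String name;
        MyRecord(String name) { this.name = name; }
    }

    // Hypothetical reader. The class token only fixes the static type K;
    // generics are erased, so the elements are never checked or converted.
    @SuppressWarnings("unchecked")
    static <K> List<K> readKeys(Class<K> keyClass) {
        List<Object> raw = new ArrayList<>();
        raw.add(new KeyWrapper<>("alice"));
        raw.add(new KeyWrapper<>("bob"));
        return (List<K>) (List<?>) raw;
    }

    @SuppressWarnings("rawtypes")
    static String describe() {
        // Passing the custom class compiles, but the elements are still wrappers.
        List<MyRecord> wrong = readKeys(MyRecord.class);
        String wrongResult;
        try {
            MyRecord r = wrong.get(0); // implicit cast fails here, at first use
            wrongResult = r.name;
        } catch (ClassCastException e) {
            wrongResult = "ClassCastException";
        }

        // Passing the wrapper class matches what the reader produces;
        // the payload is then unwrapped explicitly.
        List<KeyWrapper> right = readKeys(KeyWrapper.class);
        Object first = right.get(0).datum();

        return wrongResult + "," + first;
    }

    public static void main(String[] args) {
        System.out.println(describe()); // prints "ClassCastException,alice"
    }
}
```

In this sketch the failure appears where the element is first used, not at the call that declared the wrong class, which is why such a mismatch can look like the API "doing unexpected things".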