Re: Trace Key-Value pairs

2016-12-06 Thread Robert Grandl
Rajesh, Thanks so much for your answers. However, I am struggling to get the right information. As you have mentioned, in ReduceSinkOperator.java, keys and values are present but I have a hard time to be able to print their content. For key: I am trying to print it in ReduceSinkOperator.java -

Re: Trace Key-Value pairs

2016-12-04 Thread Rajesh Balamohan
Hi Robert, Tez deals with bytes and does not understand if the data is coming from Hive/Pig/Cascading etc. So in case you print the content from Hive, you would get mostly binary data. For hive, org.apache.hadoop.hive.ql.io.HiveKey, and value would be org.apache.hadoop.io.BytesWritable. Printing

Trace Key-Value pairs

2016-12-04 Thread Robert Grandl
Hi guys, I am running Hive atop Tez and run several TPC-DS / TPC-H queries. I am trying to print the Key/Value pairs received as input by each vertex and generated as output accordingly. However, looking at Hive / Tez code, it seems they are converted to Object type and use their serialized fo