Re: Hive statistics

2016-12-21 Thread Robert Grandl
Hi guys, I am wondering. Is there any other mailing list for hive related questions? I feel there is not much activity on the user/dev hive mailing lists or at least not much support in answering my questions. Thanks,Robert On Tuesday, December 20, 2016 11:01 PM, Robert Grandl wrote

Hive statistics

2016-12-20 Thread Robert Grandl
Hi guys, I am wondering if it's possible to estimate the number of distinct keys and their distribution in a way or another. More concretely, for every stage, it is possible to determine the number of distinct keys and for each key the number of values  before the data is actually processed? T

Re: Print actual key / value details in Hive

2016-12-06 Thread Robert Grandl
use a StructObjectInspector and iterate over every getStructFieldsDataAsList(obj) element. Is this the right way to do it? However, for the value I have no clue how to get it. Thanks,Robert On Monday, December 5, 2016 11:07 AM, Robert Grandl wrote: Hi guys, I am trying to trace the key / value de

Re: Trace Key-Value pairs

2016-12-06 Thread Robert Grandl
contents. This is just an example. ~Rajesh.B On Mon, Dec 5, 2016 at 5:43 AM, Robert Grandl wrote: Hi guys, I am running Hive atop Tez and run several TPC-DS / TPC-H queries. I am trying to print the Key/Value pairs received as input by each vertex and generated as output accordingly. However, look

Print actual key / value details in Hive

2016-12-05 Thread Robert Grandl
Hi guys, I am trying to trace the key / value details processed by each Operator or at least each stage in Hive. However, I have a hard time to do that, as key / values are serialized. Is there any way I can see the original key / value details? Thanks in advance,Robert

Trace Key-Value pairs

2016-12-04 Thread Robert Grandl
Hi guys, I am running Hive atop Tez and run several TPC-DS / TPC-H queries. I am trying to print the Key/Value pairs received as input by each vertex and generated as output accordingly. However, looking at Hive / Tez code, it seems they are converted to Object type and use their serialized fo

Re: clarification regarding Tez DAGs

2016-12-02 Thread Robert Grandl
g an inbound shuffled input then there is no difference except for their logical names. Feel free to send more questions to the list to get more clarifications. thanks — Hitesh   > On Nov 28, 2016, at 3:44 PM, Robert Grandl wrote: > > Hi all, > > I am trying to get a better u