Hi guys,
I am wondering: is there any other mailing list for Hive-related questions?
I feel there is not much activity on the Hive user/dev mailing lists, or at
least not much support in answering my questions.
Thanks, Robert
On Tuesday, December 20, 2016 11:01 PM, Robert Grandl wrote:
Hi guys,
I am wondering if it's possible to estimate the number of distinct keys and
their distribution in one way or another.
More concretely, for every stage, is it possible to determine the number of
distinct keys, and for each key the number of values, before the data is
actually processed?
T
use a StructObjectInspector and iterate over
every getStructFieldsDataAsList(obj) element. Is this the right way to do it?
However, I have no clue how to get the value.
Thanks, Robert
On Monday, December 5, 2016 11:07 AM, Robert Grandl wrote:
Hi guys,
I am trying to trace the key / value de
contents. This is just an example.
~Rajesh.B
On Mon, Dec 5, 2016 at 5:43 AM, Robert Grandl wrote:
Hi guys,
I am running Hive atop Tez and run several TPC-DS / TPC-H queries. I am trying
to print the Key/Value pairs received as input by each vertex and generated as
output accordingly.
However, look
Hi guys,
I am trying to trace the key / value details processed by each Operator or at
least each stage in Hive.
However, I am having a hard time doing that, as the keys / values are serialized.
Is there any way I can see the original key / value details?
Thanks in advance, Robert
Hi guys,
I am running Hive atop Tez and run several TPC-DS / TPC-H queries. I am trying
to print the Key/Value pairs received as input by each vertex and generated as
output accordingly.
However, looking at Hive / Tez code, it seems they are converted to Object type
and use their serialized fo
g an inbound shuffled input
then there is no difference except for their logical names.
Feel free to send more questions to the list to get more clarifications.
thanks
— Hitesh
> On Nov 28, 2016, at 3:44 PM, Robert Grandl wrote:
>
> Hi all,
>
> I am trying to get a better u