Here's the original design doc: https://wiki.apache.org/pig/PigSkewedJoinSpec

Alan.

Gagan Juneja
July 29, 2015 at 21:30
Any help?

Regards,
Gagan


Gagan Juneja
July 14, 2015 at 4:56
Hi Team,

We use Pig intensively in various projects. We are working on
optimizations, and for that we would like to understand how joins work.
We have already moved to skewed joins for some of our use cases.

In many places the documentation mentions that, in a join, the data for
the second table is streamed. But I was unable to work out how this fits
into the MapReduce paradigm.

1. Can anyone please clarify how a join happens in Pig?
2. What does "streaming" mean here? Are we loading the files directly
in the reducers?


Regards,
Gagan
