Rohini Palaniswamy created PIG-4120:
---------------------------------------

             Summary: Broadcast the index file in case of POMergeCoGroup and 
POMergeJoin
                 Key: PIG-4120
                 URL: https://issues.apache.org/jira/browse/PIG-4120
             Project: Pig
          Issue Type: Sub-task
            Reporter: Rohini Palaniswamy


Currently merge join and merge cogroup use two DAGs - the first DAG creates the 
index file in hdfs and second DAG does the merge join.  Similar to replicate 
join, we can broadcast the index file and cache it and use it in merge join and 
merge cogroup. This will give better performance and also eliminate need for 
the second DAG.



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to