Hi,
I have a hive query like this

from (
     from mytable
     map mytable.c1, mytable.c2, mytable.c3
     using '/bin/cat' as c1,c2,c3
     cluster by c1) mo
     insert overwrite table mytable2
     reduce mo.c1, mo.c2, mo.c3
     using 'python convert.py'
     as x1, x2;

Now what is happening is that some of the records from 'mytable' are
not making their way into 'mytable2'. However,

from (
     from (select * from mytable where c1=1)
     map mytable.c1, mytable.c2, mytable.c3
     using '/bin/cat' as c1,c2,c3
     cluster by c1) mo
     insert overwrite table mytable2
     reduce mo.c1, mo.c2, mo.c3
     using 'python convert.py'
     as x1, x2;

Say, the missing record was for c1=1, then with this query, the record
shows up in mytable2. Is there something I am missing ?

Regards,
--
Rohan Monga

Reply via email to