Re: why using mapreduce python scripts print more NULLs

2013-03-26 Thread 周梦想
Thank you,Abdelrhman! I notice that if the python script output delimiter is ' ',then it will print more NULL to fill as (a,b,c...) field. if I change the ' '.join() to '\t'.join(), it will be ok. so select clause will output field delimited by '\t'. Best Regards, Andy Zhou 2013/3/27 Abdelrhma

Re: why using mapreduce python scripts print more NULLs

2013-03-26 Thread Abdelrhman Shettia
Hi Andy , Can you view the data from the table by hadoop fs -text $tabledir/$filename? The data may be corrupted or the filed delimiter is mixed with the data used in the transform script. Thanks On Mar 26, 2013, at 2:54 AM, 周梦想 wrote: > testpoker

why using mapreduce python scripts print more NULLs

2013-03-26 Thread 周梦想
hive version:0.10.0 hive> from testpoker select transform(ldate,ltime,threadid,gameid,userid,pid,roundbet,fold,allin,cardtype,cards,chipwon) using 'calcpoker.py' as ldate,gameid,userid,pid,win,fold,allin,cardtype,cards ; 03/13/13 1009 185690475 8639 0 1 0 -1 NULL NULLNULLNULL NULL