Sorry, I checked: my production JDK is java version "1.7.0_45", not java version "1.6.0_35".
r7raul1...@163.com

From: r7raul1...@163.com
Date: 2015-04-02 17:01
To: dev
Subject: Re: Re: hive 0.14 on some platform return some not NULL value as NULL

I downloaded the full data from HDFS and then loaded it into my table in my test environment. Everything is OK there.
My production environment is Hadoop 2.3.0-cdh 5.0.2, REDHAT 5.8, java version "1.6.0_35".

r7raul1...@163.com

From: r7raul1...@163.com
Date: 2015-04-02 16:57
To: dev
Subject: Re: Re: hive 0.14 on some platform return some not NULL value as NULL

In my test environment I use Hive 0.14 and Hive 1.1.0, and the result is OK. But in the production environment the result is not correct.

r7raul1...@163.com

From: Thejas Nair
Date: 2015-04-02 16:41
To: r7raul1...@163.com
CC: dev
Subject: Re: Re: hive 0.14 on some platform return some not NULL value as NULL

I am unable to reproduce this issue using the sample data -
For this query, using 1.1.0, I get the following result -

87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM 2357378283356 91501191044440048 7326356 2015-01-19 10:44:442015-01-19

On Thu, Apr 2, 2015 at 12:36 AM, r7raul1...@163.com <r7raul1...@163.com> wrote:
>
> DDL is
> CREATE TABLE dw.fct_traffic_navpage_path_detl(
> date_id string,
> chanl_id bigint,
> sessn_id string,
> gu_id string,
> prov_id string,
> city_id string,
> landing_page_type_id string,
> landing_track_time string,
> landing_url string,
> nav_refer_tracker_id string,
> nav_refer_page_type_id string,
> nav_refer_page_value string,
> nav_refer_link_position string,
> nav_tracker_id string,
> nav_page_categ_id string,
> nav_page_type_id string,
> nav_page_value string,
> nav_srce_type string,
> internal_keyword string,
> internal_result_sum string,
> pltfm_id int,
> app_vers string,
> nav_link_position string,
> nav_button_position string,
> nav_track_time string,
> nav_next_tracker_id string,
> sessn_last_time string,
> sessn_pv int,
> detl_tracker_id string,
> detl_page_type_id string,
> detl_page_value string,
> detl_pm_id bigint,
> detl_link_position string,
> detl_position_track_id string,
> cart_tracker_id string,
> cart_page_type_id string,
> cart_page_value string,
> cart_link_postion string,
> cart_button_position string,
> cart_position_track_id string,
> cart_prod_id bigint,
> ordr_tracker_id string,
> ordr_page_type_id string,
> ordr_code string,
> updt_time string,
> cart_pm_id bigint,
> brand_code string,
> categ_type int,
> os string,
> end_user_id string,
> add_cart_flag string,
> navgation_page_flag int,
> nav_page_url string,
> detl_button_position string,
> manul_flag int,
> manul_track_date string,
> nav_refer_tpa string,
> nav_refer_tpa_id string,
> nav_refer_tpc string,
> nav_refer_tpi string,
> nav_refer_tcs string,
> nav_refer_tcsa string,
> nav_refer_tcdt string,
> nav_refer_tcd string,
> nav_refer_tci string,
> nav_refer_postn_type string,
> nav_tpa_id string,
> nav_tpa string,
> nav_tpc string,
> nav_tpi string,
> nav_tcs string,
> nav_tcsa string,
> nav_tcdt string,
> nav_tcd string,
> nav_tci string,
> nav_postn_type string,
> detl_tpa_id string,
> detl_tpa string,
> detl_tpc string,
> detl_tpi string,
> detl_tcs string,
> detl_tcsa string,
> detl_tcdt string,
> detl_tcd string,
> detl_tci string,
> detl_postn_type string,
> cart_tpa_id string,
> cart_tpa string,
> cart_tpc string,
> cart_tpi string,
> cart_tcs string,
> cart_tcsa string,
> cart_tcdt string,
> cart_tcd string,
> cart_tci string,
> cart_postn_type string,
> sessn_chanl_id bigint,
> gu_sec_flg bigint,
> detl_refer_page_type_id string,
> detl_refer_page_value string,
> detl_event_id string,
> nav_refer_intrn_reslt_sum string,
> nav_intrn_reslt_sum string,
> nav_refer_intrn_kw string,
> nav_intrn_kw string,
> detl_track_time string,
> cart_track_time string)
> PARTITIONED BY (
> ds string)
> ROW FORMAT SERDE
> 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe'
> STORED AS INPUTFORMAT
> 'org.apache.hadoop.mapred.TextInputFormat'
> OUTPUTFORMAT
> 'org.apache.hadoop.hive.ql.io.HiveIgnoreKeyTextOutputFormat'
> LOCATION
> '/user/hive/dw/fct_traffic_navpage_path_detl'
> TBLPROPERTIES (
> 'numPartitions'='265',
> 'numFiles'='26677',
> 'last_modified_by'='bi_etl',
> 'last_modified_time'='1423633028',
> 'transient_lastDdlTime'='1427870517',
> 'numRows'='0',
> 'totalSize'='8268127466928',
> 'rawDataSize'='0')
>
> My query is :
>
> SELECT a1.sessn_id,
> a1.ordr_code,
> a1.cart_tracker_id,
> a1.end_user_id,
> a1.cart_track_time
> FROM dw.fct_traffic_navpage_path_detl a1
> WHERE a1.ds = '2015-01-19'
> AND a1.cart_tracker_id > 0
> AND (a1.cart_button_position IS NULL OR length(a1.cart_button_position) = 0)
> AND a1.sessn_id IN ('Y49EY895ACABHS95DRQEE8DVFEB8JSE1',
> 'UBDTK8D9XUZ9GRZU8NZNXDEG73D4PCZG',
> '87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM')
>
> I attach my sample data.
>
> ________________________________
> r7raul1...@163.com
>
> From: Thejas Nair
> Date: 2015-04-02 15:28
> To: dev
> Subject: Re: hive 0.14 on some platform return some not NULL value as NULL
>
> Can you give more details
> - the query you are running
> - schema of the table
> - serialization format of the table, sample records if possible.
>
> On Wed, Apr 1, 2015 at 6:32 PM, r7raul1...@163.com <r7raul1...@163.com> wrote:
>>
>> I use hive 1.1.0 cli on computer A (linux) the result is
>>
>> 87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM 2357378283356 91501191044440048 7326356 NULL
>> 87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM 2357378283356 121501191035580028 7326356 NULL
>> UBDTK8D9XUZ9GRZU8NZNXDEG73D4PCZG 2362223711289 161501191549050061 14837289 NULL
>> Y49EY895ACABHS95DRQEE8DVFEB8JSE1 2360853052224 111501191426280023 115883224 NULL
>>
>> But I use hive0.14 cli in my test environment the result is correct.
>>
>> I use hive 0.10 on computer B (linux) the result is
>>
>> 87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM 2357378283356 91501191044440048 7326356 2015-01-19 10:44:44
>> 87FQEZT1UEDXJHJQPFFX7G7ET8S2DVPM 2357378283356 121501191035580028 7326356 2015-01-19 10:35:58
>> UBDTK8D9XUZ9GRZU8NZNXDEG73D4PCZG 2362223711289 161501191549050061 14837289 2015-01-19 15:49:05
>> Y49EY895ACABHS95DRQEE8DVFEB8JSE1 2360853052224 111501191426280023 115883224 2015-01-19 14:26:28
>>
>> Why ?
>>
>> I attach my log. Also in my log I found
>> 2015-04-01 09:55:38,409 WARN [main] org.apache.hadoop.hive.serde2.lazy.LazyStruct: Extra bytes detected at the end of the row! Ignoring similar problems.
>>
>> ________________________________
>> r7raul1...@163.com
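
The LazyStruct warning quoted above ("Extra bytes detected at the end of the row!") is typically logged when a text row contains more delimited fields than the table declares. One way to check whether that is happening here is to count the fields per row in a data file copied locally from HDFS. The following is only a minimal sketch, not something from the thread: the function names are hypothetical, and it assumes the table uses Hive's default field delimiter (Ctrl-A, `\x01`), which is a guess based on the DDL showing no explicit delimiter. The expected field count is passed in by the caller and should equal the number of non-partition columns in the DDL.

```python
from collections import Counter

# Assumption: the table uses Hive's default text field delimiter (Ctrl-A),
# because the DDL in the thread does not declare a different one.
DEFAULT_DELIMITER = "\x01"


def field_count_histogram(lines, delimiter=DEFAULT_DELIMITER):
    """Return a Counter mapping 'fields in a row' to 'number of such rows'."""
    histogram = Counter()
    for line in lines:
        row = line.rstrip("\r\n")  # drop the line terminator only
        histogram[len(row.split(delimiter))] += 1
    return histogram


def rows_with_unexpected_width(lines, expected_fields, delimiter=DEFAULT_DELIMITER):
    """Yield (line_number, field_count) for rows that do not match the schema width."""
    for line_number, line in enumerate(lines, start=1):
        count = len(line.rstrip("\r\n").split(delimiter))
        if count != expected_fields:
            yield line_number, count


def scan_file(path, expected_fields, delimiter=DEFAULT_DELIMITER):
    """Hypothetical helper: list the mismatching rows of one local data file."""
    with open(path, encoding="utf-8", errors="replace") as handle:
        return list(rows_with_unexpected_width(handle, expected_fields, delimiter))
```

If most rows in the production partition turn out wider than the schema while the test data is not, that would point at the data files rather than the Hive version. It would not by itself explain why cart_track_time comes back as NULL, so treat it as a first diagnostic step only.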