[ https://issues.apache.org/jira/browse/HIVE-9896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Madhan Sundararajan Devaki updated HIVE-9896: --------------------------------------------- Description: We Sqooped (1.4.5) data from many RDBMS into HDFS in text format with options --null-non-string '\\N' --null-string '\\N'. When we load these into Hive tables in text format the \N is properly recognized as NULL and we are able to use SQL clauses such as IS NULL and IS NOT NULL against columns. However, when we convert the text files into AVRO (1.7.6) with SNAPPY compression and try to query using the above SQL clauses, the query does not return results as expected. Further, we have to use column_name = '\\N' or column_name <> '\\N' as a workaround. was: We Sqooped data from many RDBMS into HDFS in text format with options --null-non-string '\\N' --null-string '\\N'. When we load these into Hive tables in text format the \N is properly recognized as NULL and we are able to use SQL clauses such as IS NULL and IS NOT NULL against columns. However, when we convert the text files into AVRO (1.7.6) with SNAPPY compression and try to query using the above SQL clauses, the query does not return results as expected. Further, we have to use column_name = '\\N' or column_name <> '\\N' as a workaround. > \N un-recognized in AVRO format Hive tables > ------------------------------------------- > > Key: HIVE-9896 > URL: https://issues.apache.org/jira/browse/HIVE-9896 > Project: Hive > Issue Type: Bug > Components: Database/Schema, File Formats, Hive > Affects Versions: 0.13.0 > Environment: CDH5.2.1, RHEL6.5, Java 7 > Reporter: Madhan Sundararajan Devaki > > We Sqooped (1.4.5) data from many RDBMS into HDFS in text format with options > --null-non-string '\\N' --null-string '\\N'. > When we load these into Hive tables in text format the \N is properly > recognized as NULL and we are able to use SQL clauses such as IS NULL and IS > NOT NULL against columns. > However, when we convert the text files into AVRO (1.7.6) with SNAPPY > compression and try to query using the above SQL clauses, the query does not > return results as expected. > Further, we have to use column_name = '\\N' or column_name <> '\\N' as a > workaround. -- This message was sent by Atlassian JIRA (v6.3.4#6332)