Hi,

I have a Hive table stored as ORC with the following properties:


# Detailed Table Information
Database:               default
Owner:                  root
CreateTime:             Tue Mar 24 21:33:53 PDT 2015
LastAccessTime:         UNKNOWN
Protect Mode:           None
Retention:              0
Location:               hdfs://sandbox.hortonworks.com:8020/apps/hive/warehouse/mytable_orc
Table Type:             MANAGED_TABLE
Table Parameters:
        COLUMN_STATS_ACCURATE   true
        numFiles                1
        numRows                 9
        rawDataSize             8730
        totalSize               1946
        transient_lastDdlTime   1427258050

# Storage Information
SerDe Library:          org.apache.hadoop.hive.ql.io.orc.OrcSerde
InputFormat:            org.apache.hadoop.hive.ql.io.orc.OrcInputFormat
OutputFormat:           org.apache.hadoop.hive.ql.io.orc.OrcOutputFormat

Compressed:             No
Num Buckets:            -1
Bucket Columns:         []
Sort Columns:           []
Storage Desc Params:
        field.delim             ,
        serialization.format    ,
Time taken: 1.702 seconds, Fetched: 41 row(s)

I have a UDF that I want to run on my tables. It works on every other
kind of table, but on an ORC table like the one above, the configure
method of the GenericUDF is never called.

Any thoughts on this?
