Hello, I recently noticed that the OrcInputFormat/OutputFormat's have been exposed to be usable outside of hive ( https://issues.apache.org/jira/browse/HIVE-5728). Does anyone know of any documentation, or have any information, as to how one uses these with a mapReduce application to write orc records? In particular, I would like to use a spark streaming process to read avro records off of kafka, and write then write them directly to hdfs in orc format where they could be used for hive queries.
Thanks, Brock