I don't think the file name s are stored per se, more like it's metadata
that can be accessed in queries like pseudocolumns in Oracle.
More info:
https://cwiki.apache.org/confluence/display/Hive/LanguageManual+VirtualColumns
On Nov 22, 2016 10:41 PM, "Gopal Vijayaraghavan" wrote:
> > I am using
> I am using INPUT__FILE__NAME to store the file name in a column of ORC table.
>
That's something I don't understand (the use of the virtual column).
It is common practice to normalize stuff like images as S3 file-paths instead
of actual bytes inside an ORC file for efficiency of storage.
>
Raw file contents? Maybe create an external single-column textfile table
over the data?
On Nov 21, 2016 5:33 PM, "Arun Patel" wrote:
> Hive Experts,
>
> I am using INPUT__FILE__NAME to store the file name in a column of ORC
> table.
>
> Now, Is there a way to retrieve the content of the HDFS fil
Hey All,
I am using Hive 2.0 with external meta-store on EMR-5.0.0 and TEZ as
execution engine.
Our data are stored in json format so for serialization and deserialization
purpose we are planning to use lazy serde
(classname is 'org.apache.hadoop.hive.serde2.lazy.LazySimpleSerDe' ).
My table def
Done. Welcome to the Hive wiki team, Anishek!
-- Lefty
On Tue, Nov 22, 2016 at 1:05 AM, Anishek Agarwal
wrote:
> Hello,
>
> Please provide edit permissions to the hive wiki pages for the confluence
> user name *anagarwal*
>
> Regards,
> anishek
>
Hello,
Please provide edit permissions to the hive wiki pages for the confluence user
name anagarwal
Regards,
anishek