This is the idea I had thought of as well, but in our scenario we have little 
control over how the Avro data is written, so encoding TABS and NEWLINES as 
other characters at write time is hard to enforce.

Since Avro data can be pumped into the warehouse system from many sources, 
implementing this kind of logic would mean handling the TAB and NEWLINE 
encoding in every component that writes data.

I am interested in whether this can be handled without encoding the Avro data 
itself, for example by reading the Avro data, transforming it into another 
encoding format, and sending it to the cli output in that format.

Our app would then decode the data and display it.
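
A rough sketch of what the decoding side in the app could look like, in 
Python. It assumes a backslash escaping scheme that is not fixed anywhere in 
this thread: a real tab arrives as the two characters \t, a real newline as 
\n, and a literal backslash as \\. The names decode_field and 
parse_cli_output are made up for illustration.

```python
# Assumed escape scheme (hypothetical, not something Hive emits by default):
#   backslash + "t"  -> TAB, backslash + "n" -> NEWLINE, two backslashes -> one
_UNESCAPE = {"t": "\t", "n": "\n", "\\": "\\"}


def decode_field(text):
    """Turn one escaped column value back into its original content."""
    out = []
    i = 0
    while i < len(text):
        ch = text[i]
        # Scan left to right so an escaped backslash is never misread
        # as the start of an escaped tab or newline.
        if ch == "\\" and i + 1 < len(text) and text[i + 1] in _UNESCAPE:
            out.append(_UNESCAPE[text[i + 1]])
            i += 2
        else:
            out.append(ch)
            i += 1
    return "".join(out)


def parse_cli_output(output):
    """Split output into rows and columns, then decode every column."""
    lines = output.split("\n")
    if lines and lines[-1] == "":
        lines.pop()  # drop the empty piece after the final newline
    return [[decode_field(col) for col in line.split("\t")] for line in lines]
```

Because the escaped text contains no raw tabs or newlines, splitting on them 
before decoding is safe under this assumption.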



Regards

Sathish Valluri



From: Sanjay Subramanian [mailto:sanjay.subraman...@wizecommerce.com]
Sent: Saturday, May 04, 2013 12:08 AM
To: user@hive.apache.org
Subject: Re: hive cli escaping TAB and NEW LINE Characters.



+1 to Stephen's suggestion...



From: Stephen Sprague <sprag...@gmail.com>
Reply-To: user@hive.apache.org
Date: Friday, May 3, 2013 11:29 AM
To: user@hive.apache.org
Subject: Re: hive cli escaping TAB and NEW LINE Characters.



Hate to sound like a broken record, but when all else fails think about the 
transform() function. The notion here is to encode your tabs and newlines as 
something like '\t' and '\n' (literally), for instance. If those aren't unique 
enough, use '<<tab>>' and '<<newline>>' (you get the idea), then have your app 
decode those strings to real tabs and real newlines when reading it.

What do you think?
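
To make the encoding step concrete, here is a minimal Python sketch. It only 
covers the string replacement; how the column values reach it (a transform() 
script or some other step) is left open, and it assumes each value is 
available intact as a string. Note that transform() by default hands rows to 
the script as tab-delimited text, so that assumption needs checking. The 
function names are hypothetical.

```python
def encode_field(value):
    """Replace real tabs and newlines with the literal strings \\t and \\n."""
    # Escape the backslash first so the encoding stays reversible:
    # a value that already contains backslash + "t" is not confused
    # with an encoded tab later on.
    return (
        value.replace("\\", "\\\\")
        .replace("\t", "\\t")
        .replace("\n", "\\n")
    )


def encode_row(fields):
    """Join encoded column values with a real tab, one row per line."""
    return "\t".join(encode_field(f) for f in fields)
```

After this step the only real tabs and newlines left in the output are the 
column and row delimiters, which is what lets the reading app split first and 
decode afterwards.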









On Fri, May 3, 2013 at 2:07 AM, Valluri, Sathish <sathish.vall...@emc.com> 
wrote:

Hi All,



We have an application which parses hive cli output and displays the results.

I have an external table with data in Avro format, and the contents of this 
Avro file have TABS and NEW LINES in the data part.

Since hive cli output rows are delimited by NEWLINES and columns are delimited 
by TABS, if the actual content has TAB and NEW LINE characters, parsing the 
result set gives wrong results.
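
A small Python illustration of the failure, using a made-up row rather than 
real hive cli output. The naive_parse function is a stand-in for what our 
application does, not its actual code.

```python
def naive_parse(output):
    """Split CLI-style text into rows on newlines and columns on tabs."""
    return [line.split("\t") for line in output.split("\n") if line]


# One logical row with two columns; the second value holds a tab and a newline.
row = ["id-1", "first\tpart\nsecond part"]
printed = "\t".join(row) + "\n"

parsed = naive_parse(printed)
# parsed now has two rows, with three columns and one column,
# instead of one row with two columns.
```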

Can anyone suggest some ideas for escaping the TAB and NEW LINE characters in 
the hive cli output when the actual contents of the columns have TABS and 
NEW LINES?



Regards

Sathish Valluri





