RE: Building Custom RCFiles

2011-03-18 Thread Severance, Steve
Got it working using the columnar serde with the default separators.
Steve
-Original Message-
From: yongqiang he [mailto:heyongqiang...@gmail.com]
Sent: Friday, March 18, 2011 3:50 PM
To: user@hive.apache.org
Subject: Re: Building Custom RCFiles
What's your table definition?
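A minimal sketch of what writing a map value "with the default separators" might look like on the MapReduce side. It assumes that, inside a single column, Hive's default delimiters are \002 (Ctrl-B) between map entries and \003 (Ctrl-C) between a key and its value; that assumption should be checked against the Hive version in use. The class and method names (MapColumnEncoder, encode) are hypothetical and are not part of Hive.

```java
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashMap;
import java.util.Map;

public class MapColumnEncoder {
    // Assumed Hive defaults for a map stored in one column (not confirmed by the thread).
    static final char ENTRY_SEP = '\u0002';      // between map entries
    static final char KEY_VALUE_SEP = '\u0003';  // between a key and its value

    // Flattens the map into the text bytes that would be written as one column value.
    static byte[] encode(Map<String, String> map) {
        StringBuilder sb = new StringBuilder();
        boolean first = true;
        for (Map.Entry<String, String> e : map.entrySet()) {
            if (!first) {
                sb.append(ENTRY_SEP);
            }
            sb.append(e.getKey()).append(KEY_VALUE_SEP).append(e.getValue());
            first = false;
        }
        return sb.toString().getBytes(StandardCharsets.UTF_8);
    }

    public static void main(String[] args) {
        Map<String, String> m = new LinkedHashMap<>();
        m.put("a", "1");
        m.put("b", "2");
        System.out.println(encode(m).length + " bytes in the encoded column value");
    }
}
```

The resulting bytes would then be handed to whatever writes the column into the RCFile row; that step depends on the job and is not shown here.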

Re: Building Custom RCFiles

2011-03-18 Thread yongqiang he
; Hive doesn't seem to want to take that. I have also tried using the ASCII octal codes.
>
> What do I need to set up to make this Map work?
>
> Thanks.
>
> Steve
>
> -Original Message-
> From: yongqiang he [mailto:heyongqiang...@gmail.com]
> Sent: Thu

RE: Building Custom RCFiles

2011-03-18 Thread Severance, Steve
rsday, March 17, 2011 5:09 PM
To: user@hive.apache.org
Subject: Re: Building Custom RCFiles
Yes. It is the same with normal Hive tables.
thanks
yongqiang
On Thu, Mar 17, 2011 at 4:54 PM, Severance, Steve wrote:
> Thanks Yongqiang.
>
> So for more complex types like map do I just

Re: Building Custom RCFiles

2011-03-17 Thread yongqiang he
ks.
>
> Steve
>
> -Original Message-
> From: yongqiang he [mailto:heyongqiang...@gmail.com]
> Sent: Thursday, March 17, 2011 4:35 PM
> To: user@hive.apache.org
> Subject: Re: Building Custom RCFiles
>
> A side note, in hive, we make all columns saved as Text interna

RE: Building Custom RCFiles

2011-03-17 Thread Severance, Steve
ubject: Re: Building Custom RCFiles
A side note: in Hive, all columns are saved as Text internally (even if the column's type is int, double, etc.). In some experiments, strings turned out to be friendlier to compression, but decoding them back to the original type costs CPU.
Thanks
Yongqiang
On Thu, Ma

Re: Building Custom RCFiles

2011-03-17 Thread yongqiang he
A side note: in Hive, all columns are saved as Text internally (even if the column's type is int, double, etc.). In some experiments, strings turned out to be friendlier to compression, but decoding them back to the original type costs CPU.
Thanks
Yongqiang
On Thu, Mar 17, 2011 at 4:04 PM, yongqiang he wrot
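To illustrate the trade-off described above, here is a small stand-alone sketch that contrasts storing an int as text bytes with storing it as fixed-width binary. It uses only the Java standard library and does not call Hive's SerDe classes; the class and method names (TextEncodedInt and its helpers) are hypothetical.

```java
import java.nio.ByteBuffer;
import java.nio.charset.StandardCharsets;

public class TextEncodedInt {
    // Text form: the decimal digits as UTF-8 bytes, similar in spirit to a Text-backed column.
    static byte[] encodeAsText(int value) {
        return Integer.toString(value).getBytes(StandardCharsets.UTF_8);
    }

    // Reading the value back requires parsing the digits, which is the CPU cost mentioned above.
    static int decodeFromText(byte[] bytes) {
        return Integer.parseInt(new String(bytes, StandardCharsets.UTF_8));
    }

    // Binary form: always four bytes, big-endian, with no parsing needed on read.
    static byte[] encodeAsBinary(int value) {
        return ByteBuffer.allocate(4).putInt(value).array();
    }

    public static void main(String[] args) {
        int value = 42;
        byte[] text = encodeAsText(value);
        byte[] binary = encodeAsBinary(value);
        System.out.println("text bytes: " + text.length + ", binary bytes: " + binary.length);
        System.out.println("round trip: " + decodeFromText(text));
    }
}
```

A job that writes binary-encoded ints into a column read through a text-oriented SerDe would presumably see them fail to deserialize, which is consistent with the symptom reported at the start of the thread.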

Re: Building Custom RCFiles

2011-03-17 Thread yongqiang he
You need to customize the serialize and deserialize functions of Hive's ColumnarSerDe (and maybe functions in LazySerde), depending on whether you want to read or write. The main thing is that you need to use your own type definitions (not LazyInt/LazyLong). If your type is int or long (not double/float), casting it to string

Building Custom RCFiles

2011-03-17 Thread Severance, Steve
Hi, I am working on building an MR job that generates RCFiles that will become partitions of a Hive table. I have most of it working; however, only strings (Text) are being deserialized inside of Hive. The Hive table is specified to use a ColumnarSerDe, which I thought should allow the writable typ