Re: Aggregated table larger than expected

2016-06-29 Thread Matt Olson
. The latter case is more conducive to storing compactly.

On Thu, Jun 23, 2016 at 2:37 PM, Matt Olson wrote:
> Hi,
>
> I am working with an hourly table and a daily table in Hive 1.0.1. Both
> tables have the same schema except that the hourly table is partitioned by
> dt and hour

Aggregated table larger than expected

2016-06-23 Thread Matt Olson
Hi, I am working with an hourly table and a daily table in Hive 1.0.1. Both tables have the same schema except that the hourly table is partitioned by dt and hour, but the daily table is partitioned only by dt. At the end of each day, the records from the hourly table are aggregated into the daily

Re: Container out of memory: ORC format with many dynamic partitions

2016-05-02 Thread Matt Olson
d for sorting. Try running with the defaults
> for hive.exec.orc.memory.pool.
>
> Thanks
> Prasanth
>
> On May 2, 2016, at 9:41 PM, Matt Olson wrote:
>
> Hi Prasanth,
>
> Thank you for the helpful information. I have been using the default ORC
> stripe size, which

Re: Container out of memory: ORC format with many dynamic partitions

2016-05-02 Thread Matt Olson
reduce, reduce the
> compression buffer size.
>
> Because of the bug outlined in HIVE-12893, in your case the optimization
> to have single orc writer is not kicking in causing OOM.
>
> Thanks
> Prasanth
>
> On May 2, 2016, at 3:30 PM, Matt Olson wrote:
>
> Hi Prasan

Re: Container out of memory: ORC format with many dynamic partitions

2016-05-02 Thread Matt Olson
-only maybe date +
> title_type, but adding 2000+ dynamic partitions that each have 300 rows of
> data in them is asking for problems in Hive IMO...
>
>
> *From:* Matt Olson [mailto:maolso...@gmail.com]
> *Sent:* Friday, April 29, 2016 7:50 PM
> *To:* user@hive.apache.org

Container out of memory: ORC format with many dynamic partitions

2016-04-29 Thread Matt Olson
Hi all, I am using Hive 1.0.1 and trying to do a simple insert into an ORC table, creating dynamic partitions. I am selecting from a table partitioned by dt and category, and inserting into a table partitioned by dt, title, and title_type. Other than the partitioning, the tables have the same sche