It is the
responsibility of the recipient to ensure that this email is virus free,
therefore neither Peridale Technology Ltd, its subsidiaries nor their employees
accept any responsibility.
From: Philip Lee [mailto:philjj...@gmail.com]
Sent: 02 February 2016 16:10
To: user@hive.apache.o
ient to ensure that this email is virus
> free, therefore neither Peridale Technology Ltd, its subsidiaries nor their
> employees accept any responsibility.
>
>
>
> *From:* Lefty Leverenz [mailto:leftylever...@gmail.com]
> *Sent:* 02 February 2016 10:26
>
> *To:* user
nsibility.
From: Lefty Leverenz [mailto:leftylever...@gmail.com]
Sent: 02 February 2016 10:26
To: user@hive.apache.org
Subject: Re: ORC format
Can't resist teasing Mich about this: "Indeed one often demoralises data
taking advantages of massive parallel processing in Hive."
ale Technology
> Ltd, its subsidiaries or their employees, unless expressly so stated. It is
> the responsibility of the recipient to ensure that this email is virus
> free, therefore neither Peridale Technology Ltd, its subsidiaries nor their
> employees accept any responsibility.
>
&g
heir employees
accept any responsibility.
From: Alan Gates [mailto:alanfga...@gmail.com]
Sent: 01 February 2016 17:07
To: user@hive.apache.org
Subject: Re: ORC format
ORC does not currently expose a primary key to the user, though we have talked
of having it do that. As Mich says the i
ORC does not currently expose a primary key to the user, though we have
talked of having it do that. As Mich says the indexing on ORC is
oriented towards statistics that help the optimizer plan the query.
This can be very important in split generation (determining which parts
of the input wil
shall not be understood as given or endorsed by Peridale Technology Ltd, its
subsidiaries or their employees, unless expressly so stated. It is the
responsibility of the recipient to ensure that this email is virus free,
therefore neither Peridale Technology Ltd, its subsidiaries nor their employees
a
essly so stated. It is the
responsibility of the recipient to ensure that this email is virus free,
therefore neither Peridale Technology Ltd, its subsidiaries nor their employees
accept any responsibility.
From: Philip Lee [mailto:philjj...@gmail.com]
Sent: 01 February 2016 15:49
To: user@hive.a
What do you mean by the silver bullet? so you mean it is not that stored as
primary key on each column. It is just stored as storage indexing, right?
"The statistics helps the optimiser. So whether one table or many, the
optimiser will take advantage of stats to push down the predicate for
faster
Also,
when making ORC from CSV,
for indexing every key on each coulmn is made, or a primary on a table is
made ?
If keys are made on each column in a table, accessing any column in some
functions like filtering should be faster.
On Mon, Feb 1, 2016 at 4:21 PM, Philip Lee wrote:
> Hello,
>
> I e
heir employees
accept any responsibility.
From: Philip Lee [mailto:philjj...@gmail.com]
Sent: 01 February 2016 15:27
To: user@hive.apache.org
Subject: Re: ORC format
Also,
when making ORC from CSV,
for indexing every key on each coulmn is made, or a primary on a table is made ?
If keys ar
Hi,
Orc table use what is known as storage index with stats (min, max. sum etc)
stored at the table, stripe and rowindex (rows of 10K batches) level. The
statistics helps the optimiser. So whether one table or many, the optimiser
will take advantage of stats to push down the predicate for fa
12 matches
Mail list logo