Re: How to write Block of queries in Hive?

Aniket Mokashi Wed, 04 Jan 2012 22:13:58 -0800

Hi Bhavesh,

[moving discussion to hive user list]

I would suggest sending your question to the Hive user list in order to
reach a broader audience.

As I understand it, map_script and reduce_script in that query are
custom scripts that Hive runs as streaming jobs. You are asking Hive to run
map_script in the map phase on 2 columns (userid and date) to generate 3 new
values: c1, c2 and c3. Hive then distributes the records to reducers based
on c2 (DISTRIBUTE BY c2) and sorts them on c2 and c1 within each reducer
(SORT BY c2, c1). 'reduce_script' consumes these 3 columns and generates
2 columns (date and count), which are stored in pv_users_reduced.
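
To make this concrete, here is a rough sketch of what such scripts could look
like in Python. Hive streams each row to the script on stdin as tab-separated
columns and reads tab-separated columns back from stdout. The file name
pv_scripts.py, the function names, and the meaning I give to c1, c2 and c3
(userid, date, and a count of 1) are my own assumptions for illustration;
they are not taken from the page you linked.

```python
import sys
from itertools import groupby


def map_rows(lines):
    """Hypothetical map_script logic: (userid, date) -> (c1, c2, c3)."""
    for line in lines:
        userid, date = line.rstrip("\n").split("\t")
        # Assumed meaning: c1 = userid, c2 = date, c3 = 1 (one page view).
        yield "\t".join([userid, date, "1"])


def reduce_rows(lines):
    """Hypothetical reduce_script logic: (c1, c2, c3) -> (date, count)."""
    rows = (line.rstrip("\n").split("\t") for line in lines)
    # DISTRIBUTE BY c2 / SORT BY c2, c1 means rows with the same date
    # reach the same reducer and arrive next to each other.
    for date, group in groupby(rows, key=lambda row: row[1]):
        total = sum(int(row[2]) for row in group)
        yield "\t".join([date, str(total)])


if __name__ == "__main__" and len(sys.argv) > 1:
    # Run as "python pv_scripts.py map" or "python pv_scripts.py reduce".
    step = map_rows if sys.argv[1] == "map" else reduce_rows
    for out_line in step(sys.stdin):
        print(out_line)
```

Under these assumptions you would register the file with ADD FILE and then
write USING 'python pv_scripts.py map' and USING 'python pv_scripts.py reduce'
in place of 'map_script' and 'reduce_script'.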

Hope it helps.

Thanks,
Aniket

On Wed, Jan 4, 2012 at 8:55 PM, Bhavesh Shah <bhavesh25s...@gmail.com> wrote:

> Hello,
> I am new to Hive. I want to write a block of queries in Hive so that one
> query gives its result to another one, like in SQL.
>
> I have also visited one link given below:
> http://karmasphere.com/ksc/hive-user-defined-functions.html
>
> In the above link I was looking for functions, but I found the block below
> and I don't understand the following parts:
>
> USING 'map_script' and USING 'reduce_script'
>
> in following block:
>
>
> FROM (
>  FROM pv_users
>  MAP ( pv_users.userid, pv_users.date )
>  USING 'map_script'
>  AS c1, c2, c3
>  DISTRIBUTE BY c2
>  SORT BY c2, c1) map_output
>  INSERT OVERWRITE TABLE pv_users_reduced
>  REDUCE ( map_output.c1, map_output.c2, map_output.c3 )
>  USING 'reduce_script'
>  AS date, count;
>
>
> Please, can anyone tell me what the scripts are used for and how to write
> a block of queries in Hive?
>
>
>
>
> --
> Regards,
> Bhavesh Shah
>

-- 
"...:::Aniket:::... Quetzalco@tl"
