debugging tips

2011-04-07 Thread Viral Bajaria
Hello, I have been trying to optimize one of my longer running queries using a MAPJOIN hint. The query is fairly complex and it joins my base table (1+ billion rows) with multiple metadata tables (which are relatively small in size). I already use a STREAMTABLE hint for my large table and have pr

Mapper OOMs disappear after disabling JVM reuse

2011-04-07 Thread Steven Wong
When the following query was run with mapred.job.reuse.jvm.num.tasks=20, some of the map tasks failed with "Error: Java heap space", causing the job to fail. After changing to mapred.job.reuse.jvm.num.tasks=1, the job succeeded. FROM ( FROM intable1 SELECT acct_id, esn) b JOIN ( FROM

Re: job counters in customized hive map/reduce script.

2011-04-07 Thread Carl Steinbach
Right now Hive's TRANSFORM mechanism does not allow you set counters. There's a ticket open for adding this feature: https://issues.apache.org/jira/browse/HIVE-1649 Thanks. Carl On Thu, Apr 7, 2011 at 10:35 AM, Avram Aelony wrote: > > Can you try using using streaming from Hive? > >htt

Re: job counters in customized hive map/reduce script.

2011-04-07 Thread Avram Aelony
Can you try using using streaming from Hive? http://wiki.apache.org/hadoop/Hive/LanguageManual/Transform Cheers, ~Avram On Apr 7, 2011, at 10:28 AM, Henry Liu wrote: > > I'd like to report some counters in my customized map and reduce ruby scripts > for hive, just like hadoop strea

job counters in customized hive map/reduce script.

2011-04-07 Thread Henry Liu
I'd like to report some counters in my customized map and reduce ruby scripts for hive, just like hadoop streaming does by writing to stderr. Such as. #increment hadoop counters $stderr.puts("reporter:counter:Impression,Total,1") if first_click == 0 && click_details.empty?

Re: Text output separator for Hive Output

2011-04-07 Thread Jasper Knulst
Thanks Edward, That worked. 2011/4/7 Edward Capriolo > On Thu, Apr 7, 2011 at 8:38 AM, Jasper Knulst > wrote: > > Hi, > > How would I set the field separator for Hive output to files? > > I see that the default is a space (or tab, don't know exactly) but I > would > > like to use another ch

Re: Text output separator for Hive Output

2011-04-07 Thread Edward Capriolo
On Thu, Apr 7, 2011 at 8:38 AM, Jasper Knulst wrote: > Hi, > How would I set the field separator for Hive output to files? > I see that the default is a space  (or tab, don't know exactly) but I would > like to use another character to facilitate loading of the output in another > system. > I alre

Text output separator for Hive Output

2011-04-07 Thread Jasper Knulst
Hi, How would I set the field separator for Hive output to files? I see that the default is a space (or tab, don't know exactly) but I would like to use another character to facilitate loading of the output in another system. I already tried set mapred.textoutputformat.separator='~'; But this

Re: How To Use Hive

2011-04-07 Thread Geoff Howard
Well, I'm new here but I can point you to the docs as well as old timers probably.  First, I think the developers would prefer that you only direct questions like this to the users list, not to both.  Other answers are inline below: On Thu, Apr 7, 2011 at 12:00 AM, komara nagarjuna wrote: > > In