RE: Single Map task for Hive queries

2011-08-16 Thread Steven Wong
all Hadoop configs. From: Jon Bender [mailto:jonathan.ben...@gmail.com] Sent: Monday, August 15, 2011 11:08 AM To: user@hive.apache.org Subject: Re: Single Map task for Hive queries Yeah MapReduce itself is set up to use all of my task trackers--only one Map Task gets created one the external table q

Re: Single Map task for Hive queries

2011-08-15 Thread Jon Bender
Yeah MapReduce itself is set up to use all of my task trackers--only one Map Task gets created one the external table queries. I tried querying another external table (composed of some 20 files) and it created 20 map tasks in turn during the query. I will try the LINES TERMINATED BY clause next t

Re: Single Map task for Hive queries

2011-08-15 Thread Loren Siebert
You should not have to do anything special to Hive to make it use all of your TT’s. The actual MR job should be governed by your mapred-site.xml file. When you run sample MR jobs (like the Pi example) and look at the job tracker, are you seeing all your TT’s getting used? On Aug 15, 2011, at 10

Re: Single Map task for Hive queries

2011-08-15 Thread Ayon Sinha
, August 15, 2011 10:47 AM Subject: Re: Single Map task for Hive queries It's actually just an uncompressed UTF-8 text file. This was essentially the create table clause: CREATE EXTERNAL TABLE foo  ROW FORMAT DELIMITED  STORED AS TEXTFILE LOCATION '/data/foo' Using Hive 0.7. On Mon,

Re: Single Map task for Hive queries

2011-08-15 Thread Jon Bender
It's actually just an uncompressed UTF-8 text file. This was essentially the create table clause: CREATE EXTERNAL TABLE foo ROW FORMAT DELIMITED STORED AS TEXTFILE LOCATION '/data/foo' Using Hive 0.7. On Mon, Aug 15, 2011 at 10:37 AM, Loren Siebert wrote: > Is your external file compressed wit

Re: Single Map task for Hive queries

2011-08-15 Thread Loren Siebert
Is your external file compressed with GZip or BZip? Those file formats aren’t splittable, so they get assigned to one mapper. On Aug 15, 2011, at 10:23 AM, Jon Bender wrote: > Hello, > > I have external tables in Hive stored in a single flat text file. When I > execute queries against it, al