How to load quote-separated fields?

2011-09-22 Thread Mark Kerzner
Hi, I have an apache web log (sample below), and want to LOAD DATA INPATH. My fields are separated by a space, and those that contains spaces are enclosed in quotes. I tried this, ROW FORMAT DELIMITED FIELDS TERMINATED BY " " COLLECTION ITEMS TERMINATED BY '"' MAP KEYS TERMINATED BY "," but

Re: problems with indices

2011-09-22 Thread rohan monga
Hi Prajakta, I get it now. But are there plans to support groupbys of that format, or would I be able to write a UDF of sorts to do that? Also, could you please give me an example of accelerating equi-joins with indices, I haven't been able to get that to work either :( Regards, -- Rohan Monga

\x01 considered harmful

2011-09-22 Thread Adriaan Tijsseling
I've been having problems getting data properly into something like CREATE TABLE raw(a string, b string, c string) ROW FORMAT DELIMITED FIELDS TERMINATED BY '\001' STORED AS TEXTFILE; using a script for the INSERT OVERWRITE call. Only column a data would be filled. The problems whent away when I

Re: Reducer hanging ( swapping? )

2011-09-22 Thread john smith
Hi, I am CC'ing this to hive-user as well . I tried to do a simple join between two tables 2.2GB and 137MB. select count(*) from A JOIN B ON (A.a = B.b); The query ran for 7 hours . I am sure this is not normal. The reducer gets stuck at reduce > reduce phase . Map, copy phases complete just in