Hi,
I have an apache web log (sample below), and want to LOAD DATA INPATH.
My fields are separated by a space, and those that contains spaces are
enclosed in quotes.
I tried this,
ROW FORMAT DELIMITED
FIELDS TERMINATED BY " "
COLLECTION ITEMS TERMINATED BY '"'
MAP KEYS TERMINATED BY ","
but
Hi Prajakta,
I get it now. But are there plans to support groupbys of that format,
or would I be able to write a UDF of sorts to do that?
Also, could you please give me an example of accelerating equi-joins
with indices, I haven't been able to get that to work either :(
Regards,
--
Rohan Monga
I've been having problems getting data properly into something like
CREATE TABLE raw(a string, b string, c string) ROW FORMAT DELIMITED FIELDS
TERMINATED BY '\001' STORED AS TEXTFILE;
using a script for the INSERT OVERWRITE call.
Only column a data would be filled. The problems whent away when I
Hi,
I am CC'ing this to hive-user as well .
I tried to do a simple join between two tables 2.2GB and 137MB.
select count(*) from A JOIN B ON (A.a = B.b);
The query ran for 7 hours . I am sure this is not normal. The reducer gets
stuck at reduce > reduce phase . Map, copy phases complete just in