Hi,

I am a Hive newbie. I just finished setting up Hive on a cluster of two servers for my organisation. As a test drill, we ran some simple queries. It took the standard map-reduce job around 4 minutes just to execute this query:

select count(1) from tablename;

The answer returned was around 2200. Clearly, this is not a big number by Hadoop standards. My question is whether this is standard performance, or is there some configuration that is not optimised? Will scaling the data up to, say, 50 times its current size produce any drastic slowdown? I tried reading the documentation but was not clear on these issues, and I would like to have an idea before this setup goes into a production environment.

Thanks in advance,
Regards,
Abhishek Pathak