Hi
Since you are on a pseudo distributed/ single node environment the hadoop
mapreduce parallelism is limited.
You might be having just a few map slots and map tasks might be in queue
waiting for others to complete. In a larger cluster your job should be faster.
As a side note, Certain SQL que
Hi
Since you are on a pseudo distributed/ single node environment the hadoop
mapreduce parallelism is limited.
You might be having just a few map slots and map tasks might be in queue
waiting for others to complete. In a larger cluster your job should be faster.
Certain SQL queries that uliliz
nnie.
From: Gobinda Paul [mailto:gobi...@live.com]
Sent: Tuesday, March 12, 2013 11:01 AM
To: user@hive.apache.org
Subject: RE: Getting Slow Query Performance!
Thnx for your reply , i am new to hadoop and hive .My goal is to process a big
data using hadoop,
this is my university project ( Data Mi
many node is
required to show the performance and what type of configuration is required
for each node.
From: bsc...@ebuddy.com
To: user@hive.apache.org
CC: gobi...@live.com
Date: Tue, 12 Mar 2013 10:40:33 +0100
Subject: RE: Getting Slow Query Performance!
Generally a single hadoop machine
Generally a single hadoop machine will perform worse then a single mysql
machine. People normally use hadoop when they have so much data it won't really
fit on a single machine and it would require specialized hardware (Stuff like
SAN's) to run.
30GB of data really isn't that much and 2GB of ram