Hello,
I have Eucalyptus 1.6.2 installed from source on Ubuntu 10.04 with KVM. Currently I have ten nodes in my cloud, in a single-cluster architecture.
I have also tested Hadoop on VMs and run several jobs there.
I am now trying to run Hadoop in a cloud environment, so I will launch Hadoop instances on the cloud. Each Hadoop node holds a large amount of data, so for now I plan to use volumes to store the data of each instance, i.e. each Hadoop node. But since volumes live on the Storage Controller, this means a continuous movement of data (many GBs) across the cloud network from the SC to the nodes, and the response time of work done on the Hadoop instances will suffer because of the time the data spends travelling over the network.
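For concreteness, this is roughly what I plan to do for each instance right now (size, zone, IDs, device name and mount point below are just placeholders; on KVM the device may show up under a different name inside the guest):

    # from a euca2ools client: create a volume and attach it to the instance
    euca-create-volume -s 100 -z cluster01
    euca-attach-volume -i i-XXXXXXXX -d /dev/sdb vol-XXXXXXXX

    # inside the instance: format and mount the volume
    mkfs.ext3 /dev/sdb
    mkdir -p /mnt/hdfs-data
    mount /dev/sdb /mnt/hdfs-data
    # then point dfs.data.dir in conf/hdfs-site.xml at /mnt/hdfs-data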
So, is it possible to store volumes (or use some other mechanism) directly on the nodes, so that the above problem can be avoided?
Second case: I could store the data on the hard disks attached to the nodes, and the Hadoop instances could access that data easily, but for that I would need to start each instance on the node where its data is stored. So, is there any hack, or any other way, to choose the node on which an instance is started? One idea I had is sketched below.
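As far as I can tell, SCHEDPOLICY in eucalyptus.conf only switches between GREEDY and ROUNDROBIN, so it cannot target a particular node. The only hack I have come up with so far (untested, and the hostnames and image ID are placeholders) is to temporarily de-register every node except the target one on the cluster controller, launch the instance so it has nowhere else to land, and then register the other nodes again:

    # on the cluster controller, as root (node01 is the target node)
    euca_conf --deregister-nodes "node02 node03 node04"

    # from the client: the instance can now only be scheduled on node01
    euca-run-instances -n 1 -t c1.medium -k mykey emi-XXXXXXXX

    # afterwards, register the other nodes again
    euca_conf --register-nodes "node02 node03 node04"

Would this work, or is there a cleaner way?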
Can anyone with working experience of Hadoop in a cloud environment give me some pointers? I would really appreciate any sort of support on this.
Finally, is it worthwhile to do this at all? I previously received a response along these lines:
My earlier question:
> Is it possible to run Hadoop in VMs on production clusters, so that we have 10000s of nodes on 100s of servers and achieve high performance through cloud computing?

The reply:
> You don't achieve performance that way. You are better off with one VM per physical host, and you will need to talk to a persistent filestore for the data you want to retain. Running more than one VM per physical host just creates conflict for things like disk, network and CPU that the virtual OS won't be aware of. Also, VM-to-disk performance is pretty bad right now, though that's improving.
Thanks & Regards
Adarsh Sharma