Edward,

Is distcp from the CFS cluster to the Hadoop cluster (push) going to work 
similarly to the example given?

Rk

On Feb 11, 2012, at 5:59 AM, Edward Capriolo <edlinuxg...@gmail.com> wrote:

> It mostly works as normal with one caveat. 
> http://www.edwardcapriolo.com/roller/edwardcapriolo/entry/possibly_the_worlds_first_briskcp
> 
> In the other direction hadoop may not know how to "talk" to cfs:/// without 
> having to install extra stuff. So this is where hftp:// comes in...
> 
> Copying between versions of HDFS
> 
> For copying between two different versions of Hadoop, one will usually use 
> HftpFileSystem. This is a read-only FileSystem, so DistCp must be run on the 
> destination cluster (more specifically, on TaskTrackers that can write to the 
> destination cluster). Each source is specified as 
> hftp://<dfs.http.address>/<path> (the default dfs.http.address is 
> <namenode>:50070).
> 
> Also, distcp can push or pull data, so you usually have a few options.
> 
> On Fri, Feb 10, 2012 at 2:56 PM, rk vishu <talk2had...@gmail.com> wrote:
> Could anyone tell me how we can copy data from a Cassandra-Brisk cluster to a 
> Hadoop-HDFS cluster?
> 
> 1) Is there a way to do hadoop distcp between clusters?
> 2) If a Hive table is created on the Brisk cluster, will it be similar to the 
> HDFS file format? Can we run MapReduce on the other cluster to transform Hive 
> data (on Brisk)?
> 
> Thanks and Regards
> RK
> 
> 
> 
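To make the push and pull options discussed in the thread concrete, here is a minimal sketch that only builds (and prints) the two distcp command lines; it does not run them. The host names, the paths, and the 8020 NameNode port are placeholders I made up for illustration, and the helper names are hypothetical. The hftp:// form follows the quoted DistCp documentation for copying between HDFS versions; whether a Brisk cluster can serve as an hftp source is not confirmed anywhere in this thread.

```python
# Sketch only: builds distcp argument lists, it does not execute anything.
# All hosts, ports (except the documented 50070 default) and paths are
# placeholders, not values taken from the thread.

def hftp_uri(namenode_host, path, http_port=50070):
    """Read-only source URI of the form hftp://<dfs.http.address>/<path>.

    50070 is the default dfs.http.address port per the quoted docs.
    """
    return "hftp://%s:%d/%s" % (namenode_host, http_port, path.lstrip("/"))


def distcp_command(source_uri, dest_uri):
    """Argument list for `hadoop distcp <source> <dest>`."""
    return ["hadoop", "distcp", source_uri, dest_uri]


# Pull: hftp is read-only, so this is run on the destination cluster.
pull = distcp_command(
    hftp_uri("source-nn.example.com", "/data/in"),
    "hdfs://dest-nn.example.com:8020/data/in",  # assumed NameNode RPC port
)

# Push: run on the Brisk side, where cfs:/// can be resolved.
push = distcp_command(
    "cfs:///data/in",
    "hdfs://dest-nn.example.com:8020/data/in",
)

print(" ".join(pull))
print(" ".join(push))
```

The push form assumes the Brisk nodes can reach the HDFS NameNode and DataNodes over the network; the pull form avoids installing anything CFS-specific on the Hadoop side, which is the trade-off raised above.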
