Re: Moving TB of data from NFS to HDFS

2012-01-24 Thread Ajit Ratnaparkhi
Hi Raj, if you have all the data on an NFS-mounted disk, meaning on a single machine, then your upload will be limited by network bandwidth. You can try running dfs -put in multiple parallel threads for distinct data sets; you might be able to utilise the network bandwidth to its maximum (take care not to have
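
A minimal sketch of the parallel `-put` idea above, assuming the data is already split into distinct files or directories and that the `hadoop` command is on the PATH. The function names and the `runner` parameter are hypothetical and exist only for this illustration; the worker count is a guess to tune against your network link.

```python
import subprocess
from concurrent.futures import ThreadPoolExecutor


def build_put_command(local_path, hdfs_dir):
    # `hadoop fs -put <localsrc> <dst>` copies a local path into HDFS.
    return ["hadoop", "fs", "-put", local_path, hdfs_dir]


def run_command(cmd):
    # Runs the command and returns its exit code (0 means success).
    return subprocess.call(cmd)


def parallel_put(local_paths, hdfs_dir, workers=4, runner=run_command):
    """Run one `-put` per path, at most `workers` at a time.

    Returns a dict mapping each local path to the exit code of its upload.
    `runner` is injectable so the logic can be exercised without a cluster.
    """
    commands = [build_put_command(path, hdfs_dir) for path in local_paths]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # pool.map preserves input order, so results line up with local_paths.
        exit_codes = list(pool.map(runner, commands))
    return dict(zip(local_paths, exit_codes))
```

Calling `parallel_put(["/mnt/nfs/set1", "/mnt/nfs/set2"], "/user/raj/incoming")` (both paths are made up) would start two uploads at once. Since every upload still leaves through the one machine's network interface, adding workers beyond the point where the link is saturated should not help.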

[jira] [Resolved] (HADOOP-7989) [ec2] hadoop Could not create the Java virtual machine

2012-01-24 Thread Jai Kumar Singh (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-7989?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jai Kumar Singh resolved HADOOP-7989. Resolution: Not A Problem. It was a setting problem. (by whirr) ulimit was not set to unl

Re: Moving TB of data from NFS to HDFS

2012-01-24 Thread Praveen Sripati
> If it is divided up into several files and you can mount your NFS directory on each of the datanodes. Just curious, how will this help? Praveen On Wed, Jan 25, 2012 at 12:39 AM, Robert Evans wrote: > If it is divided up into several files and you can mount your NFS > directory on each of the

Re: Modifying source code of hadoop

2012-01-24 Thread Harsh J
Unsure what could cause that, but I'd first check if Cygwin's bin/ items are on my Windows PATH. FWIW, if you are stepping into development, and your development isn't Windows-specific, it's best to use any modern Linux distro as your environment instead. Ubuntu works great for starters. I prefer usi

[jira] [Created] (HADOOP-7991) HA: the FailoverController should check the standby is ready before failing over

2012-01-24 Thread Eli Collins (Created) (JIRA)
HA: the FailoverController should check the standby is ready before failing over Key: HADOOP-7991 URL: https://issues.apache.org/jira/browse/HADOOP-7991 Project: Hadoop C

[jira] [Resolved] (HADOOP-7571) hadoop-config.sh needs to be updated post mavenization

2012-01-24 Thread Eli Collins (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/HADOOP-7571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eli Collins resolved HADOOP-7571. Resolution: Fixed. Looks like this was fixed. > hadoop-config.sh needs to be upd

Re: Modifying source code of hadoop

2012-01-24 Thread Samaneh Shokuhi
Hello, I am new to Hadoop as well. I just followed the instructions, got the source code and tried to build it with Maven. During mvn install, the build failed with an error which says: [ERROR] Failed to execute goal org.codehaus.mojo:exec-maven-plugin:1.2:exec on project hadoop-yarn-common

[jira] [Created] (HADOOP-7990) TFile to allow Snappy compression

2012-01-24 Thread Prashant Kommireddi (Created) (JIRA)
TFile to allow Snappy compression - Key: HADOOP-7990 URL: https://issues.apache.org/jira/browse/HADOOP-7990 Project: Hadoop Common Issue Type: Improvement Reporter: Prashant Kommireddi TFile curr

Re: Moving TB of data from NFS to HDFS

2012-01-24 Thread Robert Evans
If it is divided up into several files and you can mount your NFS directory on each of the datanodes, you could possibly use distcp to do it. I have never tried using distcp for this, but it should work. Or you can write your own streaming Map/Reduce script that does more or less the same thin

Re: getting hadoop job resource usage statistics

2012-01-24 Thread Arun C Murthy
You can currently get CPU & memory stats for each task and aggregated stats per job via MapReduce Counters. Arun On Jan 24, 2012, at 7:27 AM, arindam choudhury wrote: > Hi, > > How can I get CPU, memory, network and disk usage statistics of a hadoop > job? > > Thanks, > Arindam
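
As a rough sketch of reading those counters from a script, the snippet below shells out to `hadoop job -counter <job-id> <group-name> <counter-name>`. The counter group and counter names are assumptions that vary between Hadoop versions, the helper names are hypothetical, and the parsing simply assumes the value is printed as a bare integer on its own line, so check all of this against what your cluster actually reports.

```python
import subprocess

# Assumed group and counter names; they differ between Hadoop versions.
TASK_COUNTER_GROUP = "org.apache.hadoop.mapreduce.TaskCounter"
RESOURCE_COUNTERS = (
    "CPU_MILLISECONDS",
    "PHYSICAL_MEMORY_BYTES",
    "VIRTUAL_MEMORY_BYTES",
)


def build_counter_command(job_id, group, counter):
    # `hadoop job -counter` prints the value of a single counter for a job.
    return ["hadoop", "job", "-counter", job_id, group, counter]


def parse_counter_value(output):
    # Take the last line that is a plain non-negative integer;
    # return None if no such line exists (for example, only log output).
    for line in reversed(output.splitlines()):
        line = line.strip()
        if line.isdigit():
            return int(line)
    return None


def run_and_capture(cmd):
    # Runs the command and returns its standard output as text.
    return subprocess.check_output(cmd).decode("utf-8", "replace")


def job_resource_counters(job_id, runner=run_and_capture):
    """Return {counter_name: value or None} for the assumed counters."""
    stats = {}
    for counter in RESOURCE_COUNTERS:
        cmd = build_counter_command(job_id, TASK_COUNTER_GROUP, counter)
        stats[counter] = parse_counter_value(runner(cmd))
    return stats
```

These are job-level aggregates of CPU and memory only; nothing in the reply above suggests that network or disk usage is available the same way.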

getting hadoop job resource usage statistics

2012-01-24 Thread arindam choudhury
Hi, How can I get CPU, memory, network and disk usage statistics of a hadoop job? Thanks, Arindam

Re: Modifying source code of hadoop

2012-01-24 Thread Harsh J
Ashok, Following http://wiki.apache.org/hadoop/HowToContribute should get you started with development. Let us know if you have any further, specific questions once you get ahead. On Tue, Jan 24, 2012 at 7:30 PM, ashok chandrasekar wrote: > Hi, > I am new to hadoop. I want to make a few changes to

Modifying source code of hadoop

2012-01-24 Thread ashok chandrasekar
Hi, I am new to Hadoop. I want to make a few changes to the Hadoop source code for my college project. How can I do this? How do I compile and test the modified code? What tools are needed for this? Your reply will be of great help to me. Thanks in advance.