Significance of file.out.index during Shuffle Phase ?

2012-08-18 Thread Pavan Kulkarni
*file.out.index* play any role? Any help is appreciated. Thanks -- --With Regards Pavan Kulkarni

Re: Checksum Error during Reduce Phase hadoop-1.0.2

2012-08-16 Thread Pavan Kulkarni
uld be caused by a corrupt disk - which is why checking > if it's happening on a specific node(s) can help. > > > > Arun > > > > On Aug 16, 2012, at 10:04 AM, Pavan Kulkarni wrote: > > > >> Harsh, > >> > >> I see this on couple

Re: Checksum Error during Reduce Phase hadoop-1.0.2

2012-08-16 Thread Pavan Kulkarni
10, 2012 at 11:43 PM, Pavan Kulkarni > wrote: > > Hi, > > > > I am running a Terasort with a cluster of 8 nodes.The map phase > completes > > but when the reduce phase is around 68-70% I get this following error. > > >

Re: Setting number of parallel Reducers and Mappers for optimal performance

2012-08-10 Thread Pavan Kulkarni
with 3G/4G each depending on how you > want to slice/dice your slots. > > Arun > > On Aug 10, 2012, at 1:24 PM, Pavan Kulkarni wrote: > > > Hi, > > > > I was trying to optimize Hadoop-1.0.2 performance by setting > > *mapred.tasktracker.map.tasks.maximum &

Setting number of parallel Reducers and Mappers for optimal performance

2012-08-10 Thread Pavan Kulkarni
if my assumption is wrong. Also, please suggest a way to get the optimal performance by setting these parameters. Thanks. -- --With Regards Pavan Kulkarni

Checksum Error during Reduce Phase hadoop-1.0.2

2012-08-10 Thread Pavan Kulkarni
sue is appreciated. Thanks -- --With Regards Pavan Kulkarni

Re: Shifting to Java 7 . Is it good choice?

2012-07-17 Thread Pavan Kulkarni
and for this I needed few packages > >>which > >> are > >> only present in Java 7 like "java.nio.file" , So I was wondering If I > >>can > >> shift my > >> development environment of Hadoop to Java 7? Would this break anything ? > >openjdk 7 works, but nio async file access is slower than traditional. > > -- --With Regards Pavan Kulkarni

Shifting to Java 7 . Is it good choice?

2012-07-17 Thread Pavan Kulkarni
Hi, I have to tweak a few classes, and for this I need a few packages that are only present in Java 7, like "java.nio.file". So I was wondering if I can shift my Hadoop development environment to Java 7? Would this break anything? Thanks -- --With Regards Pavan Kulkarni

Where are the Map-output files produced ?

2012-07-16 Thread Pavan Kulkarni
couldn't find out the fully specified path of the files on the Map side, i.e., the names of the partitioned Map-output files. Does anyone have any idea how to find out the fully qualified pathnames of these files? Any help is highly appreciated. Thanks -- --With Regards Pavan Kulkarni

debugging a class hadoop-1.0.2 ,log4j.properties

2012-07-12 Thread Pavan Kulkarni
ebug messages in the TaskTracker log file. Any idea about how to debug a class? Thanks -- --With Regards Pavan Kulkarni

Re: Problem setting up 1st generation Hadoop-0.20 (ANT build) in Eclipse

2012-07-11 Thread Pavan Kulkarni
Display Socket belongs to a different user, then the error pops up. So to avoid this, execute "*xhost +local:all*" in the terminal of the user who owns the X Display socket and then try issuing ant from root. It works fine. This does have some security concerns but this is the easiest work

Re: Problem setting up 1st generation Hadoop-0.20 (ANT build) in Eclipse

2012-07-10 Thread Pavan Kulkarni
Hey Pavan, > > Try an "ant tar". For more ant targets, read the build.xml at the root > of your checkout. > > On Wed, Jul 11, 2012 at 1:15 AM, Pavan Kulkarni > wrote: > > Thanks a lot Harsh. I could set it up without any errors. > > It would be great if you could

Re: Problem setting up 1st generation Hadoop-0.20 (ANT build) in Eclipse

2012-07-10 Thread Pavan Kulkarni
$ git clone http://github.com/apache/hadoop-common.git hadoop-1 > $ cd hadoop-1; git checkout branch-1 > $ ant eclipse > > (Now export this directory into Eclipse as a Java project) > > HTH. > > On Wed, Jul 11, 2012 at 12:00 AM, Pavan Kulkarni > wrote: > > Hi al

Problem setting up 1st generation Hadoop-0.20 (ANT build) in Eclipse

2012-07-10 Thread Pavan Kulkarni
? Also, my second question was how to build a binary tar file for hadoop-0.20, which still uses ANT. The wiki pages only have information for Maven. Any help is highly appreciated. Thanks -- --With Regards Pavan Kulkarni

Re: No mapred-site.xml in the hadoop-0.23.3 distribution

2012-07-09 Thread Pavan Kulkarni
all kind of > needed. The exact configs that you need to set may vary a lot based off > of what you are trying to do. > --Bobby Evans > > On 7/6/12 6:58 PM, "Pavan Kulkarni" wrote: > > >Hi Robert, > > > > Can you please share what all configuration file

Re: JAVA_HOME setup error in Hadoop-0.23.3 single node

2012-07-09 Thread Pavan Kulkarni
this thread? > > Thanks! :) > > On Mon, Jul 9, 2012 at 10:24 PM, Pavan Kulkarni > wrote: > > I resolved this issue. Sorry for the inconvenience. > > > > On Mon, Jul 9, 2012 at 11:58 AM, Pavan Kulkarni >wrote: > > > >> Hi all, > >> > >> I

Re: JAVA_HOME setup error in Hadoop-0.23.3 single node

2012-07-09 Thread Pavan Kulkarni
I resolved this issue. Sorry for the inconvenience. On Mon, Jul 9, 2012 at 11:58 AM, Pavan Kulkarni wrote: > Hi all, > > I was trying to setup Hadoop-0.23.3 which is the 2nd generation release > on single node. > I configured all the files as specified and also set the JAVA_HOM

JAVA_HOME setup error in Hadoop-0.23.3 single node

2012-07-09 Thread Pavan Kulkarni
: Error: JAVA_HOME is not set and could not be found.* I have checked the settings and the .bashrc file and everything seems to be correct. I also searched online for a solution but couldn't find much. So I would really appreciate it if anyone knows how to resolve this. Thanks -- --With Regards

Re: No mapred-site.xml in the hadoop-0.23.3 distribution

2012-07-06 Thread Pavan Kulkarni
Perhaps others on > the list might know better than I do. > > On 7/6/12 12:05 PM, "Pavan Kulkarni" wrote: > > >Bobby, > > > > Thanks a lot for your clarification. > >Yes as you said it is just a template, but it may > >be quite confusin

Re: No mapred-site.xml in the hadoop-0.23.3 distribution

2012-07-06 Thread Pavan Kulkarni
od, but I don't know how quickly we will be able to get around > to doing it. > > --Bobby Evans > > On 7/5/12 7:23 PM, "Pavan Kulkarni" wrote: > > >Hi, > > > > I downloaded the Hadoop-0.23.3 source and tweaked a few classes and > >when I

[jira] [Created] (HADOOP-8575) No mapred-site.xml present in the configuration directory. This is very trivial but thought would be less confusing for a new user if it came packaged.

2012-07-06 Thread Pavan Kulkarni (JIRA)
Pavan Kulkarni created HADOOP-8575: -- Summary: No mapred-site.xml present in the configuration directory. This is very trivial but thought would be less confusing for a new user if it came packaged. Key: HADOOP-8575

No mapred-site.xml in the hadoop-0.23.3 distribution

2012-07-05 Thread Pavan Kulkarni
configured right? So I was just wondering if we are supposed to create the mapred-site.xml, or whether it doesn't exist at all? Thanks -- --With Regards Pavan Kulkarni

Re: Doubt regarding use of HTTP during Shuffle phase

2012-06-27 Thread Pavan Kulkarni
Oh. Thanks a lot, Owen. I'll have a look into it. On Wed, Jun 27, 2012 at 10:21 AM, Owen O'Malley wrote: > Pavan, > This is a very big project. Look at the users of IFile.java. IFile is the > format for storing the shuffle outputs. > > -- Owen > -- --With Regards Pavan Kulkarni

Re: Doubt regarding use of HTTP during Shuffle phase

2012-06-27 Thread Pavan Kulkarni
52 AM, Roman Shaposhnik wrote: > On Wed, Jun 27, 2012 at 9:44 AM, Pavan Kulkarni > wrote: > > Yes you are correct, but we can use Lustre FS and it does scale right? > > I am new to this so please excuse if I am wrong in some assumptions. > > You can use hybrid approaches, bu

Re: Doubt regarding use of HTTP during Shuffle phase

2012-06-27 Thread Pavan Kulkarni
Yes, you are correct, but we can use Lustre FS and it does scale, right? I am new to this, so please excuse me if I am wrong in some assumptions. Thanks On Wed, Jun 27, 2012 at 9:40 AM, Owen O'Malley wrote: > On Wed, Jun 27, 2012 at 9:33 AM, Pavan Kulkarni >wrote: > > Why is HTT

Doubt regarding the classes to look for tweaking Reducer during Shuffle phase

2012-06-07 Thread Pavan Kulkarni
stated problem?* I went through a couple of books but couldn't find much detailed information. Looking into the source code, I felt it must be the OutputCollector class. Please correct me if I am wrong. Any help or pointers are highly appreciated. Thanks. -- --With Regards Pavan Kulkarni

Doubt regarding finding Databaode --> Block map.

2011-11-18 Thread Pavan Kulkarni
highly appreciated. Thanking You, -- With Regards Pavan Kulkarni

Help:Integrating Hadoop with Openstack

2011-09-14 Thread Pavan Kulkarni
, can you suggest some ideas involving Hadoop that have not yet been implemented? I went through http://wiki.apache.org/hadoop/ProjectSuggestions for ideas but didn't find any concrete development project. -- With Regards Pavan Kulkarni