Hadoop execution improvement in a heterogeneous environment

2009-07-14 Thread roman kolcun
Hello everyone, I've got an idea of how to improve the execution time of map phase in a heterogeneous environment (when other processes may run on the machines rendering it slower than other machines). Currently map phase process data in larger chunks (usually 64MB). At the end of map phase data as

[jira] Created: (HADOOP-6150) Need to be able to instantiate a comparator instance from a comparator string without creating a TFile.Reader object

2009-07-14 Thread Hong Tang (JIRA)
Need to be able to instantiate a comparator instance from a comparator string without creating a TFile.Reader object Key: HADOOP-6150 URL: https:/

Re: fastlz replacement for LZO

2009-07-14 Thread Owen O'Malley
On Tue, Jul 14, 2009 at 2:30 PM, william kinney wrote: > Sorry, I should mention, I'm using LZO but need to switch in the > future b/c of the GPL. If you have a patch that implements it, please file a jira and attach it. It would be good to have an Apache License compatible codec that is fast in

[jira] Created: (HADOOP-6149) FileStatus can support a fileid per path

2009-07-14 Thread dhruba borthakur (JIRA)
FileStatus can support a fileid per path Key: HADOOP-6149 URL: https://issues.apache.org/jira/browse/HADOOP-6149 Project: Hadoop Common Issue Type: New Feature Reporter: dhruba borthakur

Re: fastlz replacement for LZO

2009-07-14 Thread william kinney
Sorry, I should mention, I'm using LZO but need to switch in the future b/c of the GPL. On Tue, Jul 14, 2009 at 5:27 PM, william kinney wrote: > Hi, > > Does anyone know if fastlz is still intended to replace LZO? > > Thanks, > Will > > "Remove bindings to lzo" > https://issues.apache.org/jira/bro

fastlz replacement for LZO

2009-07-14 Thread william kinney
Hi, Does anyone know if fastlz is still intended to replace LZO? Thanks, Will "Remove bindings to lzo" https://issues.apache.org/jira/browse/HADOOP-4874