Did you end up finding the answer? How fast is this method of insertion relative to a simple insert of List<Put> ?
On 13 בנוב 2012, at 02:29, Bijieshan <[email protected]> wrote: > I think one possible reason is block caching. Have you turned the block > caching off during scanning? > > Regards, > Jieshan > ________________________________________ > From: Mohammad Tariq [[email protected]] > Sent: Tuesday, November 13, 2012 1:04 > To: [email protected] > Subject: Re: scan is slower after bulk load > > may be because bulk load writes to the same region thus putting the entire > load on a single region server. > > Regards, > Mohammad Tariq > > > > On Mon, Nov 12, 2012 at 9:15 PM, Michael Segel > <[email protected]>wrote: > >> Just a guess... have you done any compactions on the table post bulk load? >> >> On Nov 12, 2012, at 8:44 AM, Marcos Ortiz <[email protected]> wrote: >> >>> Regards, Amit. >>> Did you tuned the RegionServer where you has that data range hosted? >>> Why do you say that scans are slower after a bulk load? >>> Did you test it before bulk load? >>> >>> HBase version? >>> >>> On 11/12/2012 09:39 AM, Amit Sela wrote: >>>> Hi all, >>>> >>>> Anyone has any idea why scanning over specific range in a table is about >>>> 20% slower if that data (that specific range) was just inserted into >> HBase >>>> using bulk load ? >>>> >>>> I do the bulk load programmatically with LoadIncrementalHFiles. >>>> >>>> Thanks. >>>> >>> >>> -- >>> >>> Marcos Luis Ortíz Valmaseda >>> about.me/marcosortiz <http://about.me/marcosortiz> >>> @marcosluis2186 <http://twitter.com/marcosluis2186> >>> >>> >>> >>> 10mo. ANIVERSARIO DE LA CREACION DE LA UNIVERSIDAD DE LAS CIENCIAS >> INFORMATICAS... >>> CONECTADOS AL FUTURO, CONECTADOS A LA REVOLUCION >>> >>> http://www.uci.cu >>> http://www.facebook.com/universidad.uci >>> http://www.flickr.com/photos/universidad_uci >> >>
