Oh, sorry. You're right. You already said that and I forgot to update
it. It's working fine when I add this parameter. And as you are
saying, I can get the response time I want by playing with the
chance...
I get (34758 lines/second) with 0.99 as the chance, and only (7564
lines/second) with 0.09
For the filter list my guess is that you're filtering out all rows
because RandomRowFilter#chance is not initialized (it should be
something like RandomRowFilter rrf = new RandomRowFilter(0.5);)
But note that this test will never be comparable to the test with a
list of gets. You can make it as slo
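To illustrate the point about the uninitialized chance, here is a minimal plain-Java sketch. It does not use HBase at all: `RandomRowFilterSketch` and `countPassing` are hypothetical names, and the logic only assumes that the filter keeps a row when a random float in [0, 1) is below the chance. Under that assumption, a chance left at 0 drops every row, and the share of rows returned grows with the chance.

```java
import java.util.Random;

public class RandomRowFilterSketch {

    // Hypothetical helper: counts the rows that survive a filter which keeps
    // each row independently with probability `chance`.
    static int countPassing(int totalRows, float chance, long seed) {
        Random random = new Random(seed);
        int kept = 0;
        for (int i = 0; i < totalRows; i++) {
            // nextFloat() returns a value in [0.0, 1.0), so a chance of 0.0f
            // (the default value of an unset float field) never keeps a row.
            if (random.nextFloat() < chance) {
                kept++;
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        int rows = 100_000;
        for (float chance : new float[] {0.0f, 0.09f, 0.5f, 0.99f}) {
            System.out.println("chance=" + chance + " kept="
                + countPassing(rows, chance, 42L) + " of " + rows);
        }
    }
}
```

This only models how many rows pass the filter. It says nothing about the server-side cost of reading the rows that get dropped, so it should not be read as a throughput prediction.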
Oh! I see! KeyOnlyFilter is overwriting the RandomRowFilter! Bad. I
mean, too bad I did not figure that out. Thanks for pointing that out. That
definitely explains the difference in performance.
I have activated the bloomfilters with this code:
HBaseAdmin admin = new HBaseAdmin(config);
HTable table =
> Seems blocks you are getting from cache.
>>> You can check with Blooms also once.
>>>
>>> You can enable the usage of bloom using the config param
>>> "io.storefile.bloom.enabled" set to true . This will enable the usage of
>>> bloom globally
>> Now you need to set the bloom type for your CF
>> HColumnDescriptor#setBloomFilterType() U can check with type
>> BloomType.ROW
>>
>> -Anoop-
>>
>> _
>>
> From: Jean-Marc Spaggiari [jean-m...@spaggiari.org]
> Sent: Thursday, June 28, 2012 5:42 PM
> To: user@hbase.apache.org
> Subject: Re: Scan vs Put vs Get
>
lue)results[i]).isEmptyColumn())
> System.out.println("Result[" + i + "]: " +
> results[i]); // co
> BatchExample-9-Dump Print all results.
> }
>
> 2012/6/28, Ramkrishna.S.Vasudevan :
>> Hi
>>
>> You can also check the cac
type for your CF
HColumnDescriptor#setBloomFilterType() U can check with type BloomType.ROW
-Anoop-
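As background on why a ROW bloom can help random Gets, here is a minimal plain-Java sketch of a Bloom filter keyed by row key. It is not HBase's implementation: the class name `RowBloomSketch`, the bit-array size, and the two simple hash functions are all assumptions chosen for illustration. The property that matters is that a "no" answer is always correct, so a read can skip a file that certainly does not contain the row, while a "yes" answer may be a false positive.

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;
import java.util.BitSet;

public class RowBloomSketch {
    private final BitSet bits;
    private final int size;
    private final int hashCount;

    RowBloomSketch(int size, int hashCount) {
        if (size <= 0 || hashCount <= 0) {
            throw new IllegalArgumentException("size and hashCount must be positive");
        }
        this.bits = new BitSet(size);
        this.size = size;
        this.hashCount = hashCount;
    }

    // Double hashing: combine two simple hashes to derive the i-th bit index.
    private int index(byte[] key, int i) {
        int h1 = Arrays.hashCode(key);
        int h2 = 0x9E3779B9;
        for (byte b : key) {
            h2 = (h2 ^ b) * 16777619;
        }
        return Math.floorMod(h1 + i * h2, size);
    }

    void add(String rowKey) {
        byte[] key = rowKey.getBytes(StandardCharsets.UTF_8);
        for (int i = 0; i < hashCount; i++) {
            bits.set(index(key, i));
        }
    }

    // false means "definitely absent"; true means "possibly present".
    boolean mightContain(String rowKey) {
        byte[] key = rowKey.getBytes(StandardCharsets.UTF_8);
        for (int i = 0; i < hashCount; i++) {
            if (!bits.get(index(key, i))) {
                return false;
            }
        }
        return true;
    }

    public static void main(String[] args) {
        RowBloomSketch bloom = new RowBloomSketch(1 << 16, 3);
        for (int i = 0; i < 1000; i++) {
            bloom.add("row-" + i);
        }
        System.out.println("row-42 might be present: " + bloom.mightContain("row-42"));
        int falsePositives = 0;
        for (int i = 0; i < 1000; i++) {
            if (bloom.mightContain("absent-" + i)) {
                falsePositives++;
            }
        }
        System.out.println("false positives among 1000 absent keys: " + falsePositives);
    }
}
```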
_
From: Jean-Marc Spaggiari [jean-m...@spaggiari.org]
Sent: Thursday, June 28, 2012 5:42 PM
To: user@hbase.apache.org
Subject: Re: Scan vs Put vs Get
Oh! I ne
[mailto:jean-m...@spaggiari.org]
>> Sent: Thursday, June 28, 2012 4:44 PM
>> To: user@hbase.apache.org
>> Subject: Re: Scan vs Put vs Get
>>
>> Wow. First, thanks a lot all for jumping into this.
>>
>> Let me try to reply to everyone in a single post.
>
> > How many Gets you batch together in one call
> I tr
2012/6/28, Ramkrishna.S.Vasudevan :
> Hi
>
> You can also check the cache hit and cache miss statistics that appear on
> the UI?
>
> In your random scan how many Regions are scanned whereas in gets may be
> many
> due to
, June 28, 2012 2:00 PM
> To: user@hbase.apache.org
> Subject: Re: Scan vs Put vs Get
>
> Hi Jean-Marc,
>
> Interesting :-)
>
> Added to Anoop questions:
>
> What's the hbase version you're using?
>
> Is it repeatable, I mean if you try twice the
Hi Jean-Marc,
Interesting :-)
Added to Anoop questions:
What's the hbase version you're using?
Is it repeatable, I mean if you try twice the same "gets" with the
same client do you have the same results? I'm asking because the
client caches the locations.
If the locations are wrong (region
Hi
How many Gets do you batch together in one call? Is this equal to the
Scan#setCaching() value that you are using?
If both are the same, you can be sure that the number of network calls is almost
the same.
Also you are giving random keys in the Gets. The scan will be always
sequential. Seems in your ge
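The comparison between the Get batch size and the scan caching value can be checked with simple arithmetic. The sketch below is plain Java and does not call HBase: `RoundTripSketch` and `roundTrips` are hypothetical names, and the model assumes one network call per full or partial batch. Real clients add overhead that this ignores, for example a batch of Gets being split across region servers, so treat the numbers as a rough lower bound.

```java
public class RoundTripSketch {

    // Hypothetical helper: network calls needed to fetch `rows` rows when
    // each call returns at most `rowsPerCall` rows (ceiling division).
    static long roundTrips(long rows, int rowsPerCall) {
        if (rows < 0 || rowsPerCall <= 0) {
            throw new IllegalArgumentException("rows must be >= 0 and rowsPerCall > 0");
        }
        return (rows + rowsPerCall - 1) / rowsPerCall;
    }

    public static void main(String[] args) {
        long rows = 100_000;
        int getBatchSize = 1_000;   // assumed number of Gets per batched call
        int scanCaching = 1_000;    // assumed value passed to Scan#setCaching
        System.out.println("batched gets: " + roundTrips(rows, getBatchSize) + " calls");
        System.out.println("scan:         " + roundTrips(rows, scanCaching) + " calls");
    }
}
```

When both values are equal, the call counts match, which leaves the random versus sequential key access as the remaining difference between the two tests.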