Re: strange get_range_slices behaviour v0.6.1

2010-05-02 Thread aaron morton
He there, I'm still getting odd behavior with get_range_slices. I've created a JUNIT test that illustrates the case. Could someone take a look and either let me know where my understanding is wrong or is this is a real issue? I added the following to ColumnFamilyStoreTest.java private

Bootstrap source code

2010-05-02 Thread Bill Hastings
Hi I have looking at the bootstrap source and seem to understand it for the most part. This is what I do not follow: (1) New node joins and doesn't advertise its token. (2) Requests nodes to send it data. The nodes that need to send it data first flush memtables and then transfer SSTables. Once

Row slice / cache performance

2010-05-02 Thread James Golick
1. I wrote the list a while back about less-than-great performance when reading thousands of columns even on cache hits. Last night, I decided to try to get to the bottom of why. I tested this by setting the row cache capacity on a TimeUUIDType-sorted CF to 10, filling up a single row wi

Re: Row slice / cache performance

2010-05-02 Thread James Golick
Not sure why the first paragraph turned in to a numbered bullet... On Sun, May 2, 2010 at 11:00 AM, James Golick wrote: > >1. I wrote the list a while back about less-than-great performance when >reading thousands of columns even on cache hits. Last night, I decided to >try to get to

Search Sample and Relation question because UDDI as Key

2010-05-02 Thread CleverCross | Falk Wolsky
Hello, 1) Can you provide a solution or a sample for searching (Column and SuperColumn) (Fulltext). What is the Way to realize this? Hadoop/MapReduce? See you a posibility to build/use a index for columns? Why this: In a given Data-Model we "must" use UUIDs as Key and have actually no chance t

How do you, Bloom filter of the false positive rate or remove the problem of distributed databases?

2010-05-02 Thread Kauzki Aranami
Hi This data structure recognizes to the way based on the idea of Eventually Consistency of BASE though Bloom filter is adopted for the data structure in Cassandra as shape to allow no limited adjustment. In a word, there is a problem of generating the false positive rate. Moreover, data is delet

Re: Row slice / cache performance

2010-05-02 Thread James Golick
Just an update on this. I wrote a patch which attempts to solve this problem by keeping an index of columns that are marked for deletion to avoid having to iterate over the whole column set and call columns_.get() over and over again. My patch works, and the time spent in removeDeleted() is now cl

Re: Row slice / cache performance

2010-05-02 Thread James Golick
Got a ~50% improvement by making UUID comparison less heavy-weight. https://issues.apache.org/jira/browse/CASSANDRA-1043 On Sun, May 2, 2010 at 7:49 PM, James Golick wrote: > Just an update on this. I wrote a patch which attempts to solve this > problem by keeping an index of columns that are m

Re: inserting new rows with one key vs. inserting new columns in a row performance

2010-05-02 Thread Даниел Симеонов
Hello, It seems that I have experienced network problems (local pre-installed firewall) and some rest http inefficiencies, so I think that it behaves the same in both cases. I am sorry to have taken from your time. Best regards, Daniel. На 30 април 2010 20:46, Даниел Симеонов написа: > Hi, >