Re: Compaction and total disk space used for highly overwritten CF

2011-10-06 Thread aaron morton
You will only have tombstones in your data if you issue deletes. What you are seeing is an artifact of the fundamental way Cassandra stores data. Once data is written to disk it is never modified. If you overwrite a column value that has already been committed to disk the old value is not chang

Hinted handoff question

2011-10-06 Thread Rene Kochen
I'am using Cassandra 0.7.7 and have a question about hinted handoff. I have a cluster of three nodes. I stop node 3. I see that the hint count for node 3 increases on node 1 (countPendingHints = 28709). However, when I start node 3 again, I cannot see anything in the log regarding hinted handoff

Re: Hinted handoff question

2011-10-06 Thread aaron morton
Check that node 1 sees node 3 as UP (via ring). Check the tpstats on node 1, is there an active HH task ? Take another crawl through the logs. Cheers - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 6/10/2011, at 10:35 PM, Rene Kochen

RE: Hinted handoff question

2011-10-06 Thread Rene Kochen
Node 3 is up (using ring on node 1). There is no HH task (active = 0, pending = 0, completed = 0, blocked = 0). This is the log from node 1 when nodes 3 starts: 2011-10-06 12:36:54,985 INFO 12:36:54,985 Node /172.16.108.19 has restarted, now UP again 2011-10-06 12:36:54,985DEBUG 12:36:54,985 No

Batch writes getting slow

2011-10-06 Thread Stefan Reek
Hi guys, We're currently testing an application against a very high load, which runs against Cassandra 0.6.13 (I know, we just never got the time to upgrade). The nature of our app is that it will write to two different SuperColumnFamilies in bursts, and to some other columnfamilies less frequent

Re: Batch writes getting slow

2011-10-06 Thread Jonathan Ellis
On Thu, Oct 6, 2011 at 10:09 AM, Stefan Reek wrote: > I can see that during the times the writing gets slow there are ~3000 > pending tasks, but they disappear quickly. Your best bet is to make the write load more constant and less bursty. If you really do need to handle bursts like that with lo

Re: Batch writes getting slow

2011-10-06 Thread Stefan Reek
On 10/06/2011 05:26 PM, Jonathan Ellis wrote: On Thu, Oct 6, 2011 at 10:09 AM, Stefan Reek wrote: I can see that during the times the writing gets slow there are ~3000 pending tasks, but they disappear quickly. Your best bet is to make the write load more constant and less bursty.

Re: Batch writes getting slow

2011-10-06 Thread Jonathan Ellis
On Thu, Oct 6, 2011 at 10:53 AM, Stefan Reek wrote: > We do have the commitlogs on separate devices, are there any other basics > that I could have forgotten, or > any parameters that are important for write performance? 1.0 write performance is something like 30% better... I don't think there's

read on multiple SS tables

2011-10-06 Thread Ramesh Natarajan
Lets assume I perform frequent insert & update on a column family.. Over a period of time multiple sstables will have this row/column data. I have 2 questions about how reads work in cassandra w.r.t. multiple SS tables. -If you perform a query for a specific row key and a column name, does it read

CF design

2011-10-06 Thread Courtney Robinson
I was hoping someone could share their opinions on the following CF designs or suggest a better way of doing it. My app is constantly receiving new data that contains URLs. I was thinking of hashing this URL to form a key. The data is a JSON object with several properties. For now many of its pr

cassandra.bat install

2011-10-06 Thread Stephen Pope
I've got the 1.0 rc2 binaries, but it looks like somebody forgot to include the Apache Daemon in the zip. According to the batch file there should be a bin\daemon directory, with a prunsrv executable in there. Cheers, Steve

Re: cassandra.bat install

2011-10-06 Thread Ben Coverston
It looks like it's missing from the binary distribution. If you download the source distribution it will be present. I'll see if I can get it included in the source build. Ben On Thu, Oct 6, 2011 at 2:07 PM, Stephen Pope wrote: > I’ve got the 1.0 rc2 binaries, but it looks like somebody forgot

Re: Hinted handoff question

2011-10-06 Thread aaron morton
At debug level you should see some log messages such as… "Checking remote schema before delivering hints…" "Sleeping {}ms to stagger hint delivery…" "Endpoint {} died before hint delivery, aborting…" "Started hinted handoff for endpoint…" If you want to move on, you can deliver the hints using JM

Re: read on multiple SS tables

2011-10-06 Thread aaron morton
> -If you perform a query for a specific row key and a column name, does > it read the most recent SSTable first and if it finds a hit, does it > stop there or does it need to read through all the SStables (to find > most recent one) regardless of whether if found a hit on the most > recent SSTable

Re: read on multiple SS tables

2011-10-06 Thread Brandon Williams
On Thu, Oct 6, 2011 at 3:56 PM, aaron morton wrote: > -If you perform a query for a specific row key and a column name, does > it read the most recent SSTable first and if it finds a hit, does it > stop there or does it need to read through all the SStables (to find > most recent one) regardless o

Re: Solandra distributed search

2011-10-06 Thread Jacob, Arun
does the Solandra specific partitioner distribute data relatively equally across nodes? Is this influenced by the shards.at.once property? If I'm writing to 3 nodes, how would the default setting of 4 for this property affect the distribution of data across my nodes? From: Jake Luciani mailt

Hector has a website

2011-10-06 Thread Patricio Echagüe
Hi, I wanted to let you all know that Hector client has a website. http://hector-client.org There are links to documentation, Javadoc and resources from the community. If you have a personal blog and want us to include the link, let us know. Feedback is always welcome. Thanks! Hector Team.

Copy data from 0.7.4 to 0.8

2011-10-06 Thread Cassa L
Hi, I want to transfer data from a ring which is on 0.7.4 to the separate ring running on 0.8. This ring does not even have schema definition of the data available on 0.7.4. What is the best way to copy data and schema from 0.7 cluster to 0.8. Do I need to define schema manually and then copy ssT

Re: Copy data from 0.7.4 to 0.8

2011-10-06 Thread Jonathan Ellis
Copy the schema ("show schema" from the cli), then copy the sstables using the bulk loader (http://www.datastax.com/dev/blog/bulk-loading). You might have to migrate the schema manually (copy the Schema and Migration system column families) if show schema doesn't work against a 0.7 Cassandra. On

Re: Solandra distributed search

2011-10-06 Thread Jake Luciani
The partitioner that comes with Solandra keeps groups of documents together on the same node. The number of documents that stick together and define a "shard" is set by the solandra.maximum.docs.per.shard property. The shards.at.once property relates to throughput. If you are indexing to one shar

Solandra: connection refused errors

2011-10-06 Thread Jacob, Arun
I'm seeing this error when trying to insert data into a core I've defined in Solandra INFO [pool-7-thread-319] 2011-10-06 16:21:34,328 HttpMethodDirector.java (line 445) Retrying request INFO [pool-7-thread-1070] 2011-10-06 16:21:34,328 HttpMethodDirector.java (line 445) Retrying request INFO [

invalid shard name encountered

2011-10-06 Thread Jacob, Arun
I'm seeing this in my logs: WARN [1832199239@qtp-673795938-0] 2011-10-06 16:15:46,424 CassandraIndexManager.java (line 364) invalid shard name encountered: WDPRO-NGELOG-DEV 1 WDPRO-NGELOG-DEV is the name of the index I'm creating. Is there a restriction on characters in the name?

Re: Hector has a website

2011-10-06 Thread Jérémy SEVELLEC
very good job! 2011/10/7 Patricio Echagüe > Hi, I wanted to let you all know that Hector client has a website. > > http://hector-client.org > > There are links to documentation, Javadoc and resources from the community. > > If you have a personal blog and want us to include the link, let us know