upgrade from 0.7.6 to 0.8.4

2011-08-15 Thread Jonathan Colby
Hi - sorry if this was asked before but I couldn't find any answers about it. Is the upgrade path from 0.7.6 to 0.8.4 possible via a simple rolling restart? Are nodes with these different versions compatible - i.e., can one node be upgraded in order to see if we run into any problems before

Re: Expiring Columns

2011-08-15 Thread Edward Capriolo
On Mon, Aug 15, 2011 at 6:51 PM, Stephen McKamey wrote: > I'm curious about Expiring Columns. Say I create a Column Family where > *all* of the Columns are set to be expiring columns. When a row's entire set > of columns have expired, will an empty row it sill be returned in range > queries? Or wi

Re: CQL query using 'OR' in WHERE clause

2011-08-15 Thread Jonathan Ellis
Disjunctions are not yet supported and probably will not be until after 1.0. On Mon, Aug 15, 2011 at 6:45 PM, Deeter, Derek wrote: > Hi, > > We are using CQL to obtain data from Cassandra 0.8.1 using Hector and > getting an error when using ‘OR’ on a secondary index.  I get the same error > when

Re: performance problems on new cluster

2011-08-15 Thread aaron morton
Just checking do you have read_repair_chance set to something ? The second request is going to all replicas which should only happen with CL ONE if read repair is running for the request. The exceptions are happening during read repair which is running async to the main read request. It's occu

Re: Max heap not sticking?

2011-08-15 Thread Ian Danforth
False alarm, typo further down in cassandra-env.sh was removing all the opts. Ian On Mon, Aug 15, 2011 at 6:08 PM, Ian Danforth wrote: > All, > > When I connect to a node through jconsole it's telling me that my max > heap is only 1.7gb. (eg http://screencast.com/t/7DP8ovdUv) However I > bel

Max heap not sticking?

2011-08-15 Thread Ian Danforth
All, When I connect to a node through jconsole it's telling me that my max heap is only 1.7gb. (eg http://screencast.com/t/7DP8ovdUv) However I believe I have properly specified that it should be 4GB in cassandra-env.sh. Total memory is 7.5GB I see greatly increased GC activity as the heap appr

Cassandra for numerical data set

2011-08-15 Thread Yi Yang
Dear all, I wanna report my use case, and have a discussion with you guys. I'm currently working on my second Cassandra project. I got into somehow a unique use case: storing traditional, relational data set into Cassandra datastore, it's a dataset of int and float numbers, no more strings, n

CQL query using 'OR' in WHERE clause

2011-08-15 Thread Deeter, Derek
Hi, We are using CQL to obtain data from Cassandra 0.8.1 using Hector and getting an error when using 'OR' on a secondary index. I get the same error when using CQL 1.0.3. All the items in the WHERE clause are secondary indices and they are all UTF8Type validation. The query works when leaving

Re: Expiring Columns

2011-08-15 Thread aaron morton
I believe (have not tested) that you would still see the range ghosts talked about here http://wiki.apache.org/cassandra/FAQ#range_ghosts until compaction had removed all the columns, and the row once all the columns are gone. Expired columns are purged during compaction when their ttl runs out.

Expiring Columns

2011-08-15 Thread Stephen McKamey
I'm curious about Expiring Columns. Say I create a Column Family where *all* of the Columns are set to be expiring columns. When a row's entire set of columns have expired, will an empty row it sill be returned in range queries? Or will it just be nicely compacted away?

Re: Scalability question

2011-08-15 Thread Teijo Holzer
Hi, we have come across this as well. We run continuously run rolling repairs followed by major compactions followed by a gc() (or node restart) to get rid of all these sstables files. Combined with aggressive ttls on most inserts, the cluster stays nice and lean. You don't want your working

Re: Cassandra in Multiple Datacenters Active - Standby configuration

2011-08-15 Thread Jeremiah Jordan
Assign the tokens like they are two separate rings, just make sure you don't have any duplicate tokens. http://wiki.apache.org/cassandra/Operations#Token_selection The two datacenters are treated as separate rings, LOCAL_QUORUM will only delay the client as long as it takes to write the data to

Cassandra in Multiple Datacenters Active - Standby configuration

2011-08-15 Thread Oleg Tsvinev
Hi all, I have a question that documentation has not clear answer for. I have the following requirements: 1. Synchronously store data in datacenter DC1 on 2+ nodes 2. Asynchronously replicate the same data to DC2 and store it on 2+ nodes to act as a hot standby Now, I have configured keyspaces w

Re: Scalability question

2011-08-15 Thread Jonathan Ellis
This is more an artifact of repair's problems than compaction per se. We're addressing these in https://issues.apache.org/jira/browse/CASSANDRA-2816 and https://issues.apache.org/jira/browse/CASSANDRA-2280. On Mon, Aug 15, 2011 at 3:06 PM, Philippe wrote: >> It's another reason to avoid major / m

Re: CompositeType

2011-08-15 Thread Benoit Perroud
You should give a look at https://github.com/edanuff/CassandraIndexedCollections This is a rather good starting point for Composites. 2011/8/15 Stephen Pope : >  Hey, is there any documentation or examples of how to use the CompositeType? > I can't find anything about it on the wiki or the datas

Re: Scalability question

2011-08-15 Thread Philippe
Forgot to mention that stopping & restarting the server brought the data directory down to 283GB in less than 1 minute. Philippe 2011/8/15 Philippe > It's another reason to avoid major / manual compactions which create a >> single big SSTable. Minor compactions keep things in buckets which mea

Re: Scalability question

2011-08-15 Thread Philippe
> > It's another reason to avoid major / manual compactions which create a > single big SSTable. Minor compactions keep things in buckets which means > newer SSTable can be compacted needing to read the bigger older tables. > I've never run a major/manual compaction on this ring. In my case runni

Re: Solandra multiple schemas

2011-08-15 Thread Ashley Martens
Multiple cores it is. Thanks.

Re: Solandra distributed search

2011-08-15 Thread Jake Luciani
Solandra manages the "shard" parameters for you. you don't need to specify anything. On Mon, Aug 15, 2011 at 3:00 PM, Jeremiah Jordan < jeremiah.jor...@morningstar.com> wrote: > When using Solandra, do I need to use the Solr sharding synxtax in my > queries? I don't think I do because Cassandra i

Solandra distributed search

2011-08-15 Thread Jeremiah Jordan
When using Solandra, do I need to use the Solr sharding synxtax in my queries? I don't think I do because Cassandra is handling the "sharding", not Solr, but just want to make sure. The Solandra wiki references the distributed search limitations, which talks about the shard syntax further down

Re: Solandra multiple schemas

2011-08-15 Thread Jake Luciani
You want the solandra data stored under two keyspaces? Or you just want two different logical indexes. The former requires changing the keyspace name located in solandra.properties but you can only access one per process. The latter would involve creating two different solr cores at different end

Re: CassandraUnit

2011-08-15 Thread Jonathan Ellis
Thanks, Jérémy! 2011/8/15 Jérémy SEVELLEC : > Hi all, > I have published some documentations on it : > https://github.com/jsevellec/cassandra-unit/wiki > regards > Jérémy > Le 10 août 2011 23:20, Jérémy SEVELLEC a écrit : >> >> Hi everyone, >> Let me present you CassandraUnit, a test framework to

Re: CassandraUnit

2011-08-15 Thread Jérémy SEVELLEC
Hi all, I have published some documentations on it : https://github.com/jsevellec/cassandra-unit/wiki regards Jérémy Le 10 août 2011 23:20, Jérémy SEVELLEC a écrit : > Hi everyone, > > Let me present you CassandraUnit, a test framework to develop application > with Cassandra backend in TDD S

Solandra multiple schemas

2011-08-15 Thread Ashley Martens
Does Solandra support multiple schemas? For example I have staging and test data in two different keyspaces in Cassandra and want that echoed in Solandra. Possible?

CompositeType

2011-08-15 Thread Stephen Pope
Hey, is there any documentation or examples of how to use the CompositeType? I can't find anything about it on the wiki or the datastax docs. Cheers, Steve

Re: Planet Cassandra is now live

2011-08-15 Thread Konstantin Naryshkin
Thanks. I did not see a link to it when I was sending my message. - Original Message - From: "Zhu Han" To: user@cassandra.apache.org Sent: Saturday, August 13, 2011 12:11:37 AM Subject: Re: Planet Cassandra is now live On Sat, Aug 13, 2011 at 4:35 AM, Konstantin Naryshkin < konstant

Re: Internal error processing get_range_slices

2011-08-15 Thread Jonathan Ellis
The count you specify is the worst case, so if you can't even allocate a List to handle it, you shouldn't be specifying such a high count. Better find that out immediately, then when your data set grows in production. On Mon, Aug 15, 2011 at 8:15 AM, Patrik Modesto wrote: > On Mon, Aug 15, 2011 a

Re: Cassandra Certification

2011-08-15 Thread Edward Capriolo
A friends friend developed the FreeBSD certification. It is a actually a difficult process either you need to give thousands of dollars to a place like prometrics or you need to have people across the world that can administer the test. You also need to design and keep changing the test because it

Re: Internal error processing get_range_slices

2011-08-15 Thread Patrik Modesto
On Mon, Aug 15, 2011 at 15:09, Jonathan Ellis wrote: > On Mon, Aug 15, 2011 at 7:13 AM, Patrik Modesto > wrote: >> PS: while reading the email before I'd send it, I've noticed the >> keyRange.count =... is it possible that Cassandra is preallocating >> some internal data acording the KeyRange.cou

Re: Merged counter shard with a count != 0

2011-08-15 Thread Jonathan Ellis
Can you create a bug report on https://issues.apache.org/jira/browse/CASSANDRA ? On Mon, Aug 15, 2011 at 2:24 AM, Philippe wrote: >> Did you try what it says to do first? "You need to restart this node >> with -Dcassandra.renew_counter_id=true to fix." > > Yes I did and it still logged that error

Re: Internal error processing get_range_slices

2011-08-15 Thread Jonathan Ellis
On Mon, Aug 15, 2011 at 7:13 AM, Patrik Modesto wrote: > PS: while reading the email before I'd send it, I've noticed the > keyRange.count =... is it possible that Cassandra is preallocating > some internal data acording the KeyRange.count parameter? That's exactly what it does. -- Jonathan Ell

Internal error processing get_range_slices

2011-08-15 Thread Patrik Modesto
Hi, on our dev cluster of 4 cassandra nodes 0.7.8 I'm suddenly getting: ERROR 13:40:50,848 Internal error processing get_range_slices java.lang.OutOfMemoryError: Java heap space at java.util.ArrayList.(ArrayList.java:112) at org.apache.cassandra.service.StorageProxy.getRangeSlice

Re: Cassandra Certification

2011-08-15 Thread aaron morton
Depending on where in the world you are, keep an eye / ear out for Data Stax training http://www.datastax.com/events Cheers - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 15/08/2011, at 5:56 PM, Joe Stein wrote: > Certification is good

Re: performance problems on new cluster

2011-08-15 Thread Anton Winter
OK, node latency is fine and you are using some pretty low consistency. You said NTS with RF 2, is that RF 2 for each DC ? Correct, I'm using RF 2 for each DC. I was able to reproduce the cli timeouts on the non replica nodes. The debug log output from dc1host1 (non replica node): DEBUG [po

Re: Merged counter shard with a count != 0

2011-08-15 Thread Philippe
> > It looks like the error was thrown during a minor compaction. There should > be a log line from the CompactionManager before hand that says "Compacting…" > and lists the SSTables it is going to compact. Check that it's from the same > thread, i.e. [CompactionExecutor:6] in the example below > O

Re: Merged counter shard with a count != 0

2011-08-15 Thread Philippe
> > Did you try what it says to do first? "You need to restart this node > with -Dcassandra.renew_counter_id=true to fix." > Yes I did and it still logged that error upon restarting. I'm loath to removing the SSTable as every single repair I run on any node is streaming data because of out of sync