Re: reduced cached mem; resident set size growth

2011-01-28 Thread Zhu Han
On Fri, Jan 28, 2011 at 4:15 PM, Oleg Anastasyev wrote: > > > > http://img24.imageshack.us/img24/1754/cassandrarss.png > > > This looks like cassandra leaking memory inside java heap. > I remember, there was some leaking issues with java versions <1.6.u21, > correct > me if I wrong. Try to upgrad

Cassandra and count

2011-01-28 Thread Sébastien Druon
Hello, I have a question concerning count in cassandra, as I would like to count the rows of a CF: - is it mandatory to specify a range? - what is the cost of a count operation on a CF? Thanks in advance for the answers Sebastien

Re: reduced cached mem; resident set size growth

2011-01-28 Thread Zhu Han
Chris, Somebody else and I have the same problem as you, and reported it here: http://www.apacheserver.net/Very-high-memory-utilization-not-caused-by-mmap-on-sstables-at1082970.htm [NB: It is not solved although the titles said so. Some response from me in the thread is not accurate.] IMHO, you

Secondary Index information

2011-01-28 Thread Sasha Dolgy
Hi there, Where can I find information regarding secondary indexes? Spent the past 2 days looking for some good details. http://wiki.apache.org/cassandra/SecondaryIndexes doesn't yet exist, although it's referenced from http://wiki.apache.org/cassandra/StorageConfiguration Trying to understand

Re: Secondary Index information

2011-01-28 Thread Jake Luciani
http://www.datastax.com/blog/whats-new-cassandra-07-secondary-indexes On Fri, Jan 28, 2011 at 7:15 AM, Sasha Dolgy wrote: > Hi there, > > Where can I find information regarding secondary indexes? Spent the > past 2 days looking for some good details. > > http://wiki.apache.org/cassandra/Seconda

Re: Secondary Index information

2011-01-28 Thread Sasha Dolgy
Thank you. So, after reading, I'm still unsure if this feature will afford me a larger benefit when compared to an inverted index solution. Has anyone done a pros / cons ? -sd On Fri, Jan 28, 2011 at 3:22 PM, Jake Luciani wrote: > http://www.datastax.com/blog/whats-new-cassandra-07-secondary-

Re: Secondary Index information

2011-01-28 Thread Victor Kabdebon
Dear Sasha, I am currently thinking about using secondary index in the future. I have seen two pros : _Simplicity, it is "more simple" to query based on a second index than going for a first column then a second. _ "Consistency" : depending on where you store your inverted index, it may be unavail

Re: Secondary Index information

2011-01-28 Thread Sasha Dolgy
Hi Victor, In my research and playing around with nosql, specifically cassandra, I see the true benefit in defining search requirements and then creating CF's and the hierarchy based on this. So for me, I see quite a bit of simplicity in defining alternate CF's to allow me to efficiently find a s

unsubscribe

2011-01-28 Thread S Ahmed

Re: reduced cached mem; resident set size growth

2011-01-28 Thread sridhar basam
On Thu, Jan 27, 2011 at 12:23 PM, Chris Burroughs wrote: > java -version > java version "1.6.0_20" > Java(TM) SE Runtime Environment (build 1.6.0_20-b02) > Java HotSpot(TM) 64-Bit Server VM (build 16.3-b01, mixed mode) > > cmd line arg (paths edited): > /usr/java/jdk1.6.0_20/bin/java -Xms1500M -c

Re: reduced cached mem; resident set size growth

2011-01-28 Thread Chris Burroughs
On 01/28/2011 10:51 AM, sridhar basam wrote: > On Thu, Jan 27, 2011 at 12:23 PM, Chris Burroughs > wrote: > >> java -version >> java version "1.6.0_20" >> Java(TM) SE Runtime Environment (build 1.6.0_20-b02) >> Java HotSpot(TM) 64-Bit Server VM (build 16.3-b01, mixed mode) >> >> cmd line arg (path

Re: Cassandra and count

2011-01-28 Thread buddhasystem
As far as I know, there are no aggregate operations built into Cassandra, which means you'll have to retrieve all of the data to count it in the client. I had a thread on this topic 2 weeks ago. It's pretty bad. -- View this message in context: http://cassandra-user-incubator-apache-org.3065146

Re: Cassandra and count

2011-01-28 Thread Victor Kabdebon
buddhasystem is right. A count returns the columns to the client, which counts them. My advice: do not count big columns / supercolumns. People in the dev team are trying to develop distributed counters but I don't know the state of this research. Best regards, Victor Kabdebon http://www.voxnucleus.fr 20

Re: reduced cached mem; resident set size growth

2011-01-28 Thread Jake Luciani
Are you using a row cache? If so, what is it set to? In general it should not be a percentage. On Thu, Jan 27, 2011 at 12:23 PM, Chris Burroughs wrote: > We have a 6 node Cassandra 0.6.8 cluster running on boxes with 4 GB of > RAM. Over the course of several weeks cached memory slowly decreases

Re: reduced cached mem; resident set size growth

2011-01-28 Thread Chris Burroughs
On 01/28/2011 11:29 AM, Jake Luciani wrote: > Are you using a row cache? If so, what is it set to? In general it should > not be a percentage. > row_cache_size == row_cache_capacity before the start of RSS data collection. According to jconsole heap size is not growing larger than the

Re: Secondary Index information

2011-01-28 Thread B. Todd Burruss
batch_mutate doesn't guarantee consistency. each mutation in the batch is guaranteed to be consistent based on your CL, but if it returns an error it means that it couldn't complete all mutations ... but the converse isn't true. it may have successfully completed some mutations. if you get a
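A minimal sketch of how a client might cope with the partial-failure case described above: resend the whole batch until it succeeds. It assumes the mutations are idempotent (for example, each one carries a fixed client-side timestamp), so re-applying an already-applied mutation is harmless. `send_batch`, `make_flaky_store` and the dict-shaped mutations are hypothetical stand-ins for illustration, not the Thrift batch_mutate API.

```python
from typing import Callable, Dict, Sequence, Tuple

Mutation = Dict[str, str]


def send_with_retry(send_batch: Callable[[Sequence[Mutation]], None],
                    mutations: Sequence[Mutation],
                    attempts: int = 3) -> int:
    """Resend the whole batch until one call succeeds.

    Returns the number of attempts used. Safe only if every mutation is
    idempotent, because a failed call may already have applied some of them.
    """
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    last_error = None
    for attempt in range(1, attempts + 1):
        try:
            send_batch(mutations)
            return attempt
        except Exception as exc:  # an error does not mean nothing was written
            last_error = exc
    raise RuntimeError("batch failed after %d attempts" % attempts) from last_error


def make_flaky_store(fail_times: int) -> Tuple[Dict[str, str], Callable]:
    """Hypothetical in-memory store whose first `fail_times` calls fail midway."""
    store: Dict[str, str] = {}
    calls = {"n": 0}

    def send_batch(mutations: Sequence[Mutation]) -> None:
        calls["n"] += 1
        for i, m in enumerate(mutations):
            if calls["n"] <= fail_times and i == 1:
                # The first mutation has already been applied at this point.
                raise IOError("simulated timeout after partial apply")
            store[m["key"]] = m["value"]

    return store, send_batch
```

Retrying the full batch, rather than trying to work out which mutations landed, keeps the client simple; it is only one possible approach and depends entirely on the idempotence assumption.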

Re: unsubscribe

2011-01-28 Thread Eric Evans
Unsubscribing from this list is easy, provided you read the instructions sent to you when confirming your subscription, or those conveniently located on the web. http://lmgtfy.com/?q=apache+mailing+lists+unsubscribe&l=1 If after reading these directions, you still require help getting off the

Re: unsubscribe

2011-01-28 Thread Sasha Dolgy
It's also in the header info of every email ... List-Help: List-Unsubscribe: On Fri, Jan 28, 2011 at 6:28 PM, Eric Evans wrote: > > Unsubscribing from this list is easy, provided you read the > instructions

Re: reduced cached mem; resident set size growth

2011-01-28 Thread sridhar basam
What about your permgen usage? Do you track that? Use something like "jstat -gc -t 5s 100" to track it. Or turn on verbose GC in your command line options to see what is happening. Sridhar On Fri, Jan 28, 2011 at 11:38 AM, Chris Burroughs wrote: > On 01/28/2011 11:29 AM, Jake Luciani wrote: >

Is it recommended to store two types of data (not related to each other but need to be retrieved together) in one super column family ?

2011-01-28 Thread Ertio Lew
Hi, I have two kinds of data that I would like to fit in one super column family; I am trying this, for the reasons of implementing fast database retrievals by combining the data of two rows into just one row. First kind of data, in supercolumn family, is named with timeUUIDs as supercolumn names

0.7.1 release

2011-01-28 Thread B. Todd Burruss
any word on when to expect 0.7.1? lots of good fixes we need. trying to decide if i should apply patches or wait. thx!

Re: 0.7.1 release

2011-01-28 Thread Jonathan Ellis
Eric just posted to -dev that he's putting a new set of artifacts together for vote. On Fri, Jan 28, 2011 at 12:48 PM, B. Todd Burruss wrote: > any word on when to expect 0.7.1?  lots of good fixes we need.  trying to > decide if i should apply patches or wait. > > thx! > -- Jonathan Ellis Pr

Re: Node going down when streaming data, what next?

2011-01-28 Thread aaron morton
nodetool removetoken or nodetool decommission http://wiki.apache.org/cassandra/Operations#Removing_nodes_entirely Hope that helps Aaron On 28 Jan 2011, at 11:30, buddhasystem wrote: > > OK, after running "repair" and waiting overnight the rebalancing worked and > now 3 nodes share the load as

Re: Node going down when streaming data, what next?

2011-01-28 Thread buddhasystem
Sorry Aaron but this doesn't help. As I said, machine is dead, kaput, finished. So I can't do "decommission". I can "remove token" to any other node, but -- the dead machine is going to hang around in my "ring" reports like a zombie. -- View this message in context: http://cassandra-user-incuba

[RELEASE] 0.6.11

2011-01-28 Thread Eric Evans
It seems like it was just earlier this week that we announced the release of 0.6.10. Oh wait, it was. In the time since though, CASSANDRA-2058[1] was found and fixed, and that seemed like reason enough to fast-track a new release. Source and binary archives are available from the Downloads page

Re: [RELEASE] 0.6.11

2011-01-28 Thread Chris Goffinet
+1 On Fri, Jan 28, 2011 at 3:13 PM, Eric Evans wrote: > > It seems like it was just earlier this week that we announced the > release of 0.6.10. Oh wait, it was. In the time since though, > CASSANDRA-2058[1] was found and fixed, and that seemed like reason > enough to fast-track a new release

Re: [RELEASE] 0.6.11

2011-01-28 Thread Chris Goffinet
Err. I meant, thanks Evan for getting this released so fast :) On Fri, Jan 28, 2011 at 3:18 PM, Chris Goffinet wrote: > +1 > > > On Fri, Jan 28, 2011 at 3:13 PM, Eric Evans wrote: > >> >> It seems like it was just earlier this week that we announced the >> release of 0.6.10. Oh wait, it was.

get_slice OOM on large row

2011-01-28 Thread Rajat Chopra
Hi! Trying to test the 0.7 release with some offbeat settings to check the behavior. - Single node cluster - Key_cache_size - default - Row_cache_size - default - Min/max compaction threshold - 0 (so this is disabled) - Disk_access_mode : stand

Re: Node going down when streaming data, what next?

2011-01-28 Thread Robert Coli
On Fri, Jan 28, 2011 at 1:51 PM, buddhasystem wrote: > > I can "remove token" to any other > node, but -- the dead machine is going to hang around in my "ring" reports > like a zombie. If you "remove token" on the other nodes and the dead machine "hangs around", that sounds like a bug? I haven't

Re: get_slice OOM on large row

2011-01-28 Thread Jonathan Ellis
Requesting too much data in a single request is user error. That is why you have start columns/rows, so you can page through a large set. On Fri, Jan 28, 2011 at 6:48 PM, Rajat Chopra wrote: > Hi! > >    Trying to test the 0.7 release with some offbeat settings to check the > behavior. > > > > -
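The paging approach mentioned above can be sketched roughly as follows. This is an illustration under stated assumptions, not Cassandra client code: `fetch_slice(start, count)` is a hypothetical callable standing in for a slice query that returns up to `count` (name, value) columns in sorted order beginning at `start` inclusive, with an empty `start` meaning the beginning of the row. `make_fake_row` is an in-memory stand-in for trying it out.

```python
from typing import Callable, Iterator, List, Tuple

Column = Tuple[str, str]


def page_columns(fetch_slice: Callable[[str, int], List[Column]],
                 page_size: int = 100) -> Iterator[Column]:
    """Yield every column of a row, fetching at most page_size per request."""
    if page_size < 2:
        # The start column is inclusive, so each later page repeats one column.
        raise ValueError("page_size must be at least 2")
    start = ""      # empty start: begin at the first column
    first = True
    while True:
        batch = fetch_slice(start, page_size)
        if not first:
            batch = batch[1:]   # drop the column already yielded last time
        if not batch:
            return
        yield from batch
        start = batch[-1][0]    # the next page starts at the last column seen
        first = False


def make_fake_row(columns: List[Column]) -> Callable[[str, int], List[Column]]:
    """Hypothetical in-memory row used only to exercise page_columns."""
    ordered = sorted(columns)

    def fetch_slice(start: str, count: int) -> List[Column]:
        return [c for c in ordered if c[0] >= start][:count]

    return fetch_slice
```

Because only one page is held at a time, memory use on the client stays bounded by `page_size` rather than by the width of the row.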

RE: get_slice OOM on large row

2011-01-28 Thread Rajat Chopra
Thanks Jonathan. But the read fails in all cases, even when start_column/end_column span is 10 columns here, and even when column_count is set appropriately. Or did I miss what you said? The trace seems to suggest an entire super_column is being deserialized. Rajat -Original Message- Fr

Re: Node going down when streaming data, what next?

2011-01-28 Thread buddhasystem
It does remove tokens, and the "ring" shows that the problematic node owns 0 tokens, which is OK. However, it's still there, listed. It's not a bug but kind of like a feature -- you can move that node back in two days later and "move" tokens in same or different way. What I wish happened was tha

Re: reduced cached mem; resident set size growth

2011-01-28 Thread Chris Burroughs
On 01/28/2011 12:42 PM, sridhar basam wrote: > What about your permgen usage? Do you track that? Use something like "jstat > -gc -t 5s 100" to track it. Or turn on verbose GC in your command > line options to see what is happening. > http://img59.imageshack.us/img59/1056/permgen.png This is ove

Re: reduced cached mem; resident set size growth

2011-01-28 Thread Chris Burroughs
On 01/28/2011 04:12 AM, Zhu Han wrote: > Chris, > > Somebody else and I have the same problem as you, and reported it here: > http://www.apacheserver.net/Very-high-memory-utilization-not-caused-by-mmap-on-sstables-at1082970.htm > > [NB: It is not solved although the titles said so. Some response

Re: get_slice OOM on large row

2011-01-28 Thread Jonathan Ellis
http://wiki.apache.org/cassandra/CassandraLimitations "Any request for a subcolumn deserializes _all_ the subcolumns in that supercolumn, so you want to avoid a data model that requires large numbers of subcolumns." On Fri, Jan 28, 2011 at 7:40 PM, Rajat Chopra wrote: > Thanks Jonathan. > But th

Cassandra events next week around Strata

2011-01-28 Thread Jonathan Ellis
Next week is the Strata conference and not one, not two, but five Cassandra events! In chronological order: 1. My Strata Cassandra tutorial Tuesday afternoon: http://strataconf.com/strata2011/public/schedule/detail/16911 2. Cassandra Meetup organized by Jeremy Hanna Tuesday evening: http://cassa

Re: Cassandra events next week around Strata

2011-01-28 Thread Roshan Dawrani
Hello. No India-based Cassandra / NoSQL events? :-( :-) On Sat, Jan 29, 2011 at 8:31 AM, Jonathan Ellis wrote: > Next week is the Strata conference and not one, not two, but five > Cassandra events! > > In chronological order: > > 1. My Strata Cassandra tutorial Tuesday afternoon: > http://str

Re: Cassandra events next week around Strata

2011-01-28 Thread B. Todd Burruss
web site says sold out, too bad for me ;) On 01/28/2011 07:01 PM, Jonathan Ellis wrote: Next week is the Strata conference and not one, not two, but five Cassandra events! In chronological order: 1. My Strata Cassandra tutorial Tuesday afternoon: http://strataconf.com/strata2011/public/schedul