Re: GCInspector info messages in cassandra log

2012-08-16 Thread Tamar Fraenkel
Thank you very much! *Tamar Fraenkel * Senior Software Engineer, TOK Media [image: Inline image 1] ta...@tok-media.com Tel: +972 2 6409736 Mob: +972 54 8356490 Fax: +972 2 5612956 On Thu, Aug 16, 2012 at 12:11 AM, aaron morton wrote: > Is there anything to do before that? like drain or

SSTable Index and Metadata - are they cached in RAM?

2012-08-16 Thread Maciej Miklas
Hi all, bloom filter for row keys is always in RAM. What about SSTable index, and Metadata? Is it cached by Cassandra, or it relays on memory mapped files? Thanks, Maciej

Re: Migrating to a new cluster (using SSTableLoader or other approaches)

2012-08-16 Thread Filippo Diotalevi
> > ERROR 09:02:38,614 Error in ThreadPoolExecutor > > java.lang.RuntimeException: java.io.EOFException: unable to seek to > > position 93069003 in /opt/analytics/analytics/chart-hd-104-Data.db > > (65737276 bytes) in read-only mode > > > This one looks like an error. > > Can you run nodeto

Re: indexing question related to playOrm on github

2012-08-16 Thread Hiller, Dean
Yes, the synch may work, and no, I do "not" want a transaction…I want a different kind of eventually consistent That might work. Let's say server 1 sends a mutation (65 is the pk) Remove: <65> Add <65> Server 2 also sends a mutation (65 is the pk) Remove: <65> Add <65> What everyone does not wa

wild card on query

2012-08-16 Thread Swathi Vikas
Hi, I am trying to run query on cassandra cluster with predicate on row key. I have column family called "Users" and rows with row key like "projectid_userid_photos". Each user within a project can have rows like projectid_userid_blog, projectid_userid_status and so on.  I want to retrieve all

Re: indexing question related to playOrm on github

2012-08-16 Thread Hiller, Dean
Maybe this would be a special type of column family that could contain these as my other tables definitely don't want the feature below by the way. Dean On 8/16/12 6:29 AM, "Hiller, Dean" wrote: >Yes, the synch may work, and no, I do "not" want a transactionŠI want a >different kind of eventual

nodetool repair uses insane amount of disk space

2012-08-16 Thread Michael Morris
Occasionally as I'm doing my regular anti-entropy repair I end up with a node that uses an exceptional amount of disk space (node should have about 5-6 GB of data on it, but ends up with 25+GB, and consumes the limited amount of disk space I have available) How come a node would consume 5x its nor

Many ParNew collections

2012-08-16 Thread Rene Kochen
Hi I have a cluster of 7 nodes: - Windows Server 2008 - Cassandra 0.7.10 - The nodes are identical (hardware, configuration and client request load) - Standard batch file with 8GB heap - I use disk_access_mode = standard - Random partitioner - TP stats shows no problems - Ring command shows no pr

Opscenter 2.1 vs 1.3

2012-08-16 Thread Robin Verlangen
Hi there, I just upgraded to opscenter 2.1 (from 1.3). It appears that my writes have tripled. Is this a change in the display/measuring of opscenter? Best regards, Robin Verlangen *Software engineer* * * W http://www.robinverlangen.nl E ro...@us2.nl Disclaimer: The information contained in th

C++ Bulk loader and Result set streaming.

2012-08-16 Thread Swathi Vikas
Hi All,   I am using C++ client libQtCassandra. I have two questions.   1) I want to bulk load data into cassandra through C++ interface. It is required by my group where i am doing internship. I could bulk load using sstableloader as specified in Datastax :http://www.datastax.com/dev/blog/bulk-

'WHERE' with several indexed columns

2012-08-16 Thread A J
Hi If I have a WHERE clause in CQL with several 'AND' and each column is indexed, which index(es) is(are) used ? Just the first field in the where clause or all the indexes involved in the clause ? Also is index used only with an equality operator or also with greater than /less than comparator as

Why the StageManager thread pools have 60 seconds keepalive time?

2012-08-16 Thread Guillermo Winkler
Hi, I have a cassandra cluster where I'm seeing a lot of thread trashing from the mutation pool. MutationStage:72031 Where threads get created and disposed in 100's batches every few minutes, since it's a 16 core server concurrent_writes is set in 100 in the cassandra.yaml. concurrent_writes: 10

Re: SSTable Index and Metadata - are they cached in RAM?

2012-08-16 Thread aaron morton
> What about SSTable index, Not sure what you are referring to there. Each row has a in a SStable has a bloom filter and may have an index of columns. This is not cached. See http://thelastpickle.com/2011/07/04/Cassandra-Query-Plans/ or http://www.slideshare.net/aaronmorton/cassandra-sf-2012-t

Re: Migrating to a new cluster (using SSTableLoader or other approaches)

2012-08-16 Thread Filippo Diotalevi
> > ERROR 09:02:38,614 Error in ThreadPoolExecutor > > java.lang.RuntimeException: java.io.EOFException: unable to seek to > > position 93069003 in /opt/analytics/analytics/chart-hd-104-Data.db > > (65737276 bytes) in read-only mode > > > This one looks like an error. > > Can you run nodeto

Re: Migrating to a new cluster (using SSTableLoader or other approaches)

2012-08-16 Thread aaron morton
> Which nodetool command are you referring to? (info, cfstats, ring,….) My bad. I meant to write sstableloader > Do I modify the log4j-tools.properties in $CASSANDRA_HOME/conf to set the > nodetool logs to DEBUG? You can use the --debug option with sstableloader to get a better exception message

Re: wild card on query

2012-08-16 Thread aaron morton
> I want to retrieve all the photos from all the users of certain project. My > sql like query will be "select projectid * photos from Users". How can i run > this kind of row key predicate while executing query on cassandra? You cannot / should not do that using the data model you have. (i.e. yo

Re: indexing question related to playOrm on github

2012-08-16 Thread aaron morton
>> I am not sure synchronization fixes thatŠŠIt would be kind of >> nice if the column <65> would not actually be removed until after >> all servers are eventually consistent... Not sure thats possible. You can either serialise updating your custom secondary index on the client site or resolv

Re: nodetool repair uses insane amount of disk space

2012-08-16 Thread aaron morton
What version are using ? There were issues with repair using lots-o-space in 0.8.X, it's fixed in 1.X Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 17/08/2012, at 2:56 AM, Michael Morris wrote: > Occasionally as I'm doing my regular ant

Re: Opscenter 2.1 vs 1.3

2012-08-16 Thread aaron morton
You may have better luck on the Data Stax forums http://www.datastax.com/support-forums/ Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 17/08/2012, at 4:36 AM, Robin Verlangen wrote: > Hi there, > > I just upgraded to opscenter 2.1 (fro

Re: C++ Bulk loader and Result set streaming.

2012-08-16 Thread aaron morton
> But i couldn't find any information on bulk loading using C++ client > interface. You cannot. To bulk load data use the sstableloader, otherwise you need to use the RPC / CQL API. > 2) I want to retrieve all the result of the query(not just first 100 result > set) using C++ client. Is there

Omitting empty columns from CQL SELECT

2012-08-16 Thread Mat Brown
Hello all, I've noticed that when performing a SELECT statement with a list of columns specified, Cassandra returns all columns in the resulting row(s) even if they have no value. This creates an apparently considerable amount of transport and deserialization overhead, particularly in one use case

Re: 'WHERE' with several indexed columns

2012-08-16 Thread aaron morton
> If I have a WHERE clause in CQL with several 'AND' and each column is > indexed, which index(es) is(are) used ? The most selective based on the average number of columns per row https://github.com/apache/cassandra/blob/trunk/src/java/org/apache/cassandra/db/index/keys/KeysSearcher.java > Also i

Re: nodetool repair uses insane amount of disk space

2012-08-16 Thread Michael Morris
Upgraded to 1.1.3 from 1.0.8 about 2 weeks ago. On Thu, Aug 16, 2012 at 5:57 PM, aaron morton wrote: > What version are using ? There were issues with repair using lots-o-space > in 0.8.X, it's fixed in 1.X > > Cheers > > - > Aaron Morton > Freelance Developer > @aaronmorton > htt

Re: Why the StageManager thread pools have 60 seconds keepalive time?

2012-08-16 Thread aaron morton
That's some pretty old code. I would guess it was done that way to conserve resources. And _i think_ thread creation is pretty light weight. Jonathan / Brandon / others - opinions ? Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 17/08/2

Cassandra 1.0 row deletion

2012-08-16 Thread Terry Cumaranatunge
Hi, We have a Cassandra 1.0 cluster that we run with RF=3 and perform operations using a consistency level of quorum. We use batch_mutate for all inserts and updates for atomicity across column families with the same row key, but use the thrift interface remove API call in C++ to delete a row so t

Re: Cassandra 1.0 row deletion

2012-08-16 Thread Derek Williams
On Thu, Aug 16, 2012 at 9:08 PM, Terry Cumaranatunge wrote: > > We have a Cassandra 1.0 cluster that we run with RF=3 and perform > operations using a consistency level of quorum. We use batch_mutate for all > inserts and updates for atomicity across column families with the same row > key, but use