Re: Upgrade 1.1.2 -> 1.1.6

2012-11-19 Thread Alain RODRIGUEZ
"This looks like the counters were more out of sync before the upgrade than after?" My guess is the update makes some counters over-count since I saw the value of the sum of our daily counter increase by 2000 after each restart at the exact moment that the node is marked as being up. This counter

Re: Cassandra nodes failing with OOM

2012-11-19 Thread Janne Jalkanen
Something that bit us recently was the size of bloom filters: we have a column family which is mostly written to, and only read sequentially, so we were able to free a lot of memory and decrease GC pressure by increasing bloom_filter_fp_chance for that particular CF. This on 1.0.12. /Janne O

Re: Remove crashed node

2012-11-19 Thread Robin Verlangen
Quite some time passed, but Alain, it worked. Thank you very much. Best regards, Robin Verlangen *Software engineer* * * W http://www.robinverlangen.nl E ro...@us2.nl Disclaimer: The information contained in this message and attachments is intended solely for the attention

Re: SchemaDisagreementException

2012-11-19 Thread Everton Lima
I was using cassandra direct because it has more performace than using CQL. Therefore, I am using cassandra because of replication factor and consistence of data. I am using it as a lib of my app. I only make sample querys, just use a key to point to a data. 2012/11/16 Everton Lima > I do that b

RE: Cassandra nodes failing with OOM

2012-11-19 Thread Viktor Jevdokimov
We've seen OOM in a situation, when OS was not properly prepared in production. http://www.datastax.com/docs/1.1/install/recommended_settings Best regards / Pagarbiai Viktor Jevdokimov Senior Developer Email: viktor.jevdoki...@adform.com Phone: +370 5 212 3

Re: Remove crashed node

2012-11-19 Thread Alain RODRIGUEZ
I am glad to here that. You're very welcome. Alain 2012/11/19 Robin Verlangen > Quite some time passed, but Alain, it worked. Thank you very much. > > Best regards, > > Robin Verlangen > *Software engineer* > * > * > W http://www.robinverlangen.nl > E ro...@us2.nl > > >

row cache re-fill very slow

2012-11-19 Thread Andras Szerdahelyi
Hey list, i was just wondering if anyone else is experiencing very slow ( ~ 3.5 MB/sec ) re-fill of the row cache at start up. We operate with a large row cache ( 10-15GB currently ) and we already measure startup times in hours :-) How is the "saved row cache file" processed? Are the cached ro

Re: Datatype Conversion in CQL-Client?

2012-11-19 Thread Brian O'Neill
I don't think Michael and/or Jonathan have published the CQL java driver yet. (CCing them) Hopefully they'll find a public home for it soon, I hope to include it in the Webinar in December. (http://www.datastax.com/resources/webinars/collegecredit) -brian --- Brian O'Neill Lead Architect, Softw

Re: Datatype Conversion in CQL-Client?

2012-11-19 Thread Timmy Turner
What I meant was the method that the Cassandra-jars give you when you include them in your project: TTransport tr = new TFramedTransport(new TSocket("localhost", 9160)); TProtocol proto = new TBinaryProtocol(tr); Cassandra.Client client = new Cassandra.Client(proto); tr.open(); client.ex

Re: Collections, query for "contains"?

2012-11-19 Thread Edward Capriolo
This was my first question after I git the inserts working. Hive has udfs like array contains. It also has lateral view syntax that is similar to transposed. On Monday, November 19, 2012, Timmy Turner wrote: > Is there no option to query for the contents of a collection? > Something like > sele

Re: SchemaDisagreementException

2012-11-19 Thread Edward Capriolo
even if you made the calls through cql you would have the same issue since cql uses thrift. 1.2:0 is supposed to be nicer with concurrent modifications. On Monday, November 19, 2012, Everton Lima wrote: > I was using cassandra direct because it has more performace than using CQL. Therefore, I am

Re: Datatype Conversion in CQL-Client?

2012-11-19 Thread Brian O'Neill
Gotcha Timmy. That is the Thrift API. You are operating at a pretty low-level. I'm not sure that is considered the "official" CQL client. IMHO, you might be better off moving up a level. I'd probably either wait for the official CQL Java Driver, or access CQL via a higher-level client like He

Re: Datatype Conversion in CQL-Client?

2012-11-19 Thread John Sanda
You might want to take look a org.apache.cassandra.transport.SimpleClient and org.apache.cassandra.transport.messages.ResultMessage. On Mon, Nov 19, 2012 at 9:48 AM, Timmy Turner wrote: > What I meant was the method that the Cassandra-jars give you when you > include them in your project: > >

Re: Collections, query for "contains"?

2012-11-19 Thread Sylvain Lebresne
It's not supported yet, no, but we have a ticket for it: https://issues.apache.org/jira/browse/CASSANDRA-4511 On Mon, Nov 19, 2012 at 3:56 PM, Edward Capriolo wrote: > This was my first question after I git the inserts working. Hive has udfs > like array contains. It also has lateral view syntax

Datastax Java Driver

2012-11-19 Thread Sylvain Lebresne
Everyone, We've just open-sourced a new Java driver we have been working on here at DataStax. This driver is CQL3 only and is built to use the new binary protocol that will be introduced with Cassandra 1.2. It will thus only work with Cassandra 1.2 onwards. Currently, it means that testing it requ

Re: Datastax Java Driver

2012-11-19 Thread Brian O'Neill
Woohoo! Thanks for making this available. --- Brian O'Neill Lead Architect, Software Development Health Market Science The Science of Better Results 2700 Horizon Drive € King of Prussia, PA € 19406 M: 215.588.6024 € @boneill42 € healthmarketscience.com This

RE: SchemaDisagreementException

2012-11-19 Thread Everton Lima
For some reason I can not reply my old thread in that list. So I was creating a new one. The problem is that I do not use thrift to gain in performace. Why it is nicer with concurrent modifications? I do not know why I have falling in the problem of concurrent modification if I was creating 2 keys

Re: Datastax Java Driver

2012-11-19 Thread John Sanda
Fantastic! As for the object mapping API, has there been any discussion/consideration of http://www.hibernate.org/subprojects/ogm.html? On Mon, Nov 19, 2012 at 1:50 PM, Sylvain Lebresne wrote: > Everyone, > > We've just open-sourced a new Java driver we have been working on here at > DataStax. T

Re: Upgrade 1.1.2 -> 1.1.6

2012-11-19 Thread Rob Coli
On Thu, Nov 15, 2012 at 6:21 AM, Alain RODRIGUEZ wrote: > We had an issue with counters over-counting even using the nodetool drain > command before upgrading... You're sure the over-count was caused by the upgrade?Counts can be counted on (heh) to overcount. What is the scale of the over-count?

Re: Query regarding SSTable timestamps and counts

2012-11-19 Thread Rob Coli
On Sun, Nov 18, 2012 at 7:57 PM, Ananth Gundabattula wrote: > As per the above url, " After running a major compaction, automatic minor > compactions are no longer triggered, frequently requiring you to manually > run major compactions on a routine basis." ( Just before the heading Tuning > Column

Re: SchemaDisagreementException

2012-11-19 Thread Tyler Hobbs
Have you actually tested to see that the Thrift API is more performant than CQL for your application? As far as I know, CQL almost always has a performance advantage over the Thrift API. On Mon, Nov 19, 2012 at 1:05 PM, Everton Lima wrote: > For some reason I can not reply my old thread in that

Re: Datatype Conversion in CQL-Client?

2012-11-19 Thread Timmy Turner
Do these other clients use the thrift API internaly? 2012/11/19 John Sanda > You might want to take look a org.apache.cassandra.transport.SimpleClient > and org.apache.cassandra.transport.messages.ResultMessage. > > > On Mon, Nov 19, 2012 at 9:48 AM, Timmy Turner wrote: > >> What I meant was

Re: Datatype Conversion in CQL-Client?

2012-11-19 Thread Brian O'Neill
Hector does, but the newer clients/drivers no longer use Thrift. (Thrift is the legacy protocol) If you are still in early stages and you know you want your primary interface to be CQL, you may want to look at the java driver that Datastax just released. http://github.com/datastax/java-driver

Re: row cache re-fill very slow

2012-11-19 Thread aaron morton
> i was just wondering if anyone else is experiencing very slow ( ~ 3.5 MB/sec > ) re-fill of the row cache at start up. It was mentioned the other day. What version are you on ? Do you know how many rows were loaded ? When complete it will log a message with the pattern "completed loading

Re: Invalid argument

2012-11-19 Thread aaron morton
Are you running a 32 bit JVM ? What is the full JVM version ? As a work around you can try disabling memory mapped access set disk_access_mode to standard. Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 20/11/2012

Re: row cache re-fill very slow

2012-11-19 Thread Wei Zhu
Last time I checked, it took about 120 seconds to load up 21125 keys with total about 500M in memory ( We have a pretty wide row:). So it's about 4 MB/sec. Just curious Andras, how can you manage such a big row cache (10-15GB currently)? By recommendation, you will have 10% of your heap as row c

Re: row cache re-fill very slow

2012-11-19 Thread Andras Szerdahelyi
Aaron, What version are you on ? 1.1.5 Do you know how many rows were loaded ? INFO [OptionalTasks:1] 2012-11-19 13:08:58,868 ColumnFamilyStore.java (line 451) completed loading (5175655 ms; 13259976 keys) row cache In both cases I do not believe the cache is stored in token (or key) order.

Re: row cache re-fill very slow

2012-11-19 Thread Andras Szerdahelyi
Wei, i'm using the off-heap ( serialised ) row cache and front the entire thing with memcached in the middle layer ( to prevent the most actively requested rows from pressuring the Cassandra heap ). If you ask how much the pointers to the off-heap memory will take.. time will tell ( it should b

Re: row cache re-fill very slow

2012-11-19 Thread Rob Coli
On Mon, Nov 19, 2012 at 6:17 AM, Andras Szerdahelyi < andras.szerdahe...@ignitionone.com> wrote: > How is the "saved row cache file" processed? Are the cached row keys > simply iterated over and their respective rows read from SSTables - > possibly creating random reads with small enough sstable f

Using Hector from .Net

2012-11-19 Thread Peter Lin
I thought others might be interested to know. If you've ever asked "what drivers are available for Cassandra in .Net and can I use Hector?" the answer is yes and it works quite well. Just use IKVM with hector peter lin

Re: Datastax Java Driver

2012-11-19 Thread michael.figui...@gmail.com
We're currently talking with some guys from Hibernate OGM team. This kind of integration is likely to happen on top of this core module. The mapping module that will add soon isn't alike OGM as it'll be a very thin mapping library, covering most use cases with low overhead. Michaël On Mon, Nov 19

Re: Datastax Java Driver

2012-11-19 Thread Timmy Turner
Why is this being released as a separate project, instead of being bundled up with Cassandra? Is it not a part of Cassandra? 2012/11/19 John Sanda > Fantastic! As for the object mapping API, has there been any > discussion/consideration of http://www.hibernate.org/subprojects/ogm.html? > > > On

Re: Datastax Java Driver

2012-11-19 Thread michael.figui...@gmail.com
The Apache Cassandra project has traditionally not focused on client side. Rather than modifying the scope of the project and jeopardizing the current driver ecosystem we've preferred to open source it this way. Not that this driver's license is Apache License 2 and it will remain so, making it eas

Re: SchemaDisagreementException

2012-11-19 Thread Everton Lima
Yes I already have tested. I use the Object CassandraServer to do the operations instead of open conection with CassandraClient. Both of this object implements Iface. I think the performace of use CassandraServer improve because it does not open an connection, and CassandraClient (that use thrift)

Re: SchemaDisagreementException

2012-11-19 Thread Michael Kjellman
While this might not be helpful (I don't have all the thread history here), have you checked that all your servers are properly synced with NTP? From: Everton Lima mailto:peitin.inu...@gmail.com>> Reply-To: "user@cassandra.apache.org" mailto:user@cassandra.apach

Re: Upgrade 1.1.2 -> 1.1.6

2012-11-19 Thread Mike Heffner
Alain, We performed a 1.1.3 -> 1.1.6 upgrade and found that all the logs replayed regardless of the drain. After noticing this on the first node, we did the following: * nodetool flush * nodetool drain * service cassandra stop * mv /path/to/logs/*.log /backup/ * apt-get install cassandra I also

Re: SchemaDisagreementException

2012-11-19 Thread Edward Capriolo
http://www.acunu.com/2/post/2011/12/cql-benchmarking.html Last I checked, thrift still had an edge over cql due to string serialization and de serialization. Might be even more dramatic for later columns. Not that client speed matters much overall in cassandra's speed, but CQL client "does more".