org.apache.cassandra.io.sstable.CorruptSSTableException: java.io.IOException: Corrupt (negative) value length encountered

2014-02-28 Thread Shammi Jayasinghe
Hi, we are using Apache Cassandra 1.2.13 with three nodes. Under high load we are getting the following exception [1]. Could someone help with this? It is already reported in [2]. [1] INFO [ScheduledTasks:1] 2014-02-27 21:56:59,928 GCInspector.java (line 119) GC for ParNew: 241 m

CorruptSSTableException in system_auth keyspace

2014-02-28 Thread Ondřej Černoš
Hello, we are trying to add authentication to our Cassandra cluster. We add our authenticated users during puppet deployment using the default user, which is then disabled. We have the following issues: - we see CorruptSSTableException in system_auth.users table - we are not able to add users af

Re: CorruptSSTableException in system_auth keyspace

2014-02-28 Thread Ondřej Černoš
Sorry, I sent the mail too early. This is the stack trace: 2014-02-28 10:56:03.205+0100 [SSTableBatchOpen:1] [ERROR] DebuggableThreadPoolExecutor.java(218) org.apache.cassandra.concurrent.DebuggableThreadPoolExecutor: Error in ThreadPoolExecutor org.apache.cassandra.io.sstable.CorruptSSTableExc

Re: CQL: Any way to have inequalities on multiple clustering columns in a WHERE clause?

2014-02-28 Thread Clint Kelly
Yes, thank you!
On Thu, Feb 27, 2014 at 10:26 PM, DuyHai Doan wrote:
> Clint, what you want is this :
> https://issues.apache.org/jira/browse/CASSANDRA-4851
>
> select * from foo where key=something and fam = 'Info' and (qual,version)
> > ('A',2013) and qual < 'D' ALLOW FILTERING
>

Compaction does not remove tombstones if column has higher TTL

2014-02-28 Thread Keith Wright
FYI – I recently filed https://issues.apache.org/jira/browse/CASSANDRA-6654 and wanted to let everyone know the result as it was not what I expected. I am using C* 1.2.12 and found that my droppable tombstone ratio kept increasing on an LCS table (currently > .3). Documentation states that co

Query on blob col using CQL3

2014-02-28 Thread Senthil, Athinanthny X. -ND
Can anyone suggest how to query on a blob column via CQL3? I get a bad request error saying it cannot parse the data. I want to look up on the key column, which is defined as a blob. I am able to look up the data via the OpsCenter data explorer. Is there a conversion function I need to use? Sent from my Galaxy

Re: Query on blob col using CQL3

2014-02-28 Thread Mikhail Stepura
Did you try http://cassandra.apache.org/doc/cql3/CQL.html#blobFun ? On 2/28/14, 9:14, Senthil, Athinanthny X. -ND wrote: Anyone can suggest how to query on blob column via CQL3. I get bad request error saying cannot parse data. I want to lookup on key column which is defined as blob. But I am
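A minimal sketch of one way to query on a blob key from client code, written in Python. CQL3 accepts blob constants written as hexadecimal digits with a 0x prefix, so a key held as raw bytes can be rendered as a literal before it goes into the WHERE clause. The sketch only builds the query text; the table name `my_table` and the column name `row_key` are hypothetical and do not come from the thread.

```python
def blob_literal(data: bytes) -> str:
    # CQL3 blob constants are hexadecimal digits prefixed with 0x.
    return "0x" + data.hex()


def select_by_blob_key(table: str, key_column: str, key: bytes) -> str:
    # Table and column names are illustrative; they are not validated here.
    return f"SELECT * FROM {table} WHERE {key_column} = {blob_literal(key)}"


# Hypothetical usage: look up the row whose blob key is the two bytes CA FE.
query = select_by_blob_key("my_table", "row_key", bytes([0xCA, 0xFE]))
```

Passing a quoted string where a blob is expected is a plausible cause of a parse error like the one described, though the thread snippet does not show the failing query.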

Re: Query on blob col using CQL3

2014-02-28 Thread Peter Lin
Why are you trying to view a blob with CQL3? And what kind of blob is it? If the blob is an object, there's no way to view that in CQL3. You'd need to do extra work like user defined types, but I don't know of anyone that's actually using that. On Fri, Feb 28, 2014 at 12:14 PM, Senthil, Athinant

Re: Getting the most-recent version from time-series data

2014-02-28 Thread Clint Kelly
Hi Tupshin, Thanks for your help once again, I really appreciate it. Quick question regarding the issue of token-aware routing, etc. Let's say that I am using the table described earlier: CREATE TABLE time_series_stuff ( key text, family text, version int, val text, PRIMARY KEY (key,

Re: Combine multiple SELECT statements into one RPC?

2014-02-28 Thread Clint Kelly
Hi Sylvain, Thanks for your response. I am writing code to allow users to query a table that looks something like this: CREATE TABLE time_series_stuff ( key text, family text, qualifier text, version long, val blob, PRIMARY KEY (key, family, qualifier, version) ) WITH CLUSTERING ORDE

Re: Getting the most-recent version from time-series data

2014-02-28 Thread Clint Kelly
Hi Tupshin, BTW, you asked earlier about the number of different distinct "family" values. There could easily be millions of different families, each with many different values. Right now I see two options: 1. Query the table once just to get all of the distinct families, then do separate

Caching prepared queries and different consistency levels

2014-02-28 Thread Wayne Schroeder
After upgrading to the 2.0 driver branch, I received a lot of warnings about re-preparing previously prepared statements. I read about this issue, and my workaround was to cache my prepared statements in a Map internally in my app via a common prepare method, where the string key was the CQL q
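The caching approach described above can be sketched as follows. This is a Python illustration of the idea, not the poster's code. The `prepare` callable is a hypothetical stand-in for whatever prepare call the driver exposes, and the class name is invented for the example.

```python
import threading
from typing import Any, Callable, Dict


class PreparedStatementCache:
    """Cache prepared statements by CQL text so each query is prepared once."""

    def __init__(self, prepare: Callable[[str], Any]) -> None:
        # `prepare` is a hypothetical stand-in for the driver's prepare call.
        self._prepare = prepare
        self._cache: Dict[str, Any] = {}
        self._lock = threading.Lock()

    def get(self, cql: str) -> Any:
        # Hold the lock across the check and the prepare call so that two
        # threads never prepare the same CQL string twice.
        with self._lock:
            if cql not in self._cache:
                self._cache[cql] = self._prepare(cql)
            return self._cache[cql]
```

Keying on the CQL string alone means any per-request setting, such as a consistency level, has to be applied to whatever is derived from the cached statement rather than stored in the cache key.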

Any way to get a list of per-node token ranges using the DataStax Java driver?

2014-02-28 Thread Clint Kelly
Hi everyone, I've been working on a rewrite of the Cassandra InputFormat for Hadoop 2 using the DataStax Java driver instead of the Thrift API. I have a prototype working now, but there is one bit of code that I have not been able to replace with code for the Java driver. In the InputFormat#getS

Re: Caching prepared queries and different consistency levels

2014-02-28 Thread Wayne Schroeder
Well, it may seem like I'm talking to myself now with this response, but I cracked open the source and found the answer in fairly short order so I figured I would share what I found. Datastax folks, please do verify that I'm correct if you don't mind. Long story short, BoundStatement initializ

Re:

2014-02-28 Thread Tyler Hobbs
Can you clarify exactly what you need help with? It seems like you already know how to fetch the timestamps. Are you just looking for python code to filter data that's not in a time range? By the way, there's a pycassa-specific mailing list here: https://groups.google.com/forum/#!forum/pycassa-d

Re:

2014-02-28 Thread Kumar Ranjan
Yes, filter out based on time range. Currently I do this in Python. Just curious to see if this can be done using pycassa somehow? On Fri, Feb 28, 2014 at 2:13 PM, Tyler Hobbs wrote: > Can you clarify exactly what you need help with? It seems like you already > k

Re:

2014-02-28 Thread Tyler Hobbs
No, pycassa won't do anything fancy with timestamps automatically; you'll have to keep doing it yourself. On Fri, Feb 28, 2014 at 1:28 PM, Kumar Ranjan wrote: > Yes, filter out based on time range. Currently i do this in python . Just > curious to see if this can be done using pycassa somehow? > -
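A minimal sketch of the client-side filtering discussed here, in plain Python. It assumes the columns have already been fetched into a dict that maps each column name to a (value, timestamp) pair. That shape, the function name, and the sample data are assumptions for illustration, and the timestamps only need to be in the same unit as the range bounds.

```python
from typing import Any, Dict, Tuple

Columns = Dict[str, Tuple[Any, int]]


def filter_by_timestamp(columns: Columns, start: int, end: int) -> Columns:
    # Keep only columns whose write timestamp lies in the half-open
    # range [start, end).
    return {
        name: (value, ts)
        for name, (value, ts) in columns.items()
        if start <= ts < end
    }


# Hypothetical data: column name -> (value, timestamp).
fetched = {"a": ("v1", 100), "b": ("v2", 200), "c": ("v3", 300)}
recent = filter_by_timestamp(fetched, 150, 300)
```

The half-open range makes it easy to split a longer period into adjacent windows without counting a column twice.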

Re:

2014-02-28 Thread Kumar Ranjan
Thanks Tyler. Yes, I scanned through the pycassaShell code a couple of times but did not find anything like that. On Fri, Feb 28, 2014 at 3:24 PM, Tyler Hobbs wrote: > No, pycassa won't do anything fancy with timestamps automatically, you'll > have to keep doing yourself. > > > On Fri, Feb 28, 2014 a

Resetting a counter in CQL

2014-02-28 Thread Clint Kelly
Folks, What is the best known method for resetting a counter in CQL? Is it best to read the counter and then increment it by a negative amount? Or to delete the row and then increment it by zero? These are the two methods I could come up with. Both of these seem fine to me---I'm just wondering

Re: Resetting a counter in CQL

2014-02-28 Thread Tyler Hobbs
On Fri, Feb 28, 2014 at 6:32 PM, Clint Kelly wrote:
> What is the best known method for resetting a counter in CQL? Is it best
> to read the counter and then increment it by a negative amount?
Do this.
> Or to delete the row and then increment it by zero?
Don't do this. When you de
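The recommended approach (read the counter, then apply the opposite amount) can be sketched as below. This Python helper only builds the CQL UPDATE text from a value that was read beforehand. The table name `page_hits`, the counter column `hits`, and the key column `page` are hypothetical.

```python
def build_counter_reset(table: str, counter: str, key_column: str, current: int) -> str:
    # Subtract the value that was just read; if the counter is negative,
    # add its magnitude instead so the statement never contains "- -n".
    if current >= 0:
        delta = f"- {current}"
    else:
        delta = f"+ {-current}"
    return f"UPDATE {table} SET {counter} = {counter} {delta} WHERE {key_column} = ?"


# Hypothetical usage: the counter was read as 5, so subtract 5.
statement = build_counter_reset("page_hits", "hits", "page", 5)
```

The read and the update are two separate operations, so any increment that arrives between them is preserved: the counter ends at the value of those later increments rather than at exactly zero.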

Re: Resetting a counter in CQL

2014-02-28 Thread Clint Kelly
Great, thanks! On Fri, Feb 28, 2014 at 4:38 PM, Tyler Hobbs wrote: > > On Fri, Feb 28, 2014 at 6:32 PM, Clint Kelly wrote: > >> >> >> What is the best known method for resetting a counter in CQL? Is it best >> to read the counter and then increment it by a negative amount? > > > Do this. > > >

Re: Any way to get a list of per-node token ranges using the DataStax Java driver?

2014-02-28 Thread Tupshin Harper
For the first question, try "select * from system.peers" http://www.datastax.com/documentation/cql/cql_using/use_query_system_c.html?pagename=docs&version=1.2&file=cql_cli/using/query_system_tables For the second, there is a JMX and nodetool command, but I'm not aware of any way to get it directl

Re: Getting the most-recent version from time-series data

2014-02-28 Thread Tupshin Harper
You are correct that with that schema, all data for a given key would be in a single partition, and hence on the same node(s). I missed that before. -Tupshin On Fri, Feb 28, 2014 at 12:47 PM, Clint Kelly wrote: > Hi Tupshin, > > Thanks for your help once again, I really appreciate it. Quick