Re: Is there a way to get only keys with get_indexed_slices?

2011-11-11 Thread Filipe Gonçalves
You can, just set the number of columns returned to zero (count parameter in the slice range). The indexed slices thrift call is get_indexed_slices(ColumnParent column_parent, IndexClause index_clause, SlicePredicate predicate, ConsistencyLevel consistency_level) the count parameter is in th

Re: Debian package jna bug workaroung

2011-11-11 Thread Timothy Wall
Peter Tillotson yahoo.co.uk> writes: > > Full stack traces: > > java.lang.NoClassDefFoundError: > Could not initialize class com.sun.jna.Native > > >         at com.sun.jna.Pointer.(Pointer.java:42) >         at org.apache.cassandra.cache.SerializingCache.serialize(SerializingCache.java:92)

Re: is that possible to add more data structure(key-list) in cassandra?

2011-11-11 Thread Yan Chunlu
I thought currently no one is maintaining supercolumns related code, and also it not quite efficient. On Fri, Nov 11, 2011 at 2:46 PM, Radim Kolar wrote: > Dne 11.11.2011 5:58, Yan Chunlu napsal(a): > > I think cassandra is doing great job on key-value data store, it saved me >> tremendous wo

Re: is that possible to add more data structure(key-list) in cassandra?

2011-11-11 Thread Filipe Gonçalves
You could use composite columns.For example, key: composite(listname:listindex) : value A simple get_range would give you access to list as you would normally have in any programming language, and a "get" could give you direct access to any index. Obviously, this would not be a good fit for

Client Timeouts on incrementing counters

2011-11-11 Thread Carlos Rolo
Hi, I was having lots of problems with cassandra 0.8.x running OOM. After moving to Cassandra 1.0.x OOM just disappeared, but now, my python client is having troubles incrementing counters. 2/3 of the times it tries to increment a counter it get a Timeout exception. Also incrementing on CLI I

Re: range slice with TimeUUID column names

2011-11-11 Thread footh
To make sure I had an all-encompassing date range, I tried this: UUID startId = new UUID(UUIDGen.createTime(0L), UUIDGen.getClockSeqAndNode()); UUID finishId = new UUID(UUIDGen.createTime(Long.MAX_VALUE), UUIDGen.getClockSeqAndNode()); For the range slice parameters, and I am still not getting a

Re: Efficient map reduce over ranges of Cassandra data

2011-11-11 Thread Jeremy Hanna
Nice! Thanks Ed. On Nov 10, 2011, at 11:20 PM, Edward Capriolo wrote: > Hey all, > > I know there are several tickets in the pipe that should make it possible do > use secondary indexes to run map reduce jobs that do not have to ingest the > entire dataset such as: > > https://issues.apache.

Re: Apache Cassandra Hangout in Mumbai-Pune area (India)

2011-11-11 Thread Adi
Hey GeekTalks/any other cassandra users around Mumbai/Pune, I will be around Mumbai from last week of Nov through Third week of December. I have actively used/deployed a couple of cassandra clusters and a bunch of hadoop projects over the past year. I am keenly interested in meeting any cassandra/

multi datacenter cluster, without fibre speeds

2011-11-11 Thread M Vieira
Has anyone experimented running cassandra clusters in geographicly separated locations connected thru ordinary broadband? By ordinary broadband I mean 30Mbps or 50Mbps Thanks Marco

Re: Second Cassandra users survey

2011-11-11 Thread Aaron Turner
Lately I've been working on some data processing code in Cassandra and apparently I don't write bug-free code the very first time. :) Hence, while debugging, I often need to look at data in Cassandra to see what my code is doing/should be finding, etc. This turns out to be harder then it should be

Store profile pics of users in Cassandra or file system ?

2011-11-11 Thread Aditya Narayan
Would it be recommended to store the profile pics of users on an application in Cassandra ? Or file system would be a better way to go. I came across an interesting paper which advocates storing in DB for blobs sized up to 1 MB. I was planning to store the image bytes in the same row that contained

Re: Store profile pics of users in Cassandra or file system ?

2011-11-11 Thread Aditya Narayan
just forgot to add the paper link if this is useful at all : To BLOB or Not To BLOB: Large Object Storage in a Database or a Filesystem On Sat, Nov 12, 2011 at 12:34 AM, Aditya Narayan wrote: > Would it be recommended to store the pr

Re: multi datacenter cluster, without fibre speeds

2011-11-11 Thread Radim Kolar
Dne 11.11.2011 19:14, M Vieira napsal(a): Has anyone experimented running cassandra clusters in geographicly separated locations connected thru ordinary broadband? By ordinary broadband I mean 30Mbps or 50Mbps for backup purposes, like place 1 replica on remote location over WAN?? yes. it works

Re: Client Timeouts on incrementing counters

2011-11-11 Thread Tyler Hobbs
On Fri, Nov 11, 2011 at 7:17 AM, Carlos Rolo wrote: > Also Cassandra logs have lots (as in, several times per second) of this > message now: > > ** ** > > INFO 14:15:25,740 Saved ClusterCassandra-CounterFamily-RowCache (52 items) > in 1 ms > > > What does the CLI say the row_cache_save_

Re: Store profile pics of users in Cassandra or file system ?

2011-11-11 Thread Radim Kolar
We store multi gigabyte sized files in cassandra chunked to 10 MB parts. Storage is cheap today and replication will make your reads faster.

Re: changing ownership when nodes have same token

2011-11-11 Thread hani elabed
I have had this happen to me when I copied the installation from one node to the other( 2 Virtual machines with same configuration). I had already started the first node with the default setup, i.e a single node without providing a seed IP address. See step 3 in here http://wiki.apache.org/cassan

Re: Restoring 0.8 ring in new cluster says nodes have the same token

2011-11-11 Thread blafrisch
I figured it out. If I started node 2 first and used that as the seed for the rest then the ring formed correctly. I guess the ring information stored in the LocationInfo CF of node 1 was wrong. For future information, when does this information get updated? All of the nodes have different ring

questions on frequency and timing of async replication between DCs

2011-11-11 Thread Subrahmanya Harve
I have cross dc replication set up using 0.8.7 with 3 nodes on each DC by following the +1 rule for tokens. I am seeing an issue where the insert into a DC happened successfully but on querying from cli or through Hector, i am not seeing the data being returned. i used cli on every node of both DCs

Re: questions on frequency and timing of async replication between DCs

2011-11-11 Thread Jeremiah Jordan
If you query with ALL do you get the data? If you query with a range slice do you get the data (list from the cli)? On 11/11/2011 04:10 PM, Subrahmanya Harve wrote: I have cross dc replication set up using 0.8.7 with 3 nodes on each DC by following the +1 rule for tokens. I am seeing an issu

Re: questions on frequency and timing of async replication between DCs

2011-11-11 Thread Subrahmanya Harve
If i query by list command, i do see the data i am looking for. Interestingly, if do list on each of the nodes using cli, i get back a different number of rows! i did not want to query by ALL because of 2 reasons - 1. It might take too long to return. 2 it might not return me anything just because

Re: Second Cassandra users survey

2011-11-11 Thread Aaron Turner
Oh, and one more thing: If you're doing a select and you get no results, then an indication of no columns or no rows matching would be nice. Kinda painful when you're typing in long strings and get no result, wonder why, only to find out you fat fingered your row key. :( On Fri, Nov 11, 2011 at

After deleting some data from the cluster under Solandra, we keep seeing this assertion.

2011-11-11 Thread Jacob, Arun
After removing some data from Solandra via a Solr query, we are getting DecoratedKey assertions. Our setup: latest version of Solandra (I think it supports 0.8.6, please correct if wrong) 3 solandra nodes, with replication set to 2 and sharding set to 3. No systems are currently running (ingest

Help with Cassandra Row Caches

2011-11-11 Thread Oleg Tsvinev
Hi everybody, We set row cache too high, 1 or so and now all our 6 nodes fail with OOM. I believe that high row cache causes OOMs. Now, we trying to change row cache sizes using cassandra-cli update column family command but Cassandra nodes keep dying. Any help is appreciated, it's a product

Re: Help with Cassandra Row Caches

2011-11-11 Thread Mohit Anchlia
Can you temporarily increase the size of Heap and try? On Fri, Nov 11, 2011 at 5:21 PM, Oleg Tsvinev wrote: > Hi everybody, > > We set row cache too high, 1 or so and now all our 6 nodes fail > with OOM. I believe that high row cache causes OOMs. > > Now, we trying to change row cache sizes u

Re: Second Cassandra users survey

2011-11-11 Thread Edward Capriolo
It seems like you could use a composite key partioner to accomplish this On Monday, November 7, 2011, Daniel Doubleday wrote: > Allow for deterministic / manual sharding of rows. > > Right now it seems that there is no way to force rows with different row keys will be stored on the same nodes in

Re: Help with Cassandra Row Caches

2011-11-11 Thread Tyler Hobbs
You could also delete any saved caches, which might give you some more breathing room. On Fri, Nov 11, 2011 at 7:53 PM, Mohit Anchlia wrote: > Can you temporarily increase the size of Heap and try? > > On Fri, Nov 11, 2011 at 5:21 PM, Oleg Tsvinev > wrote: > > Hi everybody, > > > > We set row ca

secondary indexes streaming building - when there are none

2011-11-11 Thread Jeremy Hanna
We're using 0.8.4 in our cluster and two nodes needed rebuilding. When building and streaming data to the nodes, there were multiple instances of building secondary indexes. We haven't had secondary indexes in that keyspace since like mid-August. Is that a bug? Thanks, Jeremy