Why data is not even distributed.

2012-10-03 Thread Andrey Ilinykh
Hello, everybody! I'm observing very strange behavior. I have 3 node cluster with ByteOrderPartitioner. (I run 1.1.5) I created a key space with replication factor of 1. Then I created one column family and populated it with random data. I use UUID as a row key, and Integer as a column name. Row k

RE: Read latency issue

2012-10-03 Thread Arindam Barua
Hi Roshni, In the JDBC request we do use CQL3 explicitly. We also attempted to read the data via Hector with similar read times. Supplying all (or more) the components of the key reduces the response time and the number of columns in the result a lot. However, we do need back the entire row da

RE: Read latency issue

2012-10-03 Thread Roshni Rajagopal
Hi Arindam, There were some changes for CQL3 for composite keys storage , and you may be using CQL2 by default.You could try for a non composite key or supply all the components of the key in the search...and see if you get different results... Regards,roshni > From: aba...@247-inc.com > To:

Re: unsubscribe

2012-10-03 Thread Eric Evans
On Wed, Oct 3, 2012 at 1:50 PM, Tomek Kuprowski wrote: > unsubscribe http://assets0.ordienetworks.com/misc/bf7.gif -- Eric Evans Acunu | http://www.acunu.com | @acunu

1000's of CF's. PlayOrm solves the cassandra limit on #ColFamily

2012-10-03 Thread Hiller, Dean
Okay, so it only took me two solid days not a week. PlayOrm in master branch now supports virtual CF's or virtual tables in ONE CF, so you can have 1000's or millions of virtual CF's in one CF now. It works with all the Scalable-SQL, works with the joins, and works with the PlayOrm command lin

Re: Simple data model for 1 simple range query?

2012-10-03 Thread Hiller, Dean
Is timeframe/date your composite key? Where timeframe is the first time of a partition of time (ie. If you partition by month, it is the very first time of that month). If so, then, yes, it will be very fast. The smaller your partitions are, the smaller your indexes are as well(ie. B-trees whi

Simple data model for 1 simple range query?

2012-10-03 Thread T Akhayo
Good evening, I have a quite simple data model. Pseudo CQL code: create table bars( timeframe int, date Date, info1 double, info2 double, .. primary key( timeframe, date ) ) My most important query is (which might be the only one actually): select * from bars where timeframe=X and date>Y and dat

RE: Read latency issue

2012-10-03 Thread Arindam Barua
Thanks for your responses. Just to be clear our table declaration looks something like this: CREATE TABLE sessionevents ( atag text, col2 uuid, col3 text, col4 uuid, col5 text, col6 text, col7 blob, col8 text, col9 timestamp, col10 uuid, col11 int, col12 uuid, PRIMARY KE

Re: Any way to put a hard limit on memory cap for Cassandra ?

2012-10-03 Thread Jonathan Ellis
There are three places that Cassandra will use non-heap memory: One is JVM overhead like permgen. This is a normal part of running Java-based services and will be very stable and predictable. Another is the off-heap row cache. By default no row caching is done, you have to explicitly enable it

Re: Read latency issue

2012-10-03 Thread Sergey Tryuber
Hi Arindam, Just want to share our experience about latency. We use Cassandra to read all columns for a specified key (5-10 columns, 10-15 each, no composites). All our dataset fits memory (40GB on each node) and we use LeveledCompactionStrategy. Our RF=2 and we read with CL=ONE. Also we use Astya

RandomPartitioner and the token limits

2012-10-03 Thread Carlos Pérez Miguel
Hello, Reading the wiki of operations (http://wiki.apache.org/cassandra/Operations) I noticed something strange. When using RandomPartitioner, tokens are integers in the range [0,2**127] (both limits included) but keys are converted into this range using MD5. MD5 has 128 bits, so, tokens should no

Re: Running repair negatively impacts read performance?

2012-10-03 Thread Charles Brophy
I should have done more research before asking the question. I mean real research, too :) I did a before repair, after repair, and after scrub cfstat. On a hunch I also did a before/after repair but with no scrub - instead I left the cluster alone for the length of time that a scrub normally takes

Re: Persistent connection among nodes to communicate and redirect request

2012-10-03 Thread Niteesh kumar
m sry, all connection among servers on port 9160 was some another application... thanx for support regard Niteesh On Wednesday 03 October 2012 02:59 PM, mdione@orange.com wrote: De : Hiller, Dean [mailto:dean.hil...@nrel.gov] Can you just use netstat and dig into the process id and do a

RE: Persistent connection among nodes to communicate and redirect request

2012-10-03 Thread mdione.ext
De : Hiller, Dean [mailto:dean.hil...@nrel.gov] > Can you just use netstat and dig into the process id and do a ps -ef | > grep to clear up all the confusion. A better way is to use: sudo netstat --tcp --programs Optionally, add --numeric to avoid hostname and port resolution. -- Marcos Di

Re: Getting serialized Rows from CommitLogSegment file

2012-10-03 Thread Felipe Schmidt
Wow, fast! Thank you very much Aaron. And about this Schema.getTables thing is there any idea? Regards, Felipe Mathias Schmidt *(Computer Science UFRGS, RS, Brazil)* 2012/10/3 aaron morton > > Do you know where (which e-mail thread) was it discussed? I would like to > know a little furthe

Re: Getting serialized Rows from CommitLogSegment file

2012-10-03 Thread aaron morton
> > Do you know where (which e-mail thread) was it discussed? I would like to > know a little further about it. http://www.mail-archive.com/user@cassandra.apache.org/msg25033.html Cheers - Aaron Morton Freelance Developer @aaronmorton http://www.thelastpickle.com On 3/10/2012,

Re: Persistent connection among nodes to communicate and redirect request

2012-10-03 Thread aaron morton
May end up with a local 9160 connection if you are using cassandra-cli, cqlsh or hadoop on the node. > I also observed that local write latency is around 30-40 microsecond, while > its takes around .5 miliseconds if the chosen node is not the node > responsible for the key for 50K QPS What C

Re: Read latency issue

2012-10-03 Thread aaron morton
> Running a query to like “select * from where atag=”, where > ‘atag’ is the first column of the composite key, from either JDBC or Hector > (equivalent code), results in read times of 200-300ms from a remote host on > the same network. If you send a query to select columns from a row and do

Re: cql3 and cli composites columns

2012-10-03 Thread aaron morton
> If it's just a printing issue it's really minor though I would guess because timeline CF has defined columns and line CF does not. All cqlsh knows is that every column name has two parts, it does not know what the column names will be. The difference between a static and dynamic CF. Hope th

Re: Getting serialized Rows from CommitLogSegment file

2012-10-03 Thread Felipe Schmidt
Hey Ben. Yes, that's what I'm doing. Do you know where (which e-mail thread) was it discussed? I would like to know a little further about it. Anyway, if I try to use the method Schema.instance.getTables, that returns a list of all registered tables (keyspaces) into the system, it returns null. Do

Re: How to stop streaming

2012-10-03 Thread aaron morton
> Also, how do I lookup the current value of the setstreamthroughput. The default / startup value is specified in the yaml file. If you need to check the running value it is on the StorageProxy MBean, accessed using JConsole or some other JMX thingy. Cheers - Aaron Morton Fre