[no subject]

2013-10-06 Thread Ran Tavory
Hi, I have a small cluster of 1.2.6 and after some config changes I started seeing errors int the logs. Not sure that's related, but the changes I performed were to disable hinted handoff and disable auto snapshot. I'll try to reverte these, see if the picture changes. But anyway, that seems like

AssertionError: DecoratedKey(... ) != DecoratedKey (...)

2013-10-06 Thread Ran Tavory
Pardon me, now with the appropriate subject line... Hi, I have a small cluster of 1.2.6 and after some config changes I started seeing errors int the logs. Not sure that's related, but the changes I performed were to disable hinted handoff and disable auto snapshot. I'll try to reverte these, see

com.datastax.driver.core.exceptions.WriteTimeoutException: Cassandra timeout during write query

2013-10-06 Thread Ran Tavory
Hi all, when using the java-driver I see this error on the client, for reads (as well as for writes). Many of the ops succeed, however I do see a significant amount of errors. com.datastax.driver.core.exceptions.WriteTimeoutException: Cassandra timeout during write query at consistency ONE (1 repl

Re: AssertionError: DecoratedKey(... ) != DecoratedKey (...)

2013-10-06 Thread Ran Tavory
Update: I've reverted hinted_handoff_enabled back to its default value of true and the errors stopped. Is this just a coincidence, or could be related? On Sun, Oct 6, 2013 at 7:23 PM, Ran Tavory wrote: > Pardon me, now with the appropriate subject line... > > Hi, I have a sm

Re: Cassandra Agent

2013-10-10 Thread Ran Tavory
Seems like the greeks are all used out, how about moving the the japanese mythology? it's a brand new pool of names... http://en.wikipedia.org/wiki/Japanese_mythology On Fri, Oct 11, 2013 at 8:29 AM, Blair Zajac wrote: > On 10/10/2013 10:28 PM, Blair Zajac wrote: > >> On 10/10/2013 08:53 PM, S

Re: Cassandra Agent

2013-10-11 Thread Ran Tavory
--DRS > >> On Oct 11, 2013, at 6:55 AM, Edward Capriolo wrote: >> >> Stick sandra on the end. Restsandra. >> >>> On Friday, October 11, 2013, Ran Tavory wrote: >>> Seems like the greeks are all used out, how about moving the the japanes

Re: understanding the cassandra storage scaling

2010-12-09 Thread Ran Tavory
there are two numbers to look at, N the numbers of hosts in the ring (cluster) and R the number of replicas for each data item. R is configurable per column family. Typically for large clusters N >> R. For very small clusters if makes sense for R to be close to N in which case cassandra is useful s

Re: understanding the cassandra storage scaling

2010-12-09 Thread Ran Tavory
> On Thu, Dec 9, 2010 at 11:35 AM, Ran Tavory wrote: > > there are two numbers to look at, N the numbers of hosts in the ring > > (cluster) and R the number of replicas for each data item. R is > configurable > > per column family. > > Typically for large clusters N >

Re: iterate over all the rows with RP

2010-12-12 Thread Ran Tavory
This should be the case, yes, semantics isn't affected by the connection and state isn't kept. What might happen if you read/write with low consistency levels then when you hit a different host on the ring it might have an inconsistent state in case of partition. On Sunday, December 12, 2010, shim

Re: Cassandra Monitoring

2010-12-19 Thread Ran Tavory
FYI, I just added an mx4j section to the bottom of this page http://wiki.apache.org/cassandra/Operations On Sun, Dec 19, 2010 at 4:30 PM, Jonathan Ellis wrote: > mx4j? https://issues.apache.org/jira/browse/CASSANDRA-1068 > > > On Sun, Dec 19, 2010 at 8:36 AM, Peter Schuller < > peter.schul...@i

Re: Cassandra Monitoring

2010-12-19 Thread Ran Tavory
are with the earlier jmx-to-rest bridge listed in the > operations page: > "JMX-to-REST bridge available > at http://code.google.com/p/polarrose-jmx-rest-bridge"; > > ThanksDave Viner > > > On Sun, Dec 19, 2010 at 7:01 AM, Ran Tavory wrote: > FYI, I just add

Re: Retrieving TimeUUID sub-columns in reverse order

2010-12-26 Thread Ran Tavory
Roshan, in cassandra.yaml did you define compare_subcolumns_with for the SCF? The subcolumn names are the timeuuid, not the subcolumn values, right? On Sun, Dec 26, 2010 at 9:14 AM, Roshan Dawrani wrote: > Hi, > > I have a super ColumnFamily that has a few super columns, with each having > a gro

Re: The mutations to delete sub-columns can't be batched?

2010-12-27 Thread Ran Tavory
an oversight as well. as you can imagine, non of us is a fan of super columns... On Tue, Dec 28, 2010 at 6:28 AM, Roshan Dawrani wrote: > Hi, > > I think I may have found another oversight on the Hector API. > > OrderedRows has a convenient peekLast() method, but it is missing in > OrderedSuperRo

Re: Hector version

2010-12-30 Thread Ran Tavory
Use 0.6.0-19 On Friday, December 31, 2010, Zhidong She wrote: > Hi guys, > > We are trying Cassandra 0.6.8, and could you please kindly tell me which > Hector Java client is suitable for 0.6.8? > The Hector 0.7.0 says it's for Cassandra 0.7.X, and shall we use Hector 0.6.0? > > Thanks, > Br > Zh

Bootstrapping taking long

2011-01-04 Thread Ran Tavory
I asked the same question on the IRC but no luck there, everyone's asleep ;)... Using 0.6.6 I'm adding a new node to the cluster. It starts out fine but then gets stuck on the bootstrapping state for too long. More than an hour and still counting. $ bin/nodetool -p 9004 -h localhost streams > Mod

Re: Bootstrapping taking long

2011-01-04 Thread Ran Tavory
the other nodes logs to see if there is any node doing > anticompaction. > I don't remember how much data I had in the cluster when I needed to > add/remove nodes. I do remember that it took a few hours. > > The node will join the ring only when it will finish the bootstrap. > &g

Re: Bootstrapping taking long

2011-01-04 Thread Ran Tavory
ter it had finished moving the data nothing happened for a long time (I'm still waiting, but nothing seems to be happening). Any hints how to analyze a "stuck" bootstrapping node?? thanks On Tue, Jan 4, 2011 at 1:51 PM, Ran Tavory wrote: > Thanks Shimi, so indeed anticompaction

Re: Bootstrapping taking long

2011-01-04 Thread Ran Tavory
to monitor the anti-compaction > progress (it puts new SSTables for bootstrapping node in there) > > > On Tue, Jan 4, 2011 at 8:01 AM, Ran Tavory wrote: > >> Running nodetool decommission didn't help. Actually the node refused to >> decommission itself (b/c it wasn't pa

Re: Bootstrapping taking long

2011-01-04 Thread Ran Tavory
now if you think there is. Or any other advice... On Tuesday, January 4, 2011, Ran Tavory wrote: > Thanks Jake, but unfortunately the streams directory is empty so I don't > think that any of the nodes is anti-compacting data right now or had been in > the past 5 hours. It seems th

Re: Bootstrapping taking long

2011-01-04 Thread Ran Tavory
lhost streams > Mode: Bootstrapping > Not sending any streams. > Not receiving any streams. On Wed, Jan 5, 2011 at 1:20 AM, Nate McCall wrote: > Does the new node have itself in the list of seeds per chance? This > could cause some issues if so. > > On Tue, Jan 4, 2011 at

Re: Bootstrapping taking long

2011-01-04 Thread Ran Tavory
but if I do it in two phases it did work. So it's either my misunderstanding or a bug... On Wed, Jan 5, 2011 at 7:14 AM, Ran Tavory wrote: > The new node does not see itself as part of the ring, it sees all others > but itself, so from that perspective the view is consistent. > The

Re: Bootstrapping taking long

2011-01-05 Thread Ran Tavory
in their list of seeds - always did - and > everything works. (You may ask why I did this. I don't know, I must have > copied it from an example somewhere.) > > On Wed, Jan 5, 2011 at 9:42 AM, Ran Tavory wrote: > >> I was able to make the node join the ring but I'm

Re: Bootstrapping taking long

2011-01-05 Thread Ran Tavory
I haven't tried repair. Should I? On Jan 5, 2011 3:48 PM, "Jake Luciani" wrote: > Have you tried not bootstrapping but setting the token and manually calling > repair? > > On Wed, Jan 5, 2011 at 7:07 AM, Ran Tavory wrote: > >> My conclusion is lame: I tri

Re: Bootstrapping taking long

2011-01-05 Thread Ran Tavory
me, seed list should be the same >> across the cluster. >> I'm just thinking of other things to try, non-boostrapped nodes should >> join the ring instantly but reads will fail if you aren't using quorum. >> >> >> On Wed, Jan 5, 2011 at 8:51 AM, Ran Tavory w

Re: Bootstrapping taking long

2011-01-05 Thread Ran Tavory
ve to use at least 0.6.7 > > > > On Wed, Jan 5, 2011 at 4:19 PM, Edward Capriolo wrote: > >> On Wed, Jan 5, 2011 at 10:05 AM, Ran Tavory wrote: >> > In storage-conf I see this comment [1] from which I understand that the >> > recommended way to bootstrap a new node is

Re: Bootstrapping taking long

2011-01-05 Thread Ran Tavory
e problem (Don't know > if you have to redeploy your cluster though) > > http://www.mail-archive.com/user@cassandra.apache.org/msg07106.html > > > > On Wed, Jan 5, 2011 at 4:29 PM, Ran Tavory wrote: > >> @Thibaut wrong email? Or how's "Avoid dropping messages of

Re: Bootstrapping taking long

2011-01-05 Thread Ran Tavory
nough that they were dropping messages > older than RPC_TIMEOUT to cope, it could lose part of the bootstrap > communication permanently. > > On Wed, Jan 5, 2011 at 10:01 AM, Ran Tavory wrote: >> OK, thanks, so I see we had the same problem (I too had multiple keyspace, >> not t

Re: maven cassandra plugin

2011-01-06 Thread Ran Tavory
Stephen, just FYI cassandra cannot be stopped cleanly. It's jvm must be taken down. So the plugin would need to probably fork a jvm and kill it when it's done. On Thursday, January 6, 2011, B. Todd Burruss wrote: > > > > > > > would u like some testers?  we were about to write one. > > On

Re: Do you have a site in production environment with Cassandra? What client do you use?

2011-01-14 Thread Ran Tavory
I use Hector, if that counts. .. On Jan 14, 2011 7:25 PM, "Ertio Lew" wrote: > Hey, > > If you have a site in production environment or considering so, what > is the client that you use to interact with Cassandra. I know that > there are several clients available out there according to the > lang

Re: Do you have a site in production environment with Cassandra? What client do you use?

2011-01-14 Thread Ran Tavory
Java On Jan 14, 2011 8:25 PM, "Ertio Lew" wrote: > what is the technology stack do you use? > > On 1/14/11, Ran Tavory wrote: >> I use Hector, if that counts. .. >> On Jan 14, 2011 7:25 PM, "Ertio Lew" wrote: >>> Hey, >>> >>>

Re: 0.7.0 mx4j, get attribute

2011-02-03 Thread Ran Tavory
Try adding this to the end of the URL: ?template=identity On Thu, Feb 3, 2011 at 4:23 PM, Chris Burroughs wrote: > On 02/02/2011 01:41 PM, Ryan King wrote: > > On Wed, Feb 2, 2011 at 10:40 AM, Chris Burroughs > > wrote: > >> I'm using 0.7.0 and experimenting with the new mx4j support. > >> > >>

performance tuning - where does the slowness come from?

2010-05-03 Thread Ran Tavory
I'm looking into performance issues on a 0.6.1 cluster. I see two symptoms: 1. Reads and writes are slow 2. One of the hosts is doing a lot of GC. 1 is slow in the sense that in normal state the cluster used to make around 3-5k read and writes per second (6-10k operations per second), but how it's

Re: performance tuning - where does the slowness come from?

2010-05-04 Thread Ran Tavory
lance the load evenly between nodes in cluster just manually >> fix >> > their token.(the "formula" is i * 2^127 / nb_nodes). >> > >> > Jordzn >> > >> > On Tue, May 4, 2010 at 8:20 AM, Ran Tavory wrote: >> >> >> >> I&#

Re: performance tuning - where does the slowness come from?

2010-05-04 Thread Ran Tavory
It won't make you swap more > than using buffered i/o. > > On Tue, May 4, 2010 at 1:57 PM, Ran Tavory wrote: > > I canceled mmap and indeed memory usage is sane again. So far performance > > hasn't been great, but I'll wait and see. > > I'm also intere

Re: cassandra jvm crash in GCTaskThread

2010-05-05 Thread Ran Tavory
://java.sun.com/javase/6/webnotes/6u18.html > > Note that there is an apparent work-around jvm option to add: > -XX:-ReduceInitialCardMarks > > -Nate > > > On Tue, May 4, 2010 at 11:24 PM, Ran Tavory wrote: > > Running a cluster of 0.6.1, one of the hosts crashed duri

Re: cassandra jvm crash in GCTaskThread

2010-05-05 Thread Ran Tavory
> > http://java.sun.com/javase/6/webnotes/6u18.html > > Note that there is an apparent work-around jvm option to add: > -XX:-ReduceInitialCardMarks > > -Nate > > > On Tue, May 4, 2010 at 11:24 PM, Ran Tavory wrote: > > Running a cluster of 0.6.1, one of the host

Re: performance tuning - where does the slowness come from?

2010-05-05 Thread Ran Tavory
I haven't tried it. On May 6, 2010 1:22 AM, "Mark Greene" wrote: Ran, Did you find differing results from stress.py? -Mark On Wed, May 5, 2010 at 5:59 PM, Ran Tavory wrote: > > let's see if I can make s...

Re: performance tuning - where does the slowness come from?

2010-05-06 Thread Ran Tavory
't fancy, not sure exactly what hardware we have there but it's "commodity"... Thanks! On Thu, May 6, 2010 at 5:22 PM, Jonathan Ellis wrote: > columns, not CFs. > > put another way, how wide are the rows in the slow CF? > > On Wed, May 5, 2010 at 11:30 PM, Ran

Re: Tuning Cassandra

2010-05-10 Thread Ran Tavory
Hector uses tsocket. not sure what you mean by "buffered" - is that framed? Hector by default does not use framed. The code is here if you'd like to have a look http://github.com/rantav/hector/blob/master/src/main/java/me/prettyprint/cassandra/service/CassandraClientFactory.java#L77

Re: what/how do you guys monitor "slow" nodes?

2010-05-12 Thread Ran Tavory
There is a per cf read and write latency jmx. On May 12, 2010 12:55 AM, "Jordan Pittier - Rezel" wrote: For sure you have to pay particular attention to memory allocation on each node, especially be sure your servers dont swap. Then you can monitor how load are balanced among your nodes (nodetoo

Re: JMX metrics for monitoring

2010-05-17 Thread Ran Tavory
There are many, but here's what I found useful so far: Per CF you have: - Recent read/write latency - PendingTasks - Read/Write count Globally you have, for each of the stages (e.g. org.apache.cassandra.concurrent:type=ROW-READ-STAGE): - PendingTasks - ActiveCount ... and as you go you'll find mo

mapreduce from cassandra to cassandra

2010-05-18 Thread Ran Tavory
In the wordcount example the process reads from cassandra and the result is written to a local file at /tmp/word_count* Is it possible to read from cassandra and write the result back to cassandra to a specified cf/row/column? I see that there exists a ColumnFamilyInputFormat but not ColumnFamilyO

Re: mapreduce from cassandra to cassandra

2010-05-18 Thread Ran Tavory
hbase - yes. But is that reusable for cassandra? On Tue, May 18, 2010 at 12:17 PM, Jeff Zhang wrote: > I believe it is possible to write result back to cassandra. If I > remember correctly, HBase has both InputFormat and OutputFormat for > hadoop. > > > > > On Tue, May

decommission and org.apache.thrift.TApplicationException: get_slice failed: unknown result

2010-05-18 Thread Ran Tavory
What's the correct way to remove a node from a cluster? According to this page http://wiki.apache.org/cassandra/Operations a decommission call should be enough. When decommissioning one of the nodes from my cluster I see an error in the client: org.apache.thrift.TApplicationException: get_slice f

ConcurrentModificationException in gossiper while decommissioning another node

2010-05-18 Thread Ran Tavory
While the node 192.168.252.61 was in the process of decommissioning I see this error in two other nodes: INFO [Timer-1] 2010-05-18 06:01:12,048 Gossiper.java (line 179) InetAddress /192.168.252.62 is now dead. INFO [GMFD:1] 2010-05-18 06:04:00,189 Gossiper.java (line 568) InetAddress /192.168.25

Re: is it possible to trace/debug cassandra?

2010-05-18 Thread Ran Tavory
Add to cassandra.in.sh -Xdebug -Xrunjdwp:transport=dt_socket,address=8000,server=y,suspend=n to the JVM_OPTS section. Then connect with jdb ( http://java.sun.com/j2se/1.3/docs/tooldocs/solaris/jdb.html) or your IDE as a remote process On Tue, May 18, 2010 at 1:18 PM, S Ahmed wrote: > Would it b

Re: Disk usage doubled after nodetool decommission and node still in ring

2010-05-19 Thread Ran Tavory
Run nodetool streams. On May 18, 2010 4:14 PM, "Maxim Kramarenko" wrote: Hi! After nodetool decomission data size on all nodes grow twice, node still up and in ring, and no streaming now / tmp SSTables now. BTW, I have ssh connection to server, so after run nodetool decommission I expect, that

Re: ConcurrentModificationException in gossiper while decommissioning another node

2010-05-19 Thread Ran Tavory
dmeijer > > > On Tue, May 18, 2010 at 1:55 PM, Ran Tavory wrote: > >> While the node 192.168.252.61 was in the process of decommissioning I see >> this error in two other nodes: >> >> INFO [Timer-1] 2010-05-18 06:01:12,048 Gossiper.java (line 179) >> InetA

Re: decommission and org.apache.thrift.TApplicationException: get_slice failed: unknown result

2010-05-19 Thread Ran Tavory
t something at the transport layer had > trouble. Possibly some kind of thrift hiccup that we haven't seen > before. > > Your description makes it sound as if the decommission is proceeding > normally though. > > Gary. > > On Tue, May 18, 2010 at 04:42, Ran Tavory wrote

how to decommission two slow nodes?

2010-05-19 Thread Ran Tavory
In my cluster setup I have two datacenters with 5 hosts in one DC and 3 in the other. In the 5 hosts DC I'd like to remove two hosts so I'd get 3 and 3 in each. The two nodes I'd like to decommission have less RAM than the other 3 so they operate slower. What's the most effective way to decommissio

Re: how to decommission two slow nodes?

2010-05-20 Thread Ran Tavory
. On Wed, May 19, 2010 at 11:57 PM, Ran Tavory wrote: > In my cluster setup I have... -- Jonathan Ellis Project Chair, Apache Cassandra co-founder of Riptano, the source for professional Cassandra support http://riptano.com

Re: how to decommission two slow nodes?

2010-05-21 Thread Ran Tavory
stead, kill the 2 nodes you want to remove, and then do removetoken, which would still do extra i/o but at least the slow nodes would not be involved. On Thu, May 20, 2010 at 8:54 PM, Ran Tavory wrote: > I forgot to mention that th... -- Jonathan Ellis Project Chair, Apache Cassandra

Re: oom in ROW-MUTATION-STAGE

2010-05-22 Thread Ran Tavory
HH rows can get large) > > On Fri, May 21, 2010 at 2:36 PM, Ran Tavory wrote: > > I see some OOM on one of the hosts in the cluster and I wonder if there's > a > > formula that'll help me calculate what's the required memory setting > given > > the paramete

Re: oom in ROW-MUTATION-STAGE

2010-05-23 Thread Ran Tavory
28 AE-SERVICE-STAGE 1 1 8 HINTED-HANDOFF-POOL 0 0 10 On Sat, May 22, 2010 at 11:05 PM, Ran Tavory wrote: > The message deserializer has 10m pending tasks before the oom. What do you > think makes the message deseriali

Re: oom in ROW-MUTATION-STAGE

2010-05-23 Thread Ran Tavory
: > looks like reads are backing up, which in turn is making deserialize back > up > > On Sun, May 23, 2010 at 4:25 AM, Ran Tavory wrote: > > Here's tpstats on a server with traffic that I think will get OOM > shortly. > > We have 4k pending reads and 123k pend

Re: oom in ROW-MUTATION-STAGE

2010-05-23 Thread Ran Tavory
at 8:15 PM, Jonathan Ellis wrote: > On Sun, May 23, 2010 at 10:59 AM, Ran Tavory wrote: > > Is there another solution except adding capacity? > > Either you need to get more performance/node or increase node count. :) > > > How does the ConcurrentReads (default 8) affect

Key cache capacity: 1 when using KeysCached="50%"

2010-05-23 Thread Ran Tavory
I've noticed that when defining KeysCached="50%" (or KeysCached="100%" and I didn't test other values with %) then cfstats reports Key cache capacity: 1 This looks weird... is this expected? (version 0.6.1) For example, in the default configuration: Keyspace: Keyspace1

setcachecapacity is forgotten

2010-05-24 Thread Ran Tavory
I use nodetool to set cache capacity on a certain node but the settings are "forgotten" after a few minutes. I run: $ nodetool -h localhost -p 9004 setcachecapacity outbrain_kvdb KvImpressions 1000 100 And then run nodetool cfstats immediately after and the settings are effective, I see t

Re: Key cache capacity: 1 when using KeysCached="50%"

2010-05-24 Thread Ran Tavory
24, 2010 at 6:30 PM, Jonathan Ellis wrote: > If you really want a cache capacity of 0 then you need to use 0 > explicitly, otherwise the % versions will give you at least 1. > > On Mon, May 24, 2010 at 12:34 AM, Ran Tavory wrote: > > I've noticed that when defining Keys

Is there a way to turn HH off?

2010-05-24 Thread Ran Tavory
For small clusters Hinted Handoff cost is not negligible. I'd like to test its effect. Is there a way to turn it off for my cluster?

Re: Is there a way to turn HH off?

2010-05-24 Thread Ran Tavory
ah, 0.6.2, worth waiting for... On Tue, May 25, 2010 at 7:52 AM, Jonathan Ellis wrote: > https://issues.apache.org/jira/browse/CASSANDRA-894 > > On Mon, May 24, 2010 at 11:30 PM, Ran Tavory wrote: > > For small clusters Hinted Handoff cost is not negligible. I'd like to &

Re: Hector vs cassandra-java-client

2010-05-24 Thread Ran Tavory
cassandra-java-client is up to cassandra's 0.4.2 version, so you probably can't use it out of the box. Hector is active and up to the latest 0.6.1 release with a bunch of committers, contributors and users. See http://wiki.github.com/rantav/hector/ and http://groups.google.com/group/hector-users O

Error reporting Key cache hit rate with cfstats or with JMX

2010-05-25 Thread Ran Tavory
It seems there's an error reporting the Key cache hit rate. The value is always 0.0 and I have a feeling it's incorrect. This is seen both by using notetool cfstats as well as accessing JMX directly (org.apache.cassandra.db:type=Caches,keyspace=outbrain_kvdb,cache=KvAdsKeyCache RecentHitRate)

Re: Key cache capacity: 1 when using KeysCached="50%"

2010-05-25 Thread Ran Tavory
https://issues.apache.org/jira/browse/CASSANDRA-1129 On Tue, May 25, 2010 at 3:42 PM, Jonathan Ellis wrote: > That does look like a bug. Can you create a ticket and upload a > (preferably small-ish) sstable that illustrates the problem? > > On Mon, May 24, 2010 at 12:07 PM, Ran T

Re: Hector samples -- where?

2010-05-25 Thread Ran Tavory
http://wiki.github.com/rantav/hector/examples On May 25, 2010 10:43 PM, "Asaf Lahav" wrote: Hi, Where can I find Hector code samples?

Re: RE: Hector samples -- where?

2010-05-25 Thread Ran Tavory
The best examples are in KeyspaceTest but don't include all scenarios On May 26, 2010 2:27 AM, "Nicholas Sun" wrote: I am also interested in this. It seems like adding multiple Cols into a CF or SuperCols would be very useful. Like a dataload type capability? Nick -Original Message-

Re: RE: Hector samples -- where?

2010-05-25 Thread Ran Tavory
it's here http://github.com/rantav/hector/blob/master/src/test/java/me/prettyprint/cassandra/service/KeyspaceTest.java On Wed, May 26, 2010 at 8:18 AM, Nicholas Sun wrote: > Could you please provide some indication as to their location? Thanks. > > > > Nick > &

nodetool move looks stuck

2010-05-26 Thread Ran Tavory
I ran nodetool move on one of the nodes and it seems stuck for a few hours now. I've been able to run it successfully in the past, but this time it looks stuck. Streams shows as if there's work in progress, but the same files have been at the same position for a few hours. I've also checked the c

Re: Questions regarding batch mutates and transactions

2010-05-26 Thread Ran Tavory
The summary of your question is: is batch_mutate atomic in the general sense, meaning when used with multiple keys, multiple column families etc, correct? On Wed, May 26, 2010 at 12:45 PM, Todd Nine wrote: > Hey guys, > I originally asked this on the Hector group, but no one was sure of the >

Re: Error reporting Key cache hit rate with cfstats or with JMX

2010-05-26 Thread Ran Tavory
On Tue, May 25, 2010 at 3:43 PM, Jonathan Ellis wrote: > What happens if you disable row cache? > > On Tue, May 25, 2010 at 4:53 AM, Ran Tavory wrote: > > It seems there's an error reporting the Key cache hit rate. The value is > > always 0.0 and I have a feeling it

Re: Error reporting Key cache hit rate with cfstats or with JMX

2010-05-26 Thread Ran Tavory
> It sure sounds like you're seeing the "my row cache contains the > entire hot data set, so the key cache only gets the cold reads" > effect. > > On Wed, May 26, 2010 at 2:54 PM, Ran Tavory wrote: > > If I disable row cache the numbers look good - key cache

Re: How to drop a column family (Cassandra 6.1)

2010-05-31 Thread Ran Tavory
1. nodetool flush 2. stop server 3. delete all files (data, index, filter) 4. start server note that this will delete the data, not the CF definition (not like "drop table" in sql-ish). system_drop_column_family will drop the CF definition truncate (available from 0.7) will delete the data, whic

HintedHandoffEnabled

2010-05-31 Thread Ran Tavory
In 0.6.2 I disabled hinted handoff, however tpstats and cfstats report seems odd. On all servers in the cluster I have: false tpstats reports 5 completed handoffs. $ nodetool -h cass25 -p 9004 tpstats Pool NameActive Pending Completed FILEUTILS-DELETE-POOL

nodetool cleanup isn't cleaning up?

2010-05-31 Thread Ran Tavory
I hope I understand nodetool cleanup correctly - it should clean up all data that does not (currently) belong to this node. If so, I think it might not be working correctly. Look at nodes 192.168.252.124 and 192.168.252.99 below 192.168.252.99Up 279.35 MB 35446079887597756610768188274

Re: nodetool cleanup isn't cleaning up?

2010-05-31 Thread Ran Tavory
t. > ... > Here, we defined a constant, GCGraceSeconds, and had each node track > tombstone age locally. Once it has aged past the constant, it can be GC'd. > === > > > > On 31.05.2010 16:23, Ran Tavory wrote: > >> I hope I understand nodetool cleanup correctly -

Re: HintedHandoffEnabled

2010-05-31 Thread Ran Tavory
attempt to deliver existing ones unless you manually > delete the hint files in the system/ dir. > > On Mon, May 31, 2010 at 6:04 AM, Ran Tavory wrote: > > In 0.6.2 I disabled hinted handoff, however tpstats and cfstats report > seems > > odd. > > On all servers in t

Re: nodetool cleanup isn't cleaning up?

2010-05-31 Thread Ran Tavory
yes, replication factor = 2 On Mon, May 31, 2010 at 10:07 PM, Jonathan Ellis wrote: > you have replication factor > 1 ? > > On Mon, May 31, 2010 at 7:23 AM, Ran Tavory wrote: > > I hope I understand nodetool cleanup correctly - it should clean up all > data > > that

Re: nodetool cleanup isn't cleaning up?

2010-05-31 Thread Ran Tavory
lis wrote: > well, there you are then. > > On Mon, May 31, 2010 at 2:34 PM, Ran Tavory wrote: > > yes, replication factor = 2 > > > > On Mon, May 31, 2010 at 10:07 PM, Jonathan Ellis > wrote: > >> > >> you have replication factor > 1 ? > >> &g

Re: nodetool cleanup isn't cleaning up?

2010-06-01 Thread Ran Tavory
the same DC). On Tue, Jun 1, 2010 at 4:00 PM, Jonathan Ellis wrote: > I'm saying that .99 is getting a copy of all the data for which .124 > is the primary. (If you are using RackUnawarePartitioner. If you are > using RackAware it is some other node.) > > On Tue, Jun 1, 2010 at

Re: nodetool cleanup isn't cleaning up?

2010-06-02 Thread Ran Tavory
15 AM, Jonathan Ellis wrote: > Then the next step is to check StorageService.getRangeToEndpointMap via jmx > > On Tue, Jun 1, 2010 at 11:56 AM, Ran Tavory wrote: > > I'm using RackAwareStrategy. But it still doesn't make sense I think... > > let's see what di

Re: Number of client connections

2010-06-02 Thread Ran Tavory
as far as I know, only the os level limitations, e.g. typically ~60k On Thu, Jun 3, 2010 at 9:34 AM, Lev Stesin wrote: > Hi, > > Is there a limit on the number of client connections to a node? Thanks. > > -- > Lev >

Re: Embedded usage

2010-06-04 Thread Ran Tavory
Cassandra expects a config file and does not expose an alternative API, for this file, that's correct. I think it's not hard to add such API but so far the demand for it didn't exist. On Jun 4, 2010 8:01 PM, "Sten Roger Sandvik" wrote: 2010/6/4 Jonathan Ellis > > look at o.a.c.service.Embedd

Re: Tree Search in Cassandra

2010-06-06 Thread Ran Tavory
sounds interesting... btree on top of cassandra ;) On Sun, Jun 6, 2010 at 12:16 PM, David Boxenhorn wrote: > I'm still thinking about the problem of how to handle range queries on very > large sets of data, using Random Partitioning. > > Has anyone used tree search to solve this? What do you thi

Re: Tree Search in Cassandra

2010-06-06 Thread Ran Tavory
s >> some level of atomic operations or locking, since updates are >> multi-step operations. From server side I guess it would be similar to >> work on allowing atomic multi-part operations (like ones being worked >> on to implement counters?). >> >> -+ Tatu +

Re: Is ReplicationFactor values number of replicas or number of copies of data?

2010-06-07 Thread Ran Tavory
to have two copies you need RF=2. RF=0 doesn't make sense as far as I understand it. On Mon, Jun 7, 2010 at 2:16 PM, Per Olesen wrote: > Hi, > > I am unclear about what the ReplicationFactor value means. > > Does RF=1 mean that there is only one single node that has the data in the > cluster (ac

Re: Passing client as parameter

2010-06-09 Thread Ran Tavory
Some languages have higher level clients that might help you. What language are you using? On Jun 9, 2010 9:01 PM, "Steven Haar" wrote: What is the best way to pass a Cassandra client as a parameter? If you pass it as a parameter, do you also have to pass the transport in order to be able to clo

Re: Passing client as parameter

2010-06-10 Thread Ran Tavory
ansport, create a client, make some cassandra operations and then close the transport. On Wed, Jun 9, 2010 at 10:35 PM, Steven Haar wrote: > C# > > > On Wed, Jun 9, 2010 at 2:34 PM, Ran Tavory wrote: > >> Some languages have higher level clients that might help you. What >>

Re: cassandra out of heap space crash

2010-06-10 Thread Ran Tavory
I can't say exactly how much memory is the correct amount, but surely 1G is very little. By replicating 3 times your cluster now makes 3 times more work than it used to do, both on reads and on writes while the readers/writers continue hammering it the same pace. So once you've upped your memory (

Re: cassandra out of heap space crash

2010-06-11 Thread Ran Tavory
Gary fwiw I get oom with Cl one quite commonly if I'm not careful with my writes On Jun 11, 2010 8:48 PM, "Jonathan Ellis" wrote: We give you enough rope to hang yourself. Don't use ZERO if that's not what you want. :) On Fri, Jun 11, 2010 at 9:23 AM, William Ashley wrote: > Would it be reas

Re: Pelops - a new Java client library paradigm

2010-06-12 Thread Ran Tavory
Nice going, Dominic, having a clear API for cassandra is a big step forward :) Interestingly, at hector we came up with similar approach, just didn't find the time for code that, as production systems keep me busy at nights as well... We started with the implementation of BatchMutation, but the res

Re: batch_mutate atomic?

2010-06-14 Thread Ran Tavory
no, it's not atomic. it just shortens the roundtrip of many update requests. Some may fail and some may succeed On Mon, Jun 14, 2010 at 2:40 PM, Per Olesen wrote: > Can I expect batch_mutate to work in what I would think of as an atomic > operation? > > That either all the mutations in the batch

Re: Client connection and data distribution across nodes

2010-06-16 Thread Ran Tavory
On Thu, Jun 17, 2010 at 8:52 AM, Mubarak Seyed wrote: > Hi All, > > Regarding client thrift connection, i have 4 nodes which formed a ring, but > client only knows the IP address of an one node (and thrift RPC port > number), > how does client can connect to any one other node without getting rin

Re: Best documentation for Java and Cassandra?

2010-06-17 Thread Ran Tavory
I can offer Hector which I've authored and maintain with the help of a few other folks http://wiki.github.com/rantav/hector/ http://github.com/rantav/hector Feel free to post questions to our mailing list http://groups.google.com/group/hector-users On Thu, Jun 17, 2010 at 10:04 AM, Anthony Ikeda

Re: Cassandra questions

2010-06-17 Thread Ran Tavory
On Thu, Jun 17, 2010 at 9:09 PM, F. Hugo Zwaal wrote: > Hi, > > Being fairly new to Cassandra I have a couple of questions: > > 1) Is there a way to remove multiple keys/rows in one operation (batch) or > must keys be removed one by one? > yes, batch_mutate > 2) I see API references to version 0

Re: 3-node balanced system

2010-06-17 Thread Ran Tavory
+ user, - dev (bcc actually) If you use a random partitioner use the following InitialToken for your nodes: $ bc bc 1.06 Copyright 1991-1994, 1997, 1998, 2000 Free Software Foundation, Inc. This is free software with ABSOLUTELY NO WARRANTY. For details type `warranty'. (2^127)/3 *5671372782015641

Re: what is the best way to truncate a column family

2010-06-18 Thread Ran Tavory
it will be immediate. But it will fail if not all hosts in the cluster are up, this is the tradeoff. We regard the truncate operation an admin api so I think it's a fair tradeoff. On Fri, Jun 18, 2010 at 11:50 PM, Phil Stanhope wrote: > In 0.6.x the iterating approach works ... but you need to f

Re: Instability and memory problems

2010-06-20 Thread Ran Tavory
I don't have the answer but if you provide jmap output, cfstats output that may help. Are you using mmap files? Do you see swap? Gc in the logs? On Jun 20, 2010 7:25 PM, "James Golick" wrote: As I alluded to in another post, we just moved from 2-4 nodes. Since then, the cluster has been incredib

Re: Hector - Java doc

2010-06-22 Thread Ran Tavory
There isn't an online javadoc page, but the code is online and well documented and there's a wiki and all sorts of documents and examples http://github.com/rantav/hector/blob/master/src/main/java/me/prettyprint/cassandra/service/Keyspace.java http://wiki.github.com/rantav/hector/ On Wed, Jun 23, 2

Re: hector or pelops

2010-06-23 Thread Ran Tavory
As the developer of hector I can only speak in favor of my child of love and I haven't tried pelops so take the following with a grain of salt... Hector sees wide adoption and has been coined the de-facto java client. It's been in use in production critical systems since version 0.5.0 by a few comp

Re: hector or pelops

2010-06-23 Thread Ran Tavory
of tests for each API call, do you have some > code that will help me build that. If my code ends up useful I intend to > publish it on my website for others to use. > > Regards > Gavan > > On Thu, Jun 24, 2010 at 4:43 AM, Ran Tavory wrote: > >> As the developer of

  1   2   >