Re: disk full and COMMIT-LOG-WRITER ?

2011-09-08 Thread Yang
ok, found past discussions: http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/cassandra-server-disk-full-td6560725.html On Thu, Sep 8, 2011 at 11:00 PM, Yang wrote: > I found the reason of my server freeze: > > COMMIT-LOG-WRITER thread is gone, dead, so the blocking queue in > Per

disk full and COMMIT-LOG-WRITER ?

2011-09-08 Thread Yang
I found the reason of my server freeze: COMMIT-LOG-WRITER thread is gone, dead, so the blocking queue in PeriodicCommitLogExecutorService is full, then all mutationStage jobs are stuck on the mutations flushing. the COMMIT-LOG-WRITER thread died because at one time the disk was full, I cleaned up

Re: Anybody out there using 0.8 in production

2011-09-08 Thread Jeremy Hanna
We run 0.8 in production and it's been working well for us. There are some new settings that we had to tune for - for example, the default concurrent compaction is the number of cores. We had to tune that down because we also run hadoop jobs on our nodes. On Sep 8, 2011, at 4:44 PM, Anand Som

Re: Anybody out there using 0.8 in production

2011-09-08 Thread Chris Goffinet
Twitter runs 0.8 in production/closer to trunk. No big issues from us. On Thu, Sep 8, 2011 at 8:53 PM, Eric Czech wrote: > We just migrated from .7.5 to .8.4 in our production environment and it was > definitely the least painful transition yet (coming all the way from the .4 > release series).

Re: Anybody out there using 0.8 in production

2011-09-08 Thread Eric Czech
We just migrated from .7.5 to .8.4 in our production environment and it was definitely the least painful transition yet (coming all the way from the .4 release series). It's been about a week for us but so far so good. On Thu, Sep 8, 2011 at 9:25 PM, Dominic Williams < dwilli...@fightmymonster.co

Re: Anybody out there using 0.8 in production

2011-09-08 Thread Dominic Williams
Hi I've just migrated to 0.8.5 and from first looks it is a giant leap forward - better use of CPU and memory - able to scrub files previously unfixable on 0.7.6-2 etc On 9 September 2011 01:45, Anthony Ikeda wrote: > We plan to and have been using it in Dev and QA. There are some bugs that > hav

mysterious data disappearance - what happened?

2011-09-08 Thread Yiming Sun
Hello, If two different instances of cassandra are running on separate machines, but both are unfortunately configured to use the default cluster name, "Test Cluster", do they gang up as one cluster (even though they were intended to be two separate stand-alone instances), so that dropping keyspac

what's the difference between repair CF separately and repair the entire node?

2011-09-08 Thread Yan Chunlu
I have 3 nodes and RF=3. I tried to repair every node in the cluster by using "nodetool repair mykeyspace mycf" on every column family. it finished within 3 hours, the data size is no more than 50GB. after the repair, I have tried using nodetool repair immediately to repair the entire node, but

Re: Anybody out there using 0.8 in production

2011-09-08 Thread Anthony Ikeda
We plan to and have been using it in Dev and QA. There are some bugs that have been fixed that we are looking forward to in 0.8.5 and probably that would be the better build for production (there is a quorum bug that we will need). Otherwise no other 0.8 issues that we are aware of. We did go thro

Re: Question on using consistency level with NetworkTopologyStrategy

2011-09-08 Thread Jonathan Ellis
CL.QUORUM is supported with any replication strategy, not just simple. Also, Cassandra's optimizing of cross-DC writes only requires that it know (via a correctly configured Snitch) where each node is located. It is not affected by replication strategy choice. On Thu, Sep 8, 2011 at 3:14 PM, Anan

Re: Question on using consistency level with NetworkTopologyStrategy

2011-09-08 Thread Edward Capriolo
On Thu, Sep 8, 2011 at 4:14 PM, Anand Somani wrote: > Hi, > > Have a requirement, where data is spread across multiple DC for disaster > recovery. So I would use the NTS, that is clear, but I have some questions > with this scenario > >- I have 2 Data Centers >- RF - 2 (active DC) , 2 (pa

Re: possible feature request RP vs. OPP

2011-09-08 Thread Edward Capriolo
On Thu, Sep 8, 2011 at 5:12 PM, Dean Hiller wrote: > I was wondering something. Since I can take OPP and I can create a layer > that for certain column families, I hash the key so that some column > families are just like RP but on top of OPP and some of my other column > families are then on OP

Anybody out there using 0.8 in production

2011-09-08 Thread Anand Somani
Hi Currently we are using 0.7.4 and was wondering if I should upgrade to 0.7.8/9 or move to 0.8? Is anybody using 0.8 in production and what is their experience? Thanks

possible feature request RP vs. OPP

2011-09-08 Thread Dean Hiller
I was wondering something. Since I can take OPP and I can create a layer that for certain column families, I hash the key so that some column families are just like RP but on top of OPP and some of my other column families are then on OPP directly so I could use lucandra, why not make RP deprecate

Re: question on setup for writes into 2 datacenters

2011-09-08 Thread Dean Hiller
thanks this is perfect...yeah, we want 4 nodes(2 in each data center) so just a bit more than 3 ;) so we can have both datacenters be active at the same time and not have to write to the other datacenter except in the background. This is perfect..hbase couldn't do this which I wanted badly alo

Re: no stack trace :(

2011-09-08 Thread Dean Hiller
to close on this issue. I just had whatever the default cassandra 0.8.3 untar had and fixing DNS resolved the issue for me. My computer was not in DNS so I guess it has to be when trying a multinode install. thanks, Dean On Sat, Aug 6, 2011 at 10:10 PM, aaron morton wrote: > Do you have MX4J in

Re: row key as long type

2011-09-08 Thread Jonathan Ellis
Probably because you had some non-long data in it, then added the long type later. On Thu, Sep 8, 2011 at 2:51 PM, amulya rattan wrote: > But I explicitly remember List throwing "long is exactly 8 bytes" when i > invoked on a column family with long as key. > Why would that happen? > > > On Thu,

Question on using consistency level with NetworkTopologyStrategy

2011-09-08 Thread Anand Somani
Hi, Have a requirement, where data is spread across multiple DC for disaster recovery. So I would use the NTS, that is clear, but I have some questions with this scenario - I have 2 Data Centers - RF - 2 (active DC) , 2 (passive DC) - with NTS - Consistency level options are - LOCAL_QUOR

Re: row key as long type

2011-09-08 Thread amulya rattan
But I explicitly remember List throwing "long is exactly 8 bytes" when i invoked on a column family with long as key. Why would that happen? On Thu, Sep 8, 2011 at 10:07 AM, Jonathan Ellis wrote: > List should work fine on any schema, including long keys. > > > On Thu, Sep 8, 2011 at 8:23 AM, am

Re: Replicate On Write behavior

2011-09-08 Thread David Hawthorne
It was exactly due to 2890, and the fact that the first replica is always the one with the lowest value IP address. I patched cassandra to pick a random node out of the replica set in StorageProxy.java findSuitableEndpoint: Random rng = new Random(); return endpoints.get(rng.nextInt(endpoints.

Re: requests should be disabled when doing compaction?

2011-09-08 Thread Jonathan Ellis
(1) You should plan enough capacity that you can handle normal compaction load while serving requests (1a) You can control how aggressive compaction is using the compaction_throughput_mb_per_sec setting (2) It is not recommended to force compaction of everything using nodetool compact On Thu, Sep

requests should be disabled when doing compaction?

2011-09-08 Thread Yang
I did a nodetool compact while the Cassandra server was serving requests, it almost never finishes, and the service latency becomes very long then I terminated the traffic, compaction finishes shortly after. should I disable client requests to a node being compacted while I carry out the compacti

Re: [RELEASE] Apache Cassandra 0.8.5 released

2011-09-08 Thread Anthony Ikeda
Cool, all those fixes will be going into our prod env! Congrats! Anthony On Thu, Sep 8, 2011 at 7:51 AM, Stephen Connolly < stephen.alan.conno...@gmail.com> wrote: > can take up to 12 hours for the sync to central > > - Stephen > > --- > Sent from my Android phone, so random spelling mistakes, r

Re: weird freeze

2011-09-08 Thread Yang
ok, found this is likely due to GC... I'm seeing full GC that runs 20seconds without actually removing anything: 35240.526: [Full GC [PSYoungGen: 1760704K->1668729K(1848128K)] [PSOldGen: 4095999K->4095999K(4096000K)] 5856703K->5764729K(5944128K) [PSPermGen: 24885K->24871K(25152K)], 20.6780790 sec

Re: SIGSEGV during compaction?

2011-09-08 Thread Yang
this is trunk. sorry I did more tests, the -XX:-UseCompressedOops suggested by Jonathan actually DOES solve the problem. my previous tries possibly used the wrong scripts. Thanks guys Yang On Thu, Sep 8, 2011 at 12:07 AM, Sylvain Lebresne wrote: > Are you using current trunk ? Or 0.8 ? > > B

Re: Read times on Cassandra

2011-09-08 Thread Jonathan Ellis
What is your bottleneck? spyced.blogspot.com/2010/01/linux-performance-basics.html On Thu, Sep 8, 2011 at 10:14 AM, Ben Ashton wrote: > Hi Guys, > > I have our multi DC setup running on EC2. > > Latency on the connections between the servers is <2ms, monitored > writes are ~3-4ms, reads are 50 t

Read times on Cassandra

2011-09-08 Thread Ben Ashton
Hi Guys, I have our multi DC setup running on EC2. Latency on the connections between the servers is <2ms, monitored writes are ~3-4ms, reads are 50 to 2200 ms, normally ~80ms Is this normal? I know there are some optimisation java guys can do with hector, but is there anything I can do on a sys

Re: [RELEASE] Apache Cassandra 0.8.5 released

2011-09-08 Thread Stephen Connolly
can take up to 12 hours for the sync to central - Stephen --- Sent from my Android phone, so random spelling mistakes, random nonsense words and other nonsense are a direct result of using swype to type on the screen On 8 Sep 2011 06:40, "Roshan Dawrani" wrote: > Hi, > > The artefacts at > http:

Re: row key as long type

2011-09-08 Thread Jonathan Ellis
List should work fine on any schema, including long keys. On Thu, Sep 8, 2011 at 8:23 AM, amulya rattan wrote: > Row key can certainly be of type long..you'd just have to set > key_validataion_class to be LongType. > > However, doing list on column family would throw an error..please look at > h

Re: [RELEASE] Apache Cassandra 0.8.5 released

2011-09-08 Thread Roshan Dawrani
Hi, The artefacts at http://mvnrepository.com/artifact/org.apache.cassandra/cassandra-all still do not reflect 0.8.5. Does the availability at maven lag behind by much? Cheers. On Thu, Sep 8, 2011 at 5:45 PM, Sylvain Lebresne wrote: > The Cassandra team is pleased to announce the release of Ap

Re: row key as long type

2011-09-08 Thread amulya rattan
Row key can certainly be of type long..you'd just have to set key_validataion_class to be LongType. However, doing list on column family would throw an error..please look at http://wiki.apache.org/cassandra/FAQ#a_long_is_exactly_8_bytes On Thu, Sep 8, 2011 at 8:14 AM, Thamizh wrote: > Hi All, >

Re: Secondary index update issue

2011-09-08 Thread Jonathan Ellis
If the queries are working successfully, I'm afraid I don't understand the problem. On Wed, Sep 7, 2011 at 12:48 PM, Thamizh wrote: > > Hi, > > Here is my sample row. I wanted to execute below 2 queries. CF:ip > 1. get ip where ip=19268678; > 2. get ip where domain='google.com'; > > Here both ip

Re: Exception encountered during startup.

2011-09-08 Thread Jonathan Ellis
See this thread: http://mail-archives.apache.org/mod_mbox/cassandra-user/201109.mbox/%3ccaldd-zhuflt3urdsk0ahsmqdj-n1keyxonn4rgzjjz13cag...@mail.gmail.com%3E On Thu, Sep 8, 2011 at 7:36 AM, Dikang Gu wrote: > I have a 4 cassandra 0.8.1 nodes in the cluster, one node crashes and I'm > trying to re

Re: [RELEASE] Apache Cassandra 0.8.5 released

2011-09-08 Thread Helder Oliveira
Congrats guys! On Sep 8, 2011, at 1:37 PM, Dikang Gu wrote: > Congratulations! > > On Thu, Sep 8, 2011 at 8:15 PM, Sylvain Lebresne wrote: > The Cassandra team is pleased to announce the release of Apache Cassandra > version 0.8.5. > > Cassandra is a highly scalable second-generation distribu

Exception encountered during startup.

2011-09-08 Thread Dikang Gu
I have a 4 cassandra 0.8.1 nodes in the cluster, one node crashes and I'm trying to restart it. But I encounter the following errors during the startup, is this a known bug? DEBUG [main] 2011-09-08 20:26:17,959 Table.java (line 305) Initializing system.NodeIdInfo DEBUG [main] 2011-09-08 20:26:17

Re: [RELEASE] Apache Cassandra 0.8.5 released

2011-09-08 Thread Dikang Gu
Congratulations! On Thu, Sep 8, 2011 at 8:15 PM, Sylvain Lebresne wrote: > The Cassandra team is pleased to announce the release of Apache Cassandra > version 0.8.5. > > Cassandra is a highly scalable second-generation distributed database, > bringing together Dynamo's fully distributed design an

Re: [RELEASE] Apache Cassandra 0.8.5 released

2011-09-08 Thread Roshan Dawrani
Hi, Looking forward to having 0.8.5 artifacts available from maven repo too. Cheers. On Thu, Sep 8, 2011 at 5:45 PM, Sylvain Lebresne wrote: > The Cassandra team is pleased to announce the release of Apache Cassandra > version 0.8.5. > > Cassandra is a highly scalable second-generation dist

[RELEASE] Apache Cassandra 0.8.5 released

2011-09-08 Thread Sylvain Lebresne
The Cassandra team is pleased to announce the release of Apache Cassandra version 0.8.5. Cassandra is a highly scalable second-generation distributed database, bringing together Dynamo's fully distributed design and Bigtable's ColumnFamily-based data model. You can read more here: http://cassand

row key as long type

2011-09-08 Thread Thamizh
Hi All, Is there a way to store number(longtype) as row key in Cassadra? I wanted to execute range query based on row key value. e.g $list info[12345:]; . It should list all the rowkeys which are >= 12345. Is there a way accompolish this in cassandra? Secondary index does not helped me. So I

Re: SIGSEGV during compaction?

2011-09-08 Thread Sylvain Lebresne
Are you using current trunk ? Or 0.8 ? Because if on trunk, a SIGSEGV could also be due to CASSANDRA-2521, if we happen to force the unmapping of a file but tries to access it afterwards (which shouldn't happen but ...). -- Sylvain On Thu, Sep 8, 2011 at 7:36 AM, Yang wrote: > h, all other