Re: Cassandra Hackathon?

2011-05-16 Thread Joseph Stein
awesome! this week I am (finally) getting cassandra (0.8) going for existing projects we have in production. Looking at https://issues.apache.org/jira/browse/CASSANDRA-2495 is a place I was thinking maybe I could start to help out but I am not sure that is the best starting point though it is a s

Re: Commitlog Disk Full

2011-05-16 Thread Sanjeev Kulkarni
It's now almost 4 hours. I still see commitlogs worth 1.2G on the machines. I see no activity. On Mon, May 16, 2011 at 6:33 PM, Sanjeev Kulkarni wrote: > After I updated the memtable_throughput, I stopped all my writing > processes. I did a du /commitlog to find how much was cassandra commitlog at

Re: Cassandra Hackathon?

2011-05-16 Thread Edward Capriolo
I had it on our list of ideas for the Cassandra NYC meetup. I am down for action. On Mon, May 16, 2011 at 9:40 PM, Joseph Stein wrote: > Any interest for a Cassandra Hackathon evening in NYC? Any committer(s) > going to be in the NYC area together that can lead/guide this? > > http://www.meet

Cassandra Hackathon?

2011-05-16 Thread Joseph Stein
Any interest for a Cassandra Hackathon evening in NYC? Any committer(s) going to be in the NYC area together that can lead/guide this? http://www.meetup.com/NYC-Cassandra-User-Group/events/18635801/ I have a thumbs up to use our office www.medialets.com in the Milk Studios building. It is a big

Re: Commitlog Disk Full

2011-05-16 Thread Sanjeev Kulkarni
After I updated the memtable_throughput, I stopped all my writing processes. I did a du /commitlog to find how much was cassandra commitlog at that time. For the three nodes it was around 1.4G each. I waited for about 30 minutes to see whether cassandra flushes things. When I look at du now, it sti
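The `du` check described in this message can be repeated from a script when several nodes need to be polled. The following is a minimal sketch only: the function name and any path passed to it are assumptions for illustration, not taken from the thread.

```python
import os


def directory_size_bytes(path):
    """Sum the sizes of all regular files under path, roughly like `du -sb`."""
    total = 0
    for root, _dirs, files in os.walk(path):
        for name in files:
            full_path = os.path.join(root, name)
            try:
                total += os.path.getsize(full_path)
            except OSError:
                # A segment may be deleted between listing and stat; skip it.
                continue
    return total
```

Calling `directory_size_bytes()` on a node's commitlog directory at intervals would show whether the total shrinks after a flush.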

Re: Commitlog Disk Full

2011-05-16 Thread Sanjeev Kulkarni
Hey guys, I have updated all my column families with 32 as the memtable_throughput. I will let you know how cassandra behaves. Thanks! On Mon, May 16, 2011 at 3:52 PM, mcasandra wrote: > You can try to update column family using cassandra-cli. Try to set > memtable_throughput to 32 first. > > [d

Re: How to configure internode encryption in 0.8.0?

2011-05-16 Thread Jeremy Hanna
Take a look at cassandra.yaml in your 0.8 download at the very bottom. There are docs and examples there. e.g. http://svn.apache.org/repos/asf/cassandra/tags/cassandra-0.8.0-beta2/conf/cassandra.yaml On May 16, 2011, at 6:36 PM, Sameer Farooqui wrote: > I understand that 0.8.0 has configurable

How to configure internode encryption in 0.8.0?

2011-05-16 Thread Sameer Farooqui
I understand that 0.8.0 has configurable internode encryption (CASSANDRA-1567, 2152). I haven't been able to find any info on how to configure it though on this mailing list or the Datastax website. Can somebody point me towards how to set this up? - Sameer

Counter Column family Cassandra 0.8 PHP Support ?

2011-05-16 Thread bhanu choudhary
I am using counters to read the counts on my website dynamically. I am looking for phpcassandra client(?) that supports counters natively. I was looking if any PHP developer could give me a lead in generating the PHP client code required using thrift. Thanks in advance!

Re: Using composite column names in the CLI

2011-05-16 Thread Sameer Farooqui
Cassandra wouldn't know that the column name is composite of two different things. So you could just request the column names and values for a specific key like this and then just look at the column names that get returned: [default@MyKeyspace] get DemoCF[ascii('key_42')]; => (column=CA_SanJose, v
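Since the server treats a name such as `CA_SanJose` as one opaque value, any splitting has to happen in the client. A minimal sketch, assuming the two parts are joined by a single underscore (the separator and the function name are assumptions made for illustration):

```python
def split_composite(column_name, separator="_"):
    """Split a client-side composite column name into its two parts.

    Only the first separator is used, so the second part may itself
    contain the separator character.
    """
    first, second = column_name.split(separator, 1)
    return first, second
```

For example, `split_composite("CA_SanJose")` yields the pair `("CA", "SanJose")`.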

Re: Questions about using MD5 encryption with SimpleAuthenticator

2011-05-16 Thread Sameer Farooqui
By the way, just noticed a typo in my email below. I'm using the correct keyspace name in all locations on the cluster... however in my examples below, I used MyKeyspace in some spots and MDR in other spots, but in the cluster I'm specifying the same keyspace name everywhere, so that's not the issu

Questions about using MD5 encryption with SimpleAuthenticator

2011-05-16 Thread Sameer Farooqui
Hi all, We are trying to use MD5 encrypted passwords. Quick question first - Is SHA-2 supported yet? US-CERT of the U.S. Department of Homeland Security has said that MD5 "should be considered cryptographically broken and unsuitable for further use", and the SHA-2 family of hash functions is recommen
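For readers comparing the two hash families mentioned here, a minimal sketch of producing hex digests of a password with Python's standard `hashlib`. Whether SimpleAuthenticator expects exactly this encoding, and whether it accepts SHA-2 at all, are not confirmed by the message; treat both as assumptions to check against the 0.8 source.

```python
import hashlib


def md5_hex(password):
    """Hex MD5 digest of a UTF-8 encoded password (MD5 is considered broken)."""
    return hashlib.md5(password.encode("utf-8")).hexdigest()


def sha256_hex(password):
    """Hex SHA-256 digest, a member of the SHA-2 family."""
    return hashlib.sha256(password.encode("utf-8")).hexdigest()
```

An MD5 hex digest is 32 characters long and a SHA-256 hex digest is 64, so the two are easy to tell apart in a password file.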

Re: Commitlog Disk Full

2011-05-16 Thread mcasandra
You can try to update column family using cassandra-cli. Try to set memtable_throughput to 32 first. [default@unknown] help update column family; update column family Bar; update column family Bar with <att1>=<value1>; update column family Bar with <att1>=<value1> and <att2>=<value2>...; Update a column family with the specified values f

Re: Commitlog Disk Full

2011-05-16 Thread Sanjeev Kulkarni
Hi, Are you referring to the binary_memtable_throughput_in_mb which is a global parameter or the per col fam specific memtable_throughput_in_mb? The former is set to 256 and we don't override the default col fam specific value. Would just re-setting the global binary_memtable_throughput_in_mb to som

Re: Using composite column names in the CLI

2011-05-16 Thread Aaron Morton
What do you mean by composite column names? Do the data type functions supported by get and set help? Or the assume statement? Aaron On 17/05/2011, at 3:21 AM, David Boxenhorn wrote: > Is there a way to view composite column names in the CLI? > > Is there a way to input them (i.e. in the set

Re: AssertionError

2011-05-16 Thread Aaron Morton
The code is trying to follow the column index for a row in an sstable, but it cannot skip as many bytes as it would like to get to the column. Helpfully the help says running out of bytes is only one of the reasons why this could happen:) Can you provide some more information about the query

Re: Cassandra and concurrent programming

2011-05-16 Thread Mark Kerzner
Thank you for a quick answer - I could impress my colleagues at the meeting :) On Mon, May 16, 2011 at 2:54 PM, Peter Schuller wrote: > > In threading, you would do an atomic "put if not present," is there such > a > > thing in Cassandra? > > No. In general, one works to avoid the need for stron

Re: Cassandra and concurrent programming

2011-05-16 Thread Peter Schuller
> In threading, you would do an atomic "put if not present," is there such a > thing in Cassandra? No. In general, one works to avoid the need for strong co-ordination. If strong co-ordination is truly required, some external method is required. Some people use ZooKeeper (http://zookeeper.apache.o

Cassandra and concurrent programming

2011-05-16 Thread Mark Kerzner
Hi, guys, what happens if I have two threads or two processes, both of which need to ask if some condition in Cassandra is fulfilled, and then, say, write the data based on that. If one receives the "no" answer and decides to write, but before he does, the other one receives the "no" answer and wr
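The race described in this question is the classic check-then-act problem. The sketch below shows it inside a single Python process, where a lock makes the check and the write one atomic step. It is an illustration only: the class and function names are hypothetical, and per the replies in this thread Cassandra itself offers no such primitive, so across processes an external coordinator would have to play the role of the lock.

```python
import threading


class InProcessStore:
    """A toy key-value store with an atomic put-if-absent operation."""

    def __init__(self):
        self._data = {}
        self._lock = threading.Lock()

    def put_if_absent(self, key, value):
        """Write value only if key is missing; return True if this call wrote."""
        with self._lock:
            # The check and the write happen under one lock, so no other
            # thread can slip in between them.
            if key in self._data:
                return False
            self._data[key] = value
            return True

    def get(self, key):
        with self._lock:
            return self._data.get(key)


def race_for_key(store, key, contenders):
    """Start several threads that all try to claim the same key."""
    results = []
    results_lock = threading.Lock()

    def attempt(worker_id):
        won = store.put_if_absent(key, worker_id)
        with results_lock:
            results.append(won)

    threads = [threading.Thread(target=attempt, args=(i,)) for i in range(contenders)]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return results
```

However the threads interleave, exactly one call to `put_if_absent` returns `True` for a given key; without the lock, two threads could both see the key as missing and both write.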

Re: insert and batch_insert

2011-05-16 Thread Xiaowei Wang
Thanks Aaron, really helpful! 2011/5/16 aaron morton > batch_mutate() and insert() follow a similar execution path to a single > insert in the server. It's not like putting multiple statements in a > Transaction in the RDBMS. > > Where they do differ is that you can provide multiple columns for

Re: problems using property snitch file

2011-05-16 Thread Anurag Gujral
All lines have different IP addresses. Yes, I am trying to move the single node in DC1. Thanks, Anurag On Mon, May 16, 2011 at 3:19 AM, aaron morton wrote: > Do all the lines have the same IP address or is that just how you redacted > them? > > Is the single node in DC1 the one you are moving? > > Cheers

Re: Concurrent Mark Sweep taking 12 seconds

2011-05-16 Thread Jonathan Ellis
Yes. 2011/5/16 Héctor Izquierdo Seliva : > Hi everyone. I see in the logs that Concurrent Mark Sweep is taking 12 > seconds to do its stuff. Is this normal? There is no stop-the-world GC, > it just takes 12 seconds. > > Configuration: 0.7.5 , 8GB Heap, 16GB machines. 7 * 64 MB memtables. > > --

Using composite column names in the CLI

2011-05-16 Thread David Boxenhorn
Is there a way to view composite column names in the CLI? Is there a way to input them (i.e. in the set command)?

Concurrent Mark Sweep taking 12 seconds

2011-05-16 Thread Héctor Izquierdo Seliva
Hi everyone. I see in the logs that Concurrent Mark Sweep is taking 12 seconds to do its stuff. Is this normal? There is no stop-the-world GC, it just takes 12 seconds. Configuration: 0.7.5 , 8GB Heap, 16GB machines. 7 * 64 MB memtables.

AssertionError

2011-05-16 Thread Desimpel, Ignace
Environment : java 64 bit server, java client, thrift get_slice method, Cassandra 0.7.4, single node Depending on the data I pass for a query on a CF I get the following listed below. Any suggestions what could be wrong based on the stack trace? java.lang.AssertionError at org

Re: Multiget_slice or composite column keys?

2011-05-16 Thread Charles Blaxland
Thanks Aaron, very useful. I'll give some of your suggestions a go... On 16 May 2011 19:13, aaron morton wrote: > I'd stick with the RandomPartitioner until you have a really good reason to > change :) > > I'd also go with your alternative design with some possible tweaks. > > Consider partition

Re: Converting separate clusters in multiple dc to one cluster across multiple dcs

2011-05-16 Thread aaron morton
a) No, the tokens and the endpoints are for the entire cluster. You can only have one snitch per cluster, why do you want multiple ones? b) That happens if the endpoint / node is not listed in the topology, add the node to the topology first. created https://issues.apache.org/jira/browse/CASSA

Re: problems using property snitch file

2011-05-16 Thread aaron morton
Do all the lines have the same IP address or is that just how you redacted them? Is the single node in DC1 the one you are moving? Cheers - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 16 May 2011, at 12:28, Anurag Gujral wrote: > IP

Re: Multiget_slice or composite column keys?

2011-05-16 Thread aaron morton
I'd stick with the RandomPartitioner until you have a really good reason to change :) I'd also go with your alternative design with some possible tweaks. Consider partitioning the rows by year or some other sensible value. If you will generally be getting the most recent data this can reduce
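The suggestion to partition rows by year can be pictured as building the row key from a base identifier plus the year. A minimal sketch, in which the key layout (`base:year`) and both function names are assumptions for illustration rather than anything stated in the thread:

```python
from datetime import datetime


def row_key_for(base_key, timestamp):
    """Build a row key that places data in a per-year row."""
    return "%s:%d" % (base_key, timestamp.year)


def row_keys_for_years(base_key, first_year, last_year):
    """List the row keys to read for an inclusive range of years, newest first."""
    return ["%s:%d" % (base_key, year) for year in range(last_year, first_year - 1, -1)]
```

A reader that usually wants recent data would fetch only the newest key and touch older rows only when a query reaches back further.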

Re: insert and batch_insert

2011-05-16 Thread aaron morton
batch_mutate() and insert() follow a similar execution path to a single insert in the server. It's not like putting multiple statements in a Transaction in the RDBMS. Where they do differ is that you can provide multiple columns for a row in a column family, and these will be applied as on
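To picture how several columns for one row and column family travel together, the sketch below groups individual writes into a nested mapping of row key, then column family, then a list of columns. It uses plain Python tuples and dictionaries and does not call any client library; the function name and the tuple layout are assumptions for illustration.

```python
from collections import defaultdict


def group_writes(writes):
    """Group (row_key, column_family, column_name, value) tuples.

    Returns {row_key: {column_family: [(column_name, value), ...]}},
    preserving the order in which columns were supplied.
    """
    grouped = defaultdict(lambda: defaultdict(list))
    for row_key, column_family, column_name, value in writes:
        grouped[row_key][column_family].append((column_name, value))
    # Convert to plain dicts so the result is easy to inspect or serialize.
    return {row: dict(families) for row, families in grouped.items()}
```

Each inner list then corresponds to the set of columns supplied for one row in one column family.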

Re: Import/Export of Schema Migrations

2011-05-16 Thread David Boxenhorn
What you describe below sounds like what I want to do. I think that the only additional thing I am requesting is to export the migrations from the dev cluster (since Cassandra already has a table that saves them - I just want that information!) so I can import it to the other clusters. This would e