Re: Reduce Cassandra GC

2013-04-17 Thread Joel Samuelsson
You're right, it's probably hard. I should have provided more data. I'm running Ubuntu 10.04 LTS with JNA installed. I believe this line in the log indicates that JNA is working, please correct me if I'm wrong: CLibrary.java (line 111) JNA mlockall successful Total amount of RAM is 4GB. My descr

Key-Token mapping in cassandra

2013-04-17 Thread Ravikumar Govindarajan
We would like to map multiple keys to a single token in cassandra. I believe this should be possible now with CASSANDRA-1034 Ex: Key1 --> 123/IMAGE Key2 --> 123/DOCUMENTS Key3 --> 123/MULTIMEDIA I would like all keys with "123" as prefix to be mapped to a single token. Is this possible? What sh

InvalidRequestException: Start key's token sorts after end token

2013-04-17 Thread Andre Tavares
Hi, I am getting an exception when I run Hadoop with Cassandra that follows: WARN org.apache.hadoop.mapred.Child (main): Error running child java.lang.RuntimeException: InvalidRequestException(why:Start key's token sorts after end token) at org.apache.cassandra.hadoop.ColumnFamilyRecordReader$Wid

Re: InvalidRequestException: Start key's token sorts after end token

2013-04-17 Thread Hiller, Dean
I literally jut replied to your stackoverflow comment then saw this email. I need the whole stack trace. My guess is the ColFamily is configured for one sort method where map/reduce is using another or something when querying but that's just a guess. Dean From: Andre Tavares mailto:andre...@

Re: InvalidRequestException: Start key's token sorts after end token

2013-04-17 Thread Andre Tavares
Dean, sorry, but I saw your comments on Stackoverflow ( http://stackoverflow.com/questions/16041727/operationtimeoutexception-cassandra-cluster-aws-emr) just after I sent this message ... and I think you may be right about the sort method, but Priam sets Cassandra partitioner with "RandomParti

Getting error while inserting data in cassandra table using Java with JDBC

2013-04-17 Thread himanshu.joshi
Hi, When I am trying to insert the data into a table using Java with JDBC, I am getting the error InvalidRequestException(why:cannot parse 'Jo' as hex bytes) My insert quarry is: insert into temp(id,name,value,url_id) VALUES(108, 'Aa','Jo',10); This insert quarry is running successfully fro

Re: InvalidRequestException: Start key's token sorts after end token

2013-04-17 Thread Hiller, Dean
What's the stack trace you see? At the time, I was thinking column scan not row scan as perhaps your code or priam's code was doing a column slice within a row set and the columns are sorted by Integer while priam is passing in UTF8 or vice-versa. Ie. Do we know if this is a column sorting iss

looking at making astyanax asynchronous but cassandra-thrift-1.1.1 doesn't look right

2013-04-17 Thread Hiller, Dean
Is cassandra-thrift-1.1.1.jar the generated code? I see a send() and recv() but I don't see a send(Callback cb) that is typicaly of true asynchronous platforms. Ie. I don't know when to call recv myself obviously if I am trying to make astyanax truly asynchronous. The reason I ask is we have

Re: How to stop Cassandra and then restart it in windows?

2013-04-17 Thread Raihan Jamal
Hello, Can anyone provide any help on this? Thanks in advance. *Raihan Jamal* On Tue, Apr 16, 2013 at 6:50 PM, Raihan Jamal wrote: > Hello, > > I installed single node cluster in my local dev box which is running > Windows 7 and it was working fine. Due to some reason, I need to restart

Re: Added extra column as composite key while creation counter column family

2013-04-17 Thread Robert Coli
On Tue, Apr 16, 2013 at 10:29 PM, Kuldeep Mishra wrote: > cassandra 1.2.0 > > Is it a bug in 1.2.0 ? > While I can't speak to this specific issue, 1.2.0 has meaningful known issues. I suggest upgrade to 1.2.3(/4) ASAP. =Rob

Re: Thrift message length exceeded

2013-04-17 Thread Lanny Ripple
That was our first thought. Using maven's dependency tree info we verified that we're using the expected (cass 1.2.3) jars $ mvn dependency:tree | grep thrift [INFO] | +- org.apache.thrift:libthrift:jar:0.7.0:compile [INFO] | \- org.apache.cassandra:cassandra-thrift:jar:1.2.3:compile I've also

Re: MySQL Cluster performing faster than Cassandra cluster on single table

2013-04-17 Thread aaron morton
How many threads / processes do you have performing the writes? How big are the mutations ? Where are you measuring the latency ? Look at the nodetool cfhistograms to see the time it takes for a single node to perform a write. Look at the nodetool proxyhistograms to see the end to end request

Re: differences between DataStax Community Edition and Cassandra package

2013-04-17 Thread aaron morton
It's the same as the Apache version, but DSC comes with samples and the free version of Ops Centre. Cheers - Aaron Morton Freelance Cassandra Consultant New Zealand @aaronmorton http://www.thelastpickle.com On 17/04/2013, at 6:36 PM, Francisco Trujillo wrote: > Hi everyone

Re: Cassandra Client Recommendation

2013-04-17 Thread aaron morton
One node on the native binary protocol, AFAIK it's still considered beta in 1.2 Also +1 for Astyanax Cheers - Aaron Morton Freelance Cassandra Consultant New Zealand @aaronmorton http://www.thelastpickle.com On 17/04/2013, at 6:50 PM, Francisco Trujillo wrote: > Hi > > We a

Re: Reduce Cassandra GC

2013-04-17 Thread aaron morton
> INFO [ScheduledTasks:1] 2013-04-15 14:00:02,749 GCInspector.java (line 122) > GC for ParNew: 338798 ms for 1 collections, 592212416 used; max is 1046937600 This does not say that the heap is full. ParNew is GC activity for the new heap, which is typically a smaller part of the overall heap.

Re: Key-Token mapping in cassandra

2013-04-17 Thread aaron morton
> CASSANDRA-1034 That ticket is about removing an assumption which was not correct. > I would like all keys with "123" as prefix to be mapped to a single token. Why? it's not possible nor desirable IMHO. Tokens are used to identify a single row internally. Cheers - Aaron Mor

Re: Getting error while inserting data in cassandra table using Java with JDBC

2013-04-17 Thread aaron morton
What version are you using ? And what JDBC driver ? Sounds like the driver is not converting the value to bytes for you. > I guess the problem may because of undefined > key_validation_class,default_validation_class and comparator etc. If you are using CQL these are not relevant. Cheers --

Multi datacenter setup question

2013-04-17 Thread More, Sandeep R
Hello, My test setup consist of two datacenters DC1 and DC2. DC2 has a offset of 10 as you can see in the following ring command. I have two questions: 1) Let's say in this case I insert a key at DC2 and its token is, let's say 85070591730234615865843651857942052874, in this case will it be

Using an EC2 cluster from the outside.

2013-04-17 Thread maillists0
I have a working 3 node cluster in a single ec2 region and I need to hit it from our datacenter. As you'd expect, the client gets the internal addresses of the nodes back. Someone on irc mentioned using the public IP for rpc and binding that address to the box. I see that mentioned in an old list

Re: Cassandra Client Recommendation

2013-04-17 Thread Techy Teck
Thanks Aaron for the suggestion. I am not sure, I was able to understand regarding one node thing you mentioned on the native binary protocol? Can you please elaborate that? On Wed, Apr 17, 2013 at 11:21 AM, aaron morton wrote: > One node on the native binary protocol, AFAIK it's still consider

Re: InvalidRequestException: Start key's token sorts after end token

2013-04-17 Thread aaron morton
If you Hadoop task supplying both a start and finish key for the slice ? You probably only want the start. Provide the full call stack and the code in your hadoop task. Cheers - Aaron Morton Freelance Cassandra Consultant New Zealand @aaronmorton http://www.thelastpickle.com

Re: looking at making astyanax asynchronous but cassandra-thrift-1.1.1 doesn't look right

2013-04-17 Thread aaron morton
Here's an example I did in python a long time ago http://www.mail-archive.com/user@cassandra.apache.org/msg04775.html Call send() then select on the file handle, when it's ready to read call recv(). Or just add more threads on your side :) Cheers - Aaron Morton Freelance Cas

Re: differences between DataStax Community Edition and Cassandra package

2013-04-17 Thread Robert Coli
On Wed, Apr 17, 2013 at 11:19 AM, aaron morton wrote: > It's the same as the Apache version, but DSC comes with samples and the > free version of Ops Centre. > > DSE also comes with Solr special sauce and CDFS. =Rob

Re: How to stop Cassandra and then restart it in windows?

2013-04-17 Thread aaron morton
> Error: Exception thrown by the agent : java.rmi.server.ExportException: Port > already in use: 7199; nested exception is: > java.net.BindException: Address already in use: JVM_Bind The process is already running, is it installed as a service and was it automatically started when the sys

Re: Using an EC2 cluster from the outside.

2013-04-17 Thread Robert Coli
On Wed, Apr 17, 2013 at 12:07 PM, wrote: > I have a working 3 node cluster in a single ec2 region and I need to hit > it from our datacenter. As you'd expect, the client gets the internal > addresses of the nodes back. > > Someone on irc mentioned using the public IP for rpc and binding that > ad

Re: Thrift message length exceeded

2013-04-17 Thread aaron morton
Can you reproduce this in a simple way ? Cheers - Aaron Morton Freelance Cassandra Consultant New Zealand @aaronmorton http://www.thelastpickle.com On 18/04/2013, at 5:50 AM, Lanny Ripple wrote: > That was our first thought. Using maven's dependency tree info we verified >

Re: Multi datacenter setup question

2013-04-17 Thread aaron morton
> 1) Let’s say in this case I insert a key at DC2 and its token is, let’s > say 85070591730234615865843651857942052874, in this case will it be owned by > DC2 ? and then replicated on DC1 ? i.e. who owns it. We don't think in terms of owning the token. The token range in the local DC that c

Re: Cassandra Client Recommendation

2013-04-17 Thread aaron morton
Was a typo, should have been "One note on" Cheers - Aaron Morton Freelance Cassandra Consultant New Zealand @aaronmorton http://www.thelastpickle.com On 18/04/2013, at 7:23 AM, Techy Teck wrote: > Thanks Aaron for the suggestion. I am not sure, I was able to understand > rega

How to make compaction run faster?

2013-04-17 Thread Jay Svc
Hi Team, I have a high write traffic to my Cassandra cluster. I experience a very high number of pending compactions. As I expect higher writes, The pending compactions keep increasing. Even when I stop my writes it takes several hours to finishing pending compactions. My CF is configured with

Re: How to make compaction run faster?

2013-04-17 Thread Edward Capriolo
three things: 1) compaction throughput is fairly low (yaml nodetool) 2) concurrent compactions is fairly low (yaml) 3) multithreaded compaction might be off in your version Try raising these things. Otherwise consider option 4. 4)$$$ RAID,RAM wrote: > Hi Team, > > > > I have

Re: How to make compaction run faster?

2013-04-17 Thread Alexis Rodríguez
:D Jay, check if your disk(s) utilization allows you to change the configuration the way Edward suggest. iostat -xkcd 1 will show you how much of your disk(s) are in use. On Wed, Apr 17, 2013 at 5:26 PM, Edward Capriolo wrote: > three things: > 1) compaction throughput is fairly low (yaml nod

Re: Thrift message length exceeded

2013-04-17 Thread Lanny Ripple
It's slow going finding the time to do so but I'm working on that. We do have another table that has one or sometimes two columns per row. We can run jobs on it without issue. I looked through org.apache.cassandra.hadoop code and don't see anything that's really changed since 1.1.5 (which was

Re: How to make compaction run faster?

2013-04-17 Thread Jay Svc
Hi Edward, Thank you for response. I have tried the following - 1. I have tried various compaction throughput ranging from 16M to 1G. CPU continued to be low and memory between 40% to 50%. I still see compaction more backing. 2. Does concurrent compactors take effect with Leveled compaction? (yam

Re: How to make compaction run faster?

2013-04-17 Thread Jay Svc
Hi Alexis, Thank you for your response. My commit log is on SSD. which shows me 30 to 40 ms of disk latency. When I ran iostat; I see "await" 26ms to 30 ms for my commit log disk. My CPU is less than 18% used. How I reduce the disk latency for my commit log disk. They are SSDs. Thank you in ad

Re: How to make compaction run faster?

2013-04-17 Thread Alexis Rodríguez
Jay, I believe that compaction occurs on the data directories and not in the commitlog. http://wiki.apache.org/cassandra/MemtableSSTable On Wed, Apr 17, 2013 at 7:58 PM, Jay Svc wrote: > Hi Alexis, > > Thank you for your response. > > My commit log is on SSD. which shows me 30 to 40 ms of d

Re: How to stop Cassandra and then restart it in windows?

2013-04-17 Thread Raihan Jamal
When I first started cassandra, I started as- Cassandra -f So I believe that's why it is getting started as service. Whenever I reboot my machine, Cassandra is up always. I am not able to find that process in windows to shut it down. I tried finding 7199 port but not able to find that one out. I

Re: Key-Token mapping in cassandra

2013-04-17 Thread Ravikumar Govindarajan
Thanks Aaron. We are looking at co-locating all keys for a given user in one Cassandra node. Are there any other ways to achieve this -- Ravi On Thursday, April 18, 2013, aaron morton wrote: > CASSANDRA-1034 > > That ticket is about removing an assumption which was not correct. > > I would like

Re: Using an EC2 cluster from the outside.

2013-04-17 Thread Ben Bromhead
Depending on your client, disable automatic client discovery and just specify a list of all your nodes in your client configuration. For more details check out http://xzheng.net/blogs/problem-when-connecting-to-cassandra-with-ruby/ , obviously this deals specifically with a ruby client but it s

[no subject]

2013-04-17 Thread Ertio Lew
I run cassandra on single win 8 machine for development needs. Everything has been working fine for several months but just today I saw this error message in cassandra logs & all host pools were marked down. ERROR 08:40:42,684 Error occurred during processing of message. java.lang.StringIndexOu

Failed shuffle

2013-04-17 Thread David McNelis
I had a situation earlier where my shuffle failed after a hard disk drive filled up. I went through and disabled shuffle on the machines while trying to get the situation resolved. Now, while I can re-enable shuffle on the machines, when trying to do an ls, I get a timeout. Looking at the cassan

Re: Getting error while inserting data in cassandra table using Java with JDBC

2013-04-17 Thread himanshu.joshi
On 04/18/2013 12:06 AM, aaron morton wrote: What version are you using ? And what JDBC driver ? Sounds like the driver is not converting the value to bytes for you. I guess the problem may because of undefined key_validation_class,default_validation_class and comparator etc. If you are using

Re: Repair hanges on 1.1.4

2013-04-17 Thread adeel . akbar
Hi Aaron, Thank you for your feedback. I have also installed DataStax OPS center and its nothing shows progress of repair. Previously every repair progress also shown on OPS center and once it 100%, reapir also completed on nodes. but now reapir is in progress on node but OPS center nothi