Re: Pig + Cassandra = Connection errors

2010-08-21 Thread Christian Decker
Trying to better understand the problem I tried some variations, but first my setup: 1. hmaster: runs the hadoop namenode, jobtracker, a tasktracker and a datanode, also it runs Cassandra and is the first node in the seedlist in the client configuration (CassandraStorage for Pig) 2. h

Re: Pig + Cassandra = Connection errors

2010-08-19 Thread Christian Decker
In the hopes to better understand the problem I took the liberty of putting the storage-conf.xml on the net [1]. I even tried starting from scratch again, and taking care about which interfaces I use, and what ports I bind to, but until now, nothing really got me anywhere. [1] http://pastebin.com/

Re: Pig + Cassandra = Connection errors

2010-08-18 Thread Christian Decker
I have absolutely no idea what is causing the rejections, they appear to be totally random, on all 3 hosts of my cluster. I cleared all iptables states, and since they all sit on the same switch I don't think it has to do with the underlying network. Is there a connection limit on Cassandra nodes?

Re: Pig + Cassandra = Connection errors

2010-08-18 Thread Jonathan Ellis
why are you getting connection refused? do you have a firewall problem? On Wed, Aug 18, 2010 at 7:17 AM, Christian Decker wrote: > Hi all, > I'm trying to get Pig scripts to work on data in Cassandra and right now I > want to simply run the example-script.pig on a different Keyspace/CF > contain

Re: Pig + Cassandra = Connection errors

2010-08-18 Thread Christian Decker
You mean the ? Right now it's 1 milliseconds. So that should take care of the timeouts, but what about the refused connections? On Wed, Aug 18, 2010 at 3:08 PM, Drew Dahlke wrote: > What's your cassandra timeout configured to? It's not uncommon to > raise that to 30sec if you're getting time

Re: Pig + Cassandra = Connection errors

2010-08-18 Thread Drew Dahlke
What's your cassandra timeout configured to? It's not uncommon to raise that to 30sec if you're getting timeouts. On Wed, Aug 18, 2010 at 8:17 AM, Christian Decker wrote: > Hi all, > I'm trying to get Pig scripts to work on data in Cassandra and right now I > want to simply run the example-script

Pig + Cassandra = Connection errors

2010-08-18 Thread Christian Decker
Hi all, I'm trying to get Pig scripts to work on data in Cassandra and right now I want to simply run the example-script.pig on a different Keyspace/CF containing ~6'000'000 entries. I got it running but then the job aborts after quite some time, and when I look at the logs I see hundreds of these: