Re: setting PIG_INPUT_INITIAL_ADDRESS environment . variable in Oozie for cassandra ...¿?

2013-12-12 Thread Miguel Angel Martin junquera
¡Eureka! At last !!! the trick?? As well as putting the jar libraries dependecies in the sharelib folder in hdfs ... I had define this and the other environment variables in the bash_profile and works fine if I launch pig scripts from command line shell. I have to define these variables: PIG_

One big table/cf or many small ones?

2013-12-12 Thread Tinus Sky
Hello, I have two questions regarding modelling a schema optimized for performance for Cassandra. My service does have users who can add a message to a list. The list of message is sorted by date and displayed. When a user changes the message the date is changed and the message moves to the top o

Unable to create collection inside collection

2013-12-12 Thread Santosh Shet
Hi, I am not able to create collection inside another collection in Cassandra. Please find screenshot below [cid:image001.png@01CEF750.4A9B5020] In the above screenshot, I am trying to create column named feeschedule with type Map and Map have values which is of type List. Could anybody sugge

Unbalanced ring with C* 2.0.3 and vnodes after adding additional nodes

2013-12-12 Thread Andreas Finke
Hi, after adding 2 more nodes to a 4 nodes cluster (before) we are experiencing high load on both new nodes. After doing some investigation we found out the following: - High cpu load on vm5+6 - Higher data load on vm5+6 - Write requests are evenly distributed to all 6 nodes by our client appl

Re: user / password authentication advice

2013-12-12 Thread John Sanda
You could use CassandraAuthorizer and PaaswordAuthenticator which ships with Cassandra. See this article[1] for a good overview. [1] http://www.datastax.com/dev/blog/a-quick-tour-of-internal-authentication-and-authorization-security-in-datastax-enterprise-and-apache-cassandra On Thursday, Decembe

Re: Write performance with 1.2.12

2013-12-12 Thread srmore
On Wed, Dec 11, 2013 at 10:49 PM, Aaron Morton wrote: > It is the write latency, read latency is ok. Interestingly the latency is > low when there is one node. When I join other nodes the latency drops about > 1/3. To be specific, when I start sending traffic to the other nodes the > latency for a

Re: Raid Issue on EC2 Datastax ami, 1.2.11

2013-12-12 Thread Philippe Dupont
Hi Aaron, As you can see in the picture, there is not much steal on iostat. That's the same with top. https://imageshack.com/i/0jm4jyp Philippe 2013/12/10 Aaron Morton > Thanks for the update Philip, other people have reported high await on a > single volume previously but I don’t think it’s

Re: 2 nodes cassandra cluster raid10 or JBOD

2013-12-12 Thread cem
Thanks! On Wed, Dec 11, 2013 at 9:37 PM, Aaron Morton wrote: > If you have two nodes, and RF 2, you will only be able to use eventual > consistency. If you want to have stronger consistency and some redundancy 3 > nodes is the minimum requirement. > > In the current setup, with only 2 nodes, I w

Cassandra pytho pagination

2013-12-12 Thread Kumar Ranjan
Hey Folks, I need some ideas about support implementing of pagination on the browser, from the backend. So python code (backend) gets request from frontend with page=1,2,3,4 and so on and count_per_page=50. I am trying to use xget with column_count and buffer_size parameters. Can someone explain

Re: Write performance with 1.2.12

2013-12-12 Thread J. Ryan Earl
Why did you switch to RandomPartitioner away from Murmur3Partitioner? Have you tried with Murmur3? 1. # partitioner: org.apache.cassandra.dht.Murmur3Partitioner 2. partitioner: org.apache.cassandra.dht.RandomPartitioner On Fri, Dec 6, 2013 at 10:36 AM, srmore wrote: > > > > On Fri, De

Re: Write performance with 1.2.12

2013-12-12 Thread srmore
On Thu, Dec 12, 2013 at 11:15 AM, J. Ryan Earl wrote: > Why did you switch to RandomPartitioner away from Murmur3Partitioner? > Have you tried with Murmur3? > > >1. # partitioner: org.apache.cassandra.dht.Murmur3Partitioner >2. partitioner: org.apache.cassandra.dht.RandomPartitioner > >

Migrate data to new cluster using datacenters?

2013-12-12 Thread Andrew Cooper
Hello, We are in the process of isolating multiple applications currently running in one large cassandra cluster to individual smaller clusters. Each application runs in its own keyspace. In order to reduce/eliminate downtime for a migration, I was curious if anyone had attempted the followin

Re: Migrate data to new cluster using datacenters?

2013-12-12 Thread Fabien Rousseau
Hi, We did it once and it worked well. Those two links should help (this is more or less what we've done) : http://www.datastax.com/documentation/cassandra/1.2/webhelp/cassandra/operations/ops_add_dc_to_cluster_t.html http://www.datastax.com/documentation/cassandra/1.2/webhelp/cassandra/operations

Cassandra data update for a row

2013-12-12 Thread Kumar Ranjan
Hey Folks, I have a row like this. 'twitter_row_key' is the row key and 411186035495010304 is column. Rest is values for 411186035495010304 column. See below. 'twitter_row_key': OrderedDict([('411186035495010304', u'{"score": 0, "tid": 411186035495010304, "created_at": "Thu Dec 12 17:29:24 +

Re:

2013-12-12 Thread Kumar Ranjan
Thanks Aaron. On Wed, Dec 11, 2013 at 10:45 PM, Aaron Morton wrote: > > SYSTEM_MANAGER.create_column_family('Narrative','Twitter_search_test', > comparator_type='CompositeType', default_validation_class='UTF8Type', > key_validation_class='UTF8Type', column_validation_classes=validators) > > Comp

Re: Nodetool repair exceptions in Cassandra 2.0.2

2013-12-12 Thread David Laube
Hi Michael, Upgrading from 2.0.2 to 2.0.3 seems to have done the trick! Thank you for the recommendation! Best regards, -David Laube On Dec 9, 2013, at 3:57 PM, "Laing, Michael" wrote: > My experience is that you must upgrade to 2.0.3 ASAP to fix this. > > Michael > > > On Mon, Dec 9, 2013

Re: Nodetool repair exceptions in Cassandra 2.0.2

2013-12-12 Thread David Laube
Thank you for the reply Aaron. Unfortunately, I could not seem to find any additional info in the logs. However, upgrading from 2.0.2 to 2.0.3 seems to have done the trick! Best regards, -David Laube On Dec 11, 2013, at 6:51 PM, Aaron Morton wrote: >> [2013-12-08 11:04:02,047] Repair session

Get all the data for x number of seconds from CQL?

2013-12-12 Thread Techy Teck
Below is my CQL table - CREATE TABLE test1 ( id text, record_name text, record_value blob, PRIMARY KEY (id, record_name) ) here id column will have data like this - timestamp.count And here timestamp is in milliseconds but rounded up to nearest seconds. So as

Re: Bulkoutputformat

2013-12-12 Thread varun allampalli
Thanks Aaron, I was able to generate sstables and load using sstableloader. But after loading the tables when I do a select query I get this, the table has only one record. Is there anything I am missing or any logs I can look at. Request did not complete within rpc_timeout. On Wed, Dec 11, 2013

Re: Bulkoutputformat

2013-12-12 Thread varun allampalli
Hi Aaron, It seems like you answered the question here. https://groups.google.com/forum/#!topic/nosql-databases/vjZA5vdycWA Can you give me the link to the blog which you mentioned http://thelastpickle.com/2013/01/11/primary-keys-in-cql/ Thanks in advance Varun On Thu, Dec 12, 2013 at 3:36 P

Re: Write performance with 1.2.12

2013-12-12 Thread Rahul Menon
Quote from http://www.datastax.com/dev/blog/performance-improvements-in-cassandra-1-2 *"Murmur3Partitioner is NOT compatible with RandomPartitioner, so if you’re upgrading and using the new cassandra.yaml file, be sure to change the partitioner back to RandomPartitioner"* On Thu, Dec 12, 2013 at