Re: Adding a node to cluster keeping 100% data replicated on all nodes

2014-02-10 Thread _ _
> Hi, Our environment will consist of clusters of no more than 2 to 4 nodes each (all located in the same DC). We want to ensure that every node in the cluster will own 100% of the data. The node adding (or removing) procedure will be automated, so we want to ensure we'

Re: One of my nodes is in the wrong datacenter - help!

2014-02-10 Thread Sholes, Joshua
In case anyone was following this issue, it ended up being something that looked an awful lot like CASSANDRA-6053: when the node was removed, it wasn't successfully removed from the peers table on all nodes, and thus several of them were doing their best to try to contact it despite it being d

CQL3 Custom Functions

2014-02-10 Thread Drew Kutcharian
Hey Guys, How can I define custom CQL3 functions (similar to dateOf, now, etc)? Cheers, Drew

RE: Hector Could not flush transport error

2014-02-10 Thread Senthil, Athinanthny X. -ND
Version is C 1.2.6. We use DSE 3.1.3 From: Robert Coli [mailto:rc...@eventbrite.com] Sent: Friday, February 07, 2014 4:17 PM To: user@cassandra.apache.org Subject: Re: Hector Could not flush transport error On Fri, Feb 7, 2014 at 4:05 PM, Senthil, Athinanthny X. -ND mailto:athinanthny.x.senthil.

Re: One of my nodes is in the wrong datacenter - help!

2014-02-10 Thread Edward Capriolo
Maybe that node was just trying to tell you that it really wanted to work in a different data center :) On Mon, Feb 10, 2014 at 10:08 AM, Sholes, Joshua < joshua_sho...@cable.comcast.com> wrote: > In case anyone was following this issue, it ended up being something > that looked an awful lot l

Recommended OS

2014-02-10 Thread Keith Wright
Hi all, I was wondering what operating systems and versions people are running with success in production environments? We are using C* 1.2.13 and have had issues using CentOS 6.4/6.5. Are others using that OS? What would people recommend? What about Java 6 vs 7 (specific versions?!)?

Re: Recommended OS

2014-02-10 Thread Sholes, Joshua
What issues are you running into with CentOS 6.4/5? I’m running 1.2.8 on CentOS 6.3 and Java 1.7.0-25, and about to test with 1.7.latest. -- Josh Sholes From: Keith Wright mailto:kwri...@nanigans.com>> Reply-To: "user@cassandra.apache.org" mailto:user@cassandra

problems loading cassandra data from pig

2014-02-10 Thread Irooniam
Hello, I posted this issue to the pig mailing list, but I'm thinking the issue I'm having is more related to Cassandra. When I run pig scripts against hadoop they work as advertised; however, when I try to have pig get data from Cassandra it fails every time. Cassandra: [cqlsh 4.1.0 | Cassandra 2.0.

Re: Recommended OS

2014-02-10 Thread Keith Wright
We are running on CentOS 6.4, but an upgrade to 6.5 caused packets to back up on the net queue, causing HUGE load spikes and cluster meltdown. Ultimately we reverted. Have others seen this? Are others running CentOS 6.4/6.5? Thanks From: Sholes, Joshua mailto:joshua_sho...@cable.comcast.com>> Reply-

impact of update operation to read operation

2014-02-10 Thread Jiaan Zeng
Hi All, I am using Cassandra 1.2.4. I wonder whether update operations have a *permanent* impact on read operations. Below is the scenario. Previously, a read-only workload ran against one column family at 4000 qps. Later, a read-update mixed workload ran against the same column family. After that

Re: impact of update operation to read operation

2014-02-10 Thread Tupshin Harper
You don't mention disks and RAM, but I would assume that the additional data meant that you could now cache a lower percentage and that you have to seek on disk more often. -Tupshin On Feb 10, 2014 4:14 PM, "Jiaan Zeng" wrote: > Hi All, > > I am using Cassandra 1.2.4. I wonder if update operati

Re: impact of update operation to read operation

2014-02-10 Thread Benedict Elliott Smith
Also, a major compaction doesn't flush the memtable. If the memtable is still full, reads may take slightly longer as they may have to be merged with any on-disk data before being served. On 10 February 2014 21:18, Tupshin Harper wrote: > You don't mention disks and RAM, but I would assume tha

RE: Recommended OS

2014-02-10 Thread Brust, Corwin [Hollander]
We're running C* 2.0.5 under CentOS 6.5 and have not noticed anything like you describe. We have just a couple of pre-production rings (Dev and Test) meaning nothing we have has received particularly intense utilization. Corwin From: Keith Wright [mailto:kwri...@nanigans.com] Sent: Monday, Feb

RE: Recommended OS

2014-02-10 Thread Keith Wright
Is this your first cluster? Have you run older versions of Cassandra? Any specific resource tuning? Thanks all. We are unable to bootstrap nodes and are considering creating a fresh cluster in hopes this is somehow data related. On Feb 10, 2014 5:33 PM, "Brust, Corwin [Hollander]" wrote:

Re: ring describe returns only public ips

2014-02-10 Thread Chris Burroughs
More generally, a thrift api or other mechanism for Astyanax to get the INTERNAL_IP seems necessary to use ConnectionPoolType.TOKEN_AWARE + NodeDiscoveryType.TOKEN_AWARE in a multi-dc setup. Absent one I'm confused how that combination is possible. On 02/06/2014 03:17 PM, Ted Pearson wrote:

Using "IN" with the Datastax driver (2.0-??)

2014-02-10 Thread Jacob Rhoden
Hi Guys, I'm experimenting with using IN to reduce the number of queries I have to execute. The following works in CQL, i.e. select log_entry from log_index where keyword in ('keyword1', 'keyword2', 'keyword3', etc…); So I now want to work out how to convert this: PreparedStateme

Re: Using "IN" with the Datastax driver (2.0-??)

2014-02-10 Thread DuyHai Doan
Hello Jacob, You can try the bind marker for a variadic param (new feature): PreparedStatement p = session.prepare("select log_entry from log_index where keyword IN ?"); session.execute(p.bind(Arrays.asList("keyword1","keyword2",...))); Regards Duy Hai DOAN On Mon, Feb 10, 2014 at 11:50 PM,

Re: Using "IN" with the Datastax driver (2.0-??)

2014-02-10 Thread Jacob Rhoden
Perfect, thanks! I wonder if this is documented anywhere? Certainly I have no idea how to search google using the keyword “in” :D String[] words = TagsToArray.tagsToArray(keyword.toLowerCase()); PreparedStatement p = api.getCassandraSession().prepare("select log_entry from log_index wher

Re: Using "IN" with the Datastax driver (2.0-??)

2014-02-10 Thread DuyHai Doan
I don't know if it's documented somewhere. Personally, I got the info by following the Cassandra dev blog and reading the release notes for each new version. Each set of Cassandra release notes lists bug fixes but also new features. Just read the corresponding JIRA to get the details. Regards Duy H

Clarification on how multi-DC replication works

2014-02-10 Thread Sameer Farooqui
Hi, I was hoping someone could clarify a point about multi-DC replication. Let's say I have 2 data centers configured with replication factor = 3 in each DC. My client app is sitting in DC 1 and is able to intelligently pick a coordinator that will also be a replica partner. So the client app s