iteration does not yield all data with consistency ONE

2010-11-10 Thread Eric van Orsouw
Hello, We have a cluster of 4 nodes (0.6.6) and use the random partitioner and a replication of 2. When I insert a number of rows I can always retrieve them by their explicit id (get_range_slices("","", 1). Playing with consistency levels and temporarily shutting down a Cassandra node all yield

Re: WordCount example problem

2010-11-10 Thread Patrik Modesto
Hi, I'm trying the WordCount example and getting this error: [12:33]$ ./bin/word_count 10/11/10 12:34:35 INFO WordCount: output reducer type: filesystem 10/11/10 12:34:36 INFO jvm.JvmMetrics: Initializing JVM Metrics with processName=JobTracker, sessionId= 10/11/10 12:34:36 INFO WordCount: XXX:te

Re: Data management on a ring

2010-11-10 Thread aaron morton
If I understand your correctly, you just want to add 8 nodes to a ring that already has 2 ? You could add the nodes and manually assign them tokens following the guidelines here http://wiki.apache.org/cassandra/Operations I'm not sure how to ensure the minimum amount of data transfer though. A

Re: Data management on a ring

2010-11-10 Thread Jean-Yves LEBLEU
Thanks for the anwser. It was not exactly my point, I would like to know if in a 10 nodes rings if it is possible to restrict replication of some data to only 2 nodes, and other data to all nodes ? Regards. Jean-Yves On Wed, Nov 10, 2010 at 11:17 AM, aaron morton wrote: > If I understand your co

about key sorting and token partitioning

2010-11-10 Thread zangds
Hi, I am using cassandra to store a message steam, and want to use timestamps (like mmddhhMIss or something alike) as the keys. So if I use RandomPartitioner, I will loose the order when using get_range_slices(). If I use OrderPreservingPartitioner, how should I configure cassandra to make l

Re: about key sorting and token partitioning

2010-11-10 Thread Peter Schuller
> I am using cassandra to store a message steam, and want to use timestamps > (like mmddhhMIss or something alike) as the keys. > So if I use RandomPartitioner, I will loose the order when using > get_range_slices(). > If I use OrderPreservingPartitioner, how should I configure cassandra to > m

Re: Data management on a ring

2010-11-10 Thread Jonathan Ellis
Yes, on a per-keyspace basis with NetworkTopologyStrategy (in 0.7). On Wed, Nov 10, 2010 at 4:40 AM, Jean-Yves LEBLEU wrote: > Thanks for the anwser. > > It was not exactly my point, I would like to know if in a 10 nodes rings if > it is possible to restrict replication of some data to only 2 nod

Re: WordCount example problem

2010-11-10 Thread Jonathan Ellis
http://www.mail-archive.com/user@cassandra.apache.org/msg07093.html On Wed, Nov 10, 2010 at 5:47 AM, Patrik Modesto wrote: > Hi, > > I'm trying the WordCount example and getting this error: > > [12:33]$ ./bin/word_count > 10/11/10 12:34:35 INFO WordCount: output reducer type: filesystem > 10/11/1

Re: iteration does not yield all data with consistency ONE

2010-11-10 Thread Jonathan Ellis
Was the node that should have the other replica of this row down when it was inserted? On Wed, Nov 10, 2010 at 6:08 AM, Eric van Orsouw wrote: > > Hello, > > > > We have a cluster of 4 nodes (0.6.6) and use the random partitioner and a > replication of 2. > > When I insert a number of rows I can

Question about consitency level & data propagation & eventually consistent

2010-11-10 Thread Thibaut Britz
Hi, Assuming I'm reading and writing with consitency level 1 (one), read repair turned off, I have a few questions about data propagation. Data is being stored at consistency level 3. I'm not interested in the deletes. I can live with older data (or data that has been deleted and will reappear),

RE: iteration does not yield all data with consistency ONE

2010-11-10 Thread Eric van Orsouw
No, all nodes were up and running while the single key was inserted. The insert however was with consistency One. I assume however that the replicas are still written in this case. It is btw also very reproducible. -Original Message- From: Jonathan Ellis [mailto:jbel...@gmail.com] Sent:

Range queries using token instead of key

2010-11-10 Thread Anand Somani
Hi, I am trying to iterate over the entire dataset to calculate some information. Now the way I am trying to do this is by going directly to the node that has a data range, so here is the route I am following - get TokenRange using - describe_ring - then for each tokenRange pick a node and

Re: Question about consitency level & data propagation & eventually consistent

2010-11-10 Thread Peter Schuller
> 1) If all nodes are up: >  - Will all writes eventually reach all nodes (of the 3 nodes)? I believe that if read repair is completely off, then for data that was written that did *not* get saved by hinted hand-off, would not propagate until anti-entropy as part of a 'nodetool repair' or perhaps

RE: WordCount example problem

2010-11-10 Thread Aditya Muralidharan
Also, your Mapper class needs to look like this: MyMapper extends Mapper,Text,SumWritable> ... with all the necessary fixes to the map method. AD -Original Message- From: Jonathan Ellis [mailto:jbel...@gmail.com] Sent: Wednesday, November 10, 2010 8:40 AM To: user Subject: Re: WordCount

Re: iteration does not yield all data with consistency ONE

2010-11-10 Thread Jonathan Ellis
Interesting. Does it simplify further to RF=1 and 2 nodes? On Wed, Nov 10, 2010 at 8:58 AM, Eric van Orsouw wrote: > No, all nodes were up and running while the single key was inserted. > The insert however was with consistency One. I assume however that the > replicas are still written in this

Re: Question about consitency level & data propagation & eventually consistent

2010-11-10 Thread Jonathan Ellis
On Wed, Nov 10, 2010 at 8:54 AM, Thibaut Britz wrote: > Assuming I'm reading and writing with consitency level 1 (one), read repair > turned off, I have a few questions about data propagation. > Data is being stored at consistency level 3. > 1) If all nodes are up: >  - Will all writes eventually

Re: MapReduce/Hadoop in cassandra 0.7 beta3

2010-11-10 Thread Jeremy Hanna
Aditya, Can you reproduce the problem locally with "pig -x local myscript.pig"? Also, moving this message back to the cassandra user list. On Nov 10, 2010, at 10:47 AM, Aditya Muralidharan wrote: > Hi, > > I'm still getting the error associated with > https://issues.apache.org/jira/browse/CAS

Re: Cassandra 0.7 bootstrap exception on windows

2010-11-10 Thread Jeremy Hanna
moving this to the cassandra user list. On Nov 10, 2010, at 11:05 AM, Aditya Muralidharan wrote: > Hi, > > I'm building (on windows) a release tar from the HEAD of the Cassandra 0.7 > branch. Running a new single node instance of Cassandra gives me the > following bootstrap exception: > INFO 1

Re: MapReduce/Hadoop in cassandra 0.7 beta3

2010-11-10 Thread Stu Hood
Hey Aditya, Would you mind attaching that last hundred few lines from before the exception from the server log to this ticket: https://issues.apache.org/jira/browse/CASSANDRA-1724 ? Thanks, Stu -Original Message- From: "Jeremy Hanna" Sent: Wednesday, November 10, 2010 11:40am To: user

encoding of values in cassandra

2010-11-10 Thread Koert Kuipers
Cassandra keys and values are just bytes. My values range from simple doubles to complex objects so I need to serialize them with something like avro, thrift or protobuf. Since I am working in a test environment and casssandra is moving to avro I decided to use the avro protocol to communicate

RE: MapReduce/Hadoop in cassandra 0.7 beta3

2010-11-10 Thread Aditya Muralidharan
My bad. Moved to Cassandra user list. -Original Message- From: Aditya Muralidharan [mailto:aditya.muralidha...@nisc.coop] Sent: Wednesday, November 10, 2010 10:48 AM To: u...@pig.apache.org Subject: RE: MapReduce/Hadoop in cassandra 0.7 beta3 Hi, I'm still getting the error associated w

Re: encoding of values in cassandra

2010-11-10 Thread Jonathan Ellis
We are moving towards treating Thrift more as a driver than as a format itself, and using libraries like Hector, pycassa, and phpcassa from the client. On Wed, Nov 10, 2010 at 1:03 PM, Koert Kuipers wrote: > Cassandra keys and values are just bytes. My values range from simple > doubles to comple

multiple datacenter with low replication factor - idea for greater flexibility

2010-11-10 Thread Wayne Lewis
Hello, We've had Cassandra running in a single production data center now for several months and have started detailed plans to add data center fault tolerance. Our requirements do not appear to be solved out-of-the-box with Cassandra. I'd like to share a solution we're planning and find others

[RELEASE] 0.6.7

2010-11-10 Thread Eric Evans
It's been about a month since our last stable update and we've accumulated a few changes[1] worth having, so I'm pleased to announce the release of 0.6.7. If you're coming from a version older than 0.6.6 then please be sure to read the release notes[2]; upgrades from 0.6.6. should be completely s

Re: Range queries using token instead of key

2010-11-10 Thread Edward Capriolo
On Wed, Nov 10, 2010 at 10:05 AM, Anand Somani wrote: > Hi, > > I am trying to iterate over the entire dataset to calculate some > information. Now the way I am trying to do this is by going directly to the > node that has a data range, so here is the route I am following > > get TokenRange using

CF Stats in 0.7beta3

2010-11-10 Thread Rock, Paul
Afternoon all - I'm playing with 0.7beta3 on some boxes I have here at the office and while checking out the stats from one of my tests I'm seeing Write Latency being reported as "0.009 ms". I haven't done any timing yet in my client, but is this really microsecond latency, or is there a mismatc

Re: CF Stats in 0.7beta3

2010-11-10 Thread Ryan King
Yeah, that's really microsecond latency. Note, though that this isn't the full request timing, its just the storage proxy down, so it doesn't account for any latency added by thrift or the network. -ryan On Wed, Nov 10, 2010 at 1:43 PM, Rock, Paul wrote: > Afternoon all - I'm playing with 0.7bet

Non-Unique Indexes, How ?

2010-11-10 Thread J T
Hi, I'm trying to work out a way to support a non-unique index. For example, lets say I have a contact list, where its possible to have Names that are the same but are for different people and so should have different contact entries but I'd want to be able to search on their full name and get a

rename column family with cassandra-cli in 0.7.0-beta3

2010-11-10 Thread gbanks

Re: Non-Unique Indexes, How ?

2010-11-10 Thread Jonathan Ellis
On Wed, Nov 10, 2010 at 5:55 PM, J T wrote: > CF: Contacts (with an index on 'fullname') > key : id1 { fullname : "John Brown", address : "London" } > key : id2 { fullname : "John Brown", address : "Paris"    } > Would the 0.7 index on fullname allow me to lookup the 2 entries if I > searched on "

Re: Non-Unique Indexes, How ?

2010-11-10 Thread J T
Ok, so non-unique indexes are supported, but only full equality matches on the values are supported right now. Will it in the future allow for partial/range matches ? e.g. Find all contacts with a J as the first letter ? Jason On Thu, Nov 11, 2010 at 12:13 AM, Jonathan Ellis wrote: > On Wed,

Re: rename column family with cassandra-cli in 0.7.0-beta3

2010-11-10 Thread Jonathan Ellis
https://issues.apache.org/jira/browse/CASSANDRA-1630 On Wed, Nov 10, 2010 at 6:09 PM, gbanks wrote: > -- Jonathan Ellis Project Chair, Apache Cassandra co-founder of Riptano, the source for professional Cassandra support http://riptano.com

Re: Non-Unique Indexes, How ?

2010-11-10 Thread Jonathan Ellis
Yes. On Wed, Nov 10, 2010 at 6:39 PM, J T wrote: > Ok, so non-unique indexes are supported, but only full equality matches on > the values are supported right now. > Will it in the future allow for partial/range matches ? > > e.g. Find all contacts with a J as the first letter ? > Jason > On Thu,

Unsubscribe

2010-11-10 Thread Vibhaw P Rajan
Warm regards, Vibhaw Rajan Application Developer-Mainframes IBM India Pvt. Ltd. DLF IT Park, Chennai, India Office +91 44 22723552 Mobile +91 996 253 3029 Email vibra...@in.ibm.com "Success is not final, failure is not fatal: it is the courage to continue that counts"

Re: WordCount example problem

2010-11-10 Thread Patrik Modesto
Thanks, I'll do. P. On Wed, Nov 10, 2010 at 16:28, Aditya Muralidharan wrote: > Also, your Mapper class needs to look like this: > MyMapper extends Mapper IColumn>,Text,SumWritable> ... with all the necessary fixes to the map method. > > AD > > -Original Message- > From: Jonathan Ellis [

Re: WordCount example problem

2010-11-10 Thread Patrik Modesto
That's exactly what's happening to me. I wonder why Google did't find it. Thanks! P. On Wed, Nov 10, 2010 at 15:39, Jonathan Ellis wrote: > http://www.mail-archive.com/user@cassandra.apache.org/msg07093.html > > On Wed, Nov 10, 2010 at 5:47 AM, Patrik Modesto > wrote: >> Hi, >> >> I'm trying t