Re: nodetool connection refused

2012-09-08 Thread Manu Zhang
7, 2012 at 11:53 PM, Manu Zhang wrote: > >> When I run Cassandra-trunk in Eclipse, nodetool fail to connect with the >> following error >> "Failed to connect to '127.0.0.1:7199': Connection refused" >> But if I run in terminal, all will be fine. >>

Re: nodetool connection refused

2012-09-09 Thread Manu Zhang
It's more like an Eclipse issue now since I find a "0.0.0.0:7199" listener when executing "bin/cassandra" in terminal but none when running Cassandra in Eclipse. On Sun, Sep 9, 2012 at 12:56 PM, Manu Zhang wrote: > No, I don't find a listener whose port i

Re: nodetool connection refused

2012-09-11 Thread Manu Zhang
problems solved. I didn't add the jmx_host and jmx_port to vm_arguments in Eclipse. How come it is not covered in wiki http://wiki.apache.org/cassandra/RunningCassandraInEclipse ? Or is it outdated? On Mon, Sep 10, 2012 at 10:11 AM, Manu Zhang wrote: > It's more like an Eclipse issu

Re: nodetool connection refused

2012-09-12 Thread Manu Zhang
ckle.com > > On 12/09/2012, at 1:14 PM, Manu Zhang wrote: > > problems solved. I didn't add the jmx_host and jmx_port to vm_arguments in > Eclipse. How come it is not covered in wiki > http://wiki.apache.org/cassandra/RunningCassandraInEclipse ? Or is it > outdat

Re: nodetool connection refused

2012-09-14 Thread Manu Zhang
> http://www.thelastpickle.com > > On 13/09/2012, at 12:41 PM, Manu Zhang wrote: > > I'm afraid we have to include all $JVM_OPTS in the cassandra-env.sh ? > > On Thu, Sep 13, 2012 at 5:49 AM, aaron morton wrote: > >> Thanks for updating the Wiki :) >> >

Re:

2012-09-19 Thread Manu Zhang
: > A few questions: what version of 1.1 are you running. What version of > Hadoop? > > What is your job config? What is the buffer size you've chosen? How much > data are you dealing with? > > On Sep 19, 2012, at 7:23 PM, "Manu Zhang" wrote: > > > I've

Re:

2012-09-19 Thread Manu Zhang
Yeah, BulkLoader. You did help me to elaborate my question. Thanks! On Thu, Sep 20, 2012 at 10:58 AM, Michael Kjellman wrote: > I assumed you were talking about BulkLoader. I haven't played with trunk > yet so I'm afraid I won't be much help here... > > On Sep 19, 20

Re:

2012-09-19 Thread Manu Zhang
the problem seems to have gone away with changing Murmur3Partitioner back to RandomPartitioner On Thu, Sep 20, 2012 at 11:14 AM, Manu Zhang wrote: > Yeah, BulkLoader. You did help me to elaborate my question. Thanks! > > > On Thu, Sep 20, 2012 at 10:58 AM, Michael Kjellma

Re:

2012-09-24 Thread Manu Zhang
for the bulk loader the issue got resolved? > > If not then we might have a bug and your feedback might help the community. > > Regards, > > > > > > On Wed, Sep 19, 2012 at 10:41 PM, Manu Zhang wrote: > >> the problem seems to have gone away with changing Murmu

Re:

2012-09-25 Thread Manu Zhang
restart if you leave this enabled. The text of this > setting is not explicit about the long-term cache shrinkage, so it's easy > to think that it will restore the cache to its configured size after the > pressures have subsided. It won't. > > Charles > > On Tue, Sep 25, 201

Re:

2012-09-25 Thread Manu Zhang
The DEFAULT_CACHING_STRATEGY is Caching.KEYS_ONLY but even configuring row cache size to be greater zero won't enable row cache. Why? On Wed, Sep 26, 2012 at 9:44 AM, Manu Zhang wrote: > I wonder now if "get_range_slices" call will ever look for data in row > cache.

Re:

2012-09-26 Thread Manu Zhang
Aaron Morton > Freelance Developer > @aaronmorton > http://www.thelastpickle.com > > On 25/09/2012, at 7:59 PM, Manu Zhang wrote: > > Is there an example to update column family adding secondary indices with > thrift api? Here's how I do that now...but what if I have a

Re:

2012-09-26 Thread Manu Zhang
strategy for the CF to be ROWS_ONLY. > > Cheers > > - > Aaron Morton > Freelance Developer > @aaronmorton > http://www.thelastpickle.com > > On 26/09/2012, at 2:18 PM, Manu Zhang wrote: > > The DEFAULT_CACHING_STRATEGY is Caching.KEYS_ONLY but eve

Re:

2012-09-30 Thread Manu Zhang
lled. > > Cheers > > - > Aaron Morton > Freelance Developer > @aaronmorton > http://www.thelastpickle.com > > On 27/09/2012, at 2:32 PM, Manu Zhang wrote: > > I still don't see it in jconsole. BTW, how long would you expect to cost > to read a c

Re: how to stop hinted handoff

2012-10-09 Thread Manu Zhang
The example used ONE but the hinted hand off is still performed for every 10 minutes On Wed, Oct 10, 2012 at 10:25 AM, CharSyam wrote: > if you don't use Write Level ANY, It will be automactically turned off. > > Just use Write Level One. > > > 2012/10/9 Manu Zhang >

Re: Attempting to load already loaded column family during startup

2012-10-09 Thread Manu Zhang
well, that didn't go away after I remove all the caches. What should I do now? On Wed, Oct 10, 2012 at 2:15 PM, Manu Zhang wrote: > exception encountered during startup: Attempting to load already loaded > column family system_traces.sessionsjava.lang.RuntimeException: Attemptin

Re: Attempting to load already loaded column family during startup

2012-10-10 Thread Manu Zhang
keyspaces; } In 1.2-beta2, system keyspace will be filtered out. I think I'm gonna update my 1.2 node. On Wed, Oct 10, 2012 at 2:18 PM, Manu Zhang wrote: > well, that didn't go away after I remove all the caches. What should I do > now? > > > On Wed, Oct 10, 20

Re: Why my Cassandra is compacting like mad

2012-10-15 Thread Manu Zhang
oh, forgot to mention that my Cassandra is 1.2beta-2 On Mon, Oct 15, 2012 at 11:11 PM, Michael Kjellman wrote: > I had a similar bug with 1.1.5 but I couldn't reproduce it so I didn't > file a bug. I did a rolling restart of my nodes and things went back to > normal. &

Re: Why my Cassandra is compacting like mad

2012-10-15 Thread Manu Zhang
ober 15, 2012 8:11 AM > To: "user@cassandra.apache.org" > Subject: Re: Why my Cassandra is compacting like mad > > I had a similar bug with 1.1.5 but I couldn't reproduce it so I didn't > file a bug. I did a rolling restart of my nodes and things went back to > normal

Re: Why my Cassandra is compacting like mad

2012-10-16 Thread Manu Zhang
e use of CPU and disk IO. It > will write a lot of small SSTables which will need to be compacted very > often. That was one of my thought when I asked. > > Alain > > > 2012/10/15 Manu Zhang > >> I use default option for compaction: >>in_memory_comp

Re: What does ReadRepair exactly do?

2012-10-22 Thread Manu Zhang
Is it through filter.collateColumns(resolved, iters, Integer.MIN_VALUE) and then MergeIterator.get(toCollate, fcomp, reducer) but I don't know what happens hereafter? How is reconcile exactly been called? On Mon, Oct 22, 2012 at 6:49 AM, aaron morton wrote: > There are two processes in cassandra

Re: What does ReadRepair exactly do?

2012-10-23 Thread Manu Zhang
why repair again? We block until the consistency constraint is met. Then the latest version is returned and repair is done asynchronously if any mismatch. We may retry read if fewer columns than required are returned. On Wed, Oct 24, 2012 at 6:10 AM, shankarpnsn wrote: > Hello, > > This conversa

Re: What does ReadRepair exactly do?

2012-10-23 Thread Manu Zhang
I think so. Otherwise, we may never complete a read if writes come in continuously. On Wed, Oct 24, 2012 at 9:04 AM, shankarpnsn wrote: > manuzhang wrote > > why repair again? We block until the consistency constraint is met. Then > > the latest version is returned and repair is done asynchronou

Re: What does ReadRepair exactly do?

2012-10-24 Thread Manu Zhang
And we don't send read request to all of the three replicas (R1, R2, R3) if CL=QUOROM; just 2 of them depending on proximity On Wed, Oct 24, 2012 at 10:20 PM, Hiller, Dean wrote: > The user will meet the required consistency unless you encounter some kind > of bug in cassandra. You will either

Re: What does ReadRepair exactly do?

2012-10-24 Thread Manu Zhang
oh, it would clarity a lot if you go to read the source code; the method is o.a.c.service.StorageProxy.fetchRows if I remember it correctly On Wed, Oct 24, 2012 at 10:26 PM, Manu Zhang wrote: > And we don't send read request to all of the three replicas (R1, R2, R3) > if CL=QUOROM

Re: What does ReadRepair exactly do?

2012-10-25 Thread Manu Zhang
read quorum doesn't mean we read newest values from a quorum number of replicas but to ensure we read at least one newest value as long as write quorum succeeded beforehand and W+R > N. On Fri, Oct 26, 2012 at 12:00 AM, Hiller, Dean wrote: > Kind of an interesting question > > I think you are sa

how to implement a client with off-heap memory

2012-10-28 Thread Manu Zhang
Hi all, I've been writing a client on Cassandra Thrift API. The client will read almost 1G of data into JVM heap and thus its performance suffers from GC operations. To reduce latency, I'm currently thinking about implementing an off-heap memory (just like that of RowCache) to hold data and manage

Re: distribution of token ranges with virtual nodes

2012-11-01 Thread Manu Zhang
> it will migrate you to virtual nodes by splitting the existing partition > 256 ways. Out of curiosity, is it for the purpose of avoiding streaming? the former would require you to perform a shuffle to achieve that. Is there a nodetool option or are there other ways "shuffle" could be done a

Re: distribution of token ranges with virtual nodes

2012-11-01 Thread Manu Zhang
wrote: > On Thu, Nov 1, 2012 at 10:05 PM, Manu Zhang > wrote: > > > >> it will migrate you to virtual nodes by splitting the existing partition > >> 256 ways. > > > > > > Out of curiosity, is it for the purpose of avoiding streaming? > &

Re: cassandra 1.0.10 : Bootstrapping 7 node cluster to 14 nodes

2012-11-02 Thread Manu Zhang
My guess is that 108 has become a new replica for the streamed data on 103, 104, 107, which is decided by your per-keyspace replica placement strategy. When we bootstrap, we do not simply stream data from 102 to 108. Rather, we calculate all the ranges that 108 is responsible for. So looking from t

get_range_slice gets no rowcache support?

2012-11-07 Thread Manu Zhang
I've asked this question before. And after reading the source codes, I find that get_range_slice doesn't query rowcache before reading from Memtable and SSTable. I just want to make sure whether I've overlooked something. If my observation is correct, what's the consideration here?

Re: get_range_slice gets no rowcache support?

2012-11-08 Thread Manu Zhang
first place. Would someone please clarify it for me? Thanks in advance. On Thu, Nov 8, 2012 at 3:23 PM, Manu Zhang wrote: > I've asked this question before. And after reading the source codes, I > find that get_range_slice doesn't query rowcache before reading from > Memtable and

Re: Read during digest mismatch

2012-11-13 Thread Manu Zhang
If consistency is two, don't we just send data request to one and digest request to another? On Mon, Nov 12, 2012 at 2:49 AM, Jonathan Ellis wrote: > Correct. Which is one reason there is a separate setting for > cross-datacenter read repair, by the way. > > On Thu, Nov 8, 2012 at 4:43 PM, san

Re: composite column validation_class question

2012-11-13 Thread Manu Zhang
thrift http://www.datastax.com/dev/blog/introduction-to-composite-columns-part-1 thrift-to-cql3 http://www.datastax.com/dev/blog/thrift-to-cql3 On Wed, Nov 14, 2012 at 1:57 AM, Tyler Hobbs wrote: > I'm not completely sure what the latest Thrift behavior around this is > (it's never been great),

Re: unable to read saved rowcache from disk

2012-11-13 Thread Manu Zhang
o do not save a big > row cache. > > > On Tuesday, November 13, 2012, Manu Zhang wrote: > > I have a rowcache provieded by SerializingCacheProvider. > > The data that has been read into it is about 500MB, as claimed by > jconsole. After saving cache, it is around 15MB on

Re: unable to read saved rowcache from disk

2012-11-13 Thread Manu Zhang
only do a > small number / reads/sec. > > On Tue, Nov 13, 2012 at 9:47 PM, Manu Zhang > wrote: > > "incorrect"... what do you mean? I think it's only 15MB, which is not > big. > > > > > > On Wed, Nov 14, 2012 at 10:38 AM, Edward Capriolo &

Re: unable to read saved rowcache from disk

2012-11-14 Thread Manu Zhang
long time to read back 50K > entries. Unless the source table was very large and you can only do a > small number / reads/sec. > > On Tue, Nov 13, 2012 at 9:47 PM, Manu Zhang > wrote: > > "incorrect"... what do you mean? I think it's only 15MB, which is not

Re: unable to read saved rowcache from disk

2012-11-14 Thread Manu Zhang
OOM at deserializing 747321th row On Thu, Nov 15, 2012 at 9:08 AM, Manu Zhang wrote: > oh, as for the number of rows, it's 165. How long would you expect it > to be read back? > > > On Thu, Nov 15, 2012 at 3:57 AM, Wei Zhu wrote: > >> Good information Edward.

Re: unable to read saved rowcache from disk

2012-11-14 Thread Manu Zhang
saved rowcache from disk > From: Manu Zhang > To: user@cassandra.apache.org > CC: > > > OOM at deserializing 747321th row > > > On Thu, Nov 15, 2012 at 9:08 AM, Manu Zhang wrote: > >> oh, as for the number of rows, it's 165. How long would you expect i

Re: unable to read saved rowcache from disk

2012-11-14 Thread Manu Zhang
- > Subject: Re: unable to read saved rowcache from disk > From: Manu Zhang > To: user@cassandra.apache.org > CC: > > > add a counter and print out myself > > > On Thu, Nov 15, 2012 at 1:51 PM, Wz1975 wrote: > >> Curious where did you see

Re: unable to read saved rowcache from disk

2012-11-16 Thread Manu Zhang
500m, 1.6m rows, each row average > 300B, so 700k row should be a little over 200m, unless it is reading more, > maybe tombstone? Or the rows on disk have grown for some reason, but > row cache was not updated? Could be something else eats up the memory. > You may profi

Re: unable to read saved rowcache from disk

2012-11-18 Thread Manu Zhang
> > If you are using the off heap cache the upper bound is memory. If you are > using the on head it's the JVM heap. But as I said earlier, I could not watch the usage of JVM heap while reading saved caches

Re: get_range_slice gets no rowcache support?

2012-11-18 Thread Manu Zhang
ere are still a few use cases where using the > cache would be beneficial, so I'm sure there's a ticket out there somewhere > that presents a few options for supporting this. > > > On Thu, Nov 8, 2012 at 8:39 PM, Manu Zhang wrote: > >> I did overlook something

Re: CQL3 Compound Primary Keys - Do I have the right idea?

2012-12-24 Thread Manu Zhang
> > CREATE TABLE seen_ships ( >day text, >time_seen timestamp, >shipname text, >PRIMARY KEY (day, time_seen) >); In CQL3, we could select all the columns with the same 'day' and same 'time_seen'. Is it possible with cassandra-cli? On Mon, Dec 24, 2012 at 6:5

Re: Changing rpc_port in cassandra.yaml has no effect

2012-12-26 Thread Manu Zhang
assume you're using a *nix system, the command to check which program is using port 9160: netstat -anp | grep 9160 On Wed, Dec 26, 2012 at 5:56 AM, Bob Futrelle wrote: > I have been using cqlsh (and --cql3) successfully for a few weeks. > But yesterday it stopped working, with the all too famili

Re: num_tokens - virtual nodes

2013-01-04 Thread Manu Zhang
https://www.youtube.com/watch?v=GddZ3pXiDys&list=PLC5E3906433F5A165&index=28 This video of Cassandra summit 2012 has mentioned the use of 256 tokens by default (though it's no longer in the conf/cassandra.yaml). I remember that more tokens could lead to more disk seeks or something. I think 256 is

Re: initial_token configuration

2013-01-09 Thread Manu Zhang
no bother. I've seen the codes. On Wed, Jan 9, 2013 at 11:57 AM, Manu Zhang wrote: > # If blank, Cassandra will request a token bisecting the range of >> # the heaviest-loaded existing node. If there is no load information >> # available, such as is the case with a new c

Re: distribution of token ranges with virtual nodes

2013-01-09 Thread Manu Zhang
Is cassandra-shuffle command in the trunk? Or it is only included in the Debian package? I don't find it in the trunk. On Sat, Nov 3, 2012 at 2:18 AM, Eric Evans wrote: > On Fri, Nov 2, 2012 at 12:38 AM, Manu Zhang > wrote: > >> It splits into a contiguous range, becaus

Re: distribution of token ranges with virtual nodes

2013-01-09 Thread Manu Zhang
sorry, I missed it since it's not executable by default. On Thu, Jan 10, 2013 at 10:05 AM, Jason Wee wrote: > It should be in the trunk, check it > https://github.com/apache/cassandra/blob/trunk/bin/cassandra-shuffle > > > On Thu, Jan 10, 2013 at 1:18 AM, Manu Zhang wrot

Re: Cassandra source code explained

2013-01-22 Thread Manu Zhang
On Wed 23 Jan 2013 01:10:58 AM CST, Radek Gruchalski wrote: Thank you. I found this but was hoping that there's anything broader out there. This will have to be enough. Kind regards,
 Radek Gruchalski radek.gruchal...@technicolor.com | radek.gruchal...@p

why set replica placement strategy at keyspace level ?

2013-01-26 Thread Manu Zhang
Although I've got to know Cassandra for quite a while, this question only has occurred to me recently: Why are the replica placement strategy and replica factors set at the keyspace level? Would setting them at the column family level offers more flexibility? Is this because it's easier for

Re: why set replica placement strategy at keyspace level ?

2013-01-28 Thread Manu Zhang
to mind. Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 27/01/2013, at 4:15 PM, Manu Zhang wrote: Although I've got to know Cassandra for quite a while, this question only has occurred to me recently: Why ar

Re: why set replica placement strategy at keyspace level ?

2013-01-29 Thread Manu Zhang
ld be split into 4, which would then be sent to different replicas. Even if they went to the same replicas they would be written to the commit log as different mutations. So if you have RS per CF you lose atomic commits for writes to the same row. Cheers ----- Aaron Morton Freelance

how RandomPartitioner calculate tokens

2013-01-30 Thread Manu Zhang
*codes. By the way, the documentation doesn't mention how Murmur3Partitioner calculate tokens for multiple data center. Assuming it doesn't calculate tokens per data center, what difference between Murmur3Partitioner and RandomPartitioner has made that unnecessary? *Thanks. * *Manu Zhang* *

Re: how RandomPartitioner calculate tokens

2013-01-30 Thread Manu Zhang
r both partitioner is not exactly the same. And as a side note, if you use vnodes, you don't really have to bother about manually assigning tokens for nodes. -- Sylvain On Wed, Jan 30, 2013 at 9:22 AM, Manu Zhang mailto:owenzhang1...@gmail.com>> wrote: Hi, As per the Datastax Ca

Re: Understanding Virtual Nodes on Cassandra 1.2

2013-01-30 Thread Manu Zhang
On Wed 30 Jan 2013 02:29:27 AM CST, Zhong Li wrote: One more question, can I add a virtual node manually without reboot and rebuild a host data? I checked nodetool command, there is no option to add a node. Thanks. Zhong On Jan 29, 2013, at 11:09 AM, Zhong Li wrote: I was misunderstood thi

Re: why set replica placement strategy at keyspace level ?

2013-01-30 Thread Manu Zhang
- Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 30/01/2013, at 12:33 PM, Manu Zhang wrote: On Tue 29 Jan 2013 03:39:17 PM CST, aaron morton wrote: So If I write to CF Users with rowkey="dean" and to CF Schedules w

Re: Understanding Virtual Nodes on Cassandra 1.2

2013-01-31 Thread Manu Zhang
, On Jan 30, 2013, at 10:21 AM, Manu Zhang wrote: On Wed 30 Jan 2013 02:29:27 AM CST, Zhong Li wrote: One more question, can I add a virtual node manually without reboot and rebuild a host data? I checked nodetool command, there is no option to add a node. Thanks. Zhong On Jan 29, 2013, at 11

Re: Multi-range gets/deletes in Thrift/CQL

2013-01-31 Thread Manu Zhang
On Fri 25 Jan 2013 07:42:03 PM CST, Ravikumar Govindarajan wrote: I came across two issue fixes in 1.2 release a) Cassandra-3885 - Specify Multiple Ranges in SliceQueryFilter b) Cassandra-3708 - Specify range during tombstone ops We have composite columns in our app and wish to utilize the abov

Re: Start token sorts after end token

2013-02-02 Thread Manu Zhang
On Fri 01 Feb 2013 06:56:54 PM CST, Jeremy Hanna wrote: See https://issues.apache.org/jira/browse/CASSANDRA-5168 - should be fixed in 1.1.10 and 1.2.2. On Jan 30, 2013, at 9:18 AM, Tejas Patil wrote: While reading data from Cassandra in map-reduce, I am getting "InvalidRequestException(why:

Re: Issues with CQLSH in Cassandra 1.2

2013-02-02 Thread Manu Zhang
On Tue 29 Jan 2013 03:55:52 AM CST, aaron morton wrote: I was able to replicate it… $ bin/nodetool -h 127.0.0.1 -p 7100 describering foo Schema Version:253da4a3-e277-35b5-8d04-dbeeb3c9508e TokenRange: TokenRange(start_token:3074457345618258602, end_token:-9223372036854775808, endpoints

Re: Start token sorts after end token

2013-02-02 Thread Manu Zhang
On Fri 01 Feb 2013 06:56:54 PM CST, Jeremy Hanna wrote: See https://issues.apache.org/jira/browse/CASSANDRA-5168 - should be fixed in 1.1.10 and 1.2.2. On Jan 30, 2013, at 9:18 AM, Tejas Patil wrote: While reading data from Cassandra in map-reduce, I am getting "InvalidRequestException(why:

Re: CQL : Date comparison in where clause fails

2013-02-03 Thread Manu Zhang
On Sun 03 Feb 2013 07:36:58 AM CST, Paul van Hoven wrote: I've got a table that has a column called date. I created an index on the column date with the following command: CREATE INDEX date_key ON ola (date); Now, I can perform the following command: select * from ola where date = '2013-01-01'

Re: Upgrade to Cassandra 1.2

2013-02-03 Thread Manu Zhang
On Sun 03 Feb 2013 05:45:56 AM CST, Daning Wang wrote: I'd like to upgrade from 1.1.6 to 1.2.1, one big feature in 1.2 is that it can have multiple tokens in one node. but there is only one token in 1.1.6. how can I upgrade to 1.2.1 then breaking the token to take advantage of this feature? I we

Re: CQL : Date comparison in where clause fails

2013-02-03 Thread Manu Zhang
um cannot be restricted by more than one relation if it includes an Equal Perhaps you meant to use CQL 2? Try using the -2 option when starting cqlsh. So, this still fails. Therefore I'm not shure whether I missunderstand the issue or if it does not solve my problem. 2013/2/3 Manu Zhang : On

Re: CQL : Request did not complete within rpc_timeout

2013-02-03 Thread Manu Zhang
On Mon 04 Feb 2013 04:42:12 AM CST, Paul van Hoven wrote: Thanks for the answer. Can anybody else answer my other two questions, because my problem is not solved yet? 2013/2/3 Edward Capriolo : This was the issue that prompted the "WITH FILTERING ALLOWED": https://issues.apache.org/jira/browse

Re: Upgrade to Cassandra 1.2

2013-02-12 Thread Manu Zhang
gt;> -dc, --only-dc Apply only to named DC (create only) >>> -tp, --thrift-port Thrift port number (Default: 9160) >>> -p, --port JMX port number (Default: 7199) >>> -tf, --thrift-framed Enable framed transport for Thrift (Default