Java Driver paging slower than manual/token paging?

2015-07-17 Thread Keith Freeman
We've recently started upgrading from 1.2.12 to 2.1.7. In 1.2.12 we wrote code that used the well-known pagination pattern (tokens) to process all rows in one of our tables. For 2.1.7 we tried replacing that code with the new built-in pagination code: List queryRows = new ArrayList<>();

Re: any code to load large data from web into Cassandra

2014-12-27 Thread Keith Sterling
Convert the CSV etc. into SSTables and then use sstableloader to load directly into Cassandra. K --  Keith Sterling Head of Software E: keith.sterl...@first-utility.com P: +44 7771 597 630 W: first-utility.com A: Opus 40 Business Park,  Haywood Road, Warwick CV34 5AH On Sat, Dec 27

Re: any API to load large data from web into Cassandra

2014-12-26 Thread Keith Sterling
Take a look at sstableloader. We use it to load 30+m rows into Cassandra. The DataStax documentation is a good start  --  Keith Sterling Head of Software E: keith.sterl...@first-utility.com P: +44 7771 597 630 W: first-utility.com A: Opus 40 Business Park,  Haywood Road, Warwick

Re: Full GC in cassandra

2014-07-28 Thread Keith Wright
What does your cfhistograms output look like? From: Ruchir Jha <ruchir@gmail.com> Reply-To: "user@cassandra.apache.org" <user@cassandra.apache.org> Date: Monday, July 28, 2014 at 10:43 AM To: "user@cassandra.apache.org"

Re: Hot, large row

2014-07-28 Thread Keith Wright
@cassandra.apache.org>" <user@cassandra.apache.org> Subject: Re: Hot, large row Is it the accumulated tombstones on a row that make it act as if “wide”? Does cfhistograms count the tombstones or subtract them when reporting on cell-count for rows? (I don’t know.) -- Jack Krupansky Fr

Re: Hot, large row

2014-07-25 Thread Keith Wright
Ha, check out who filed that ticket! Yes I’m aware of it. My hope is that it was mostly addressed in CASSANDRA-6563 so I may upgrade from 2.0.6 to 2.0.9. I’m really just surprised that others are not doing similar actions as I and thus experiencing similar issues. To answer DuyHai’s questio

Re: Hot, large row

2014-07-25 Thread Keith Wright
Did you use native paging of the Java Driver to avoid loading all columns at a time? And is loading all SKUs for one user a rare scenario, or is it your main use case for this column family? On 25 Jul 2014 04:11, "Keith Wright" <kwri...@nanigans.com> wrote: One last

Re: Hot, large row

2014-07-24 Thread Keith Wright
age cases and thus having the same issue? I am hoping to upgrade to 2.0.9 which has improvements to remove tombstones. From: Keith Wright <kwri...@nanigans.com> Reply-To: "user@cassandra.apache.org" <user@cassandra.apache.org

Re: Hot, large row

2014-07-24 Thread Keith Wright
up to 100 ms. If I were you, I'd check for disk I/O first and maybe CPU usage On Thu, Jul 24, 2014 at 10:32 PM, Keith Wright <kwri...@nanigans.com> wrote: Cfhistograms for the tables I believe are most likely the issue are below on the node that most recently presented the

Re: Hot, large row

2014-07-24 Thread Keith Wright
lication is issuing that may overload the server. I know that's easier said than done but I don't have any other idea for the moment Shameless self-promotion -- To support this kind of live prod debugging & investigation, I added a new dynamic query logg

Re: Hot, large row

2014-07-24 Thread Keith Wright
a for the moment Shameless self-promotion -- To support this kind of live prod debugging & investigation, I added a new dynamic query logging feature in Achilles: https://github.com/doanduyhai/Achilles/wiki/Statements-Logging-and-Tracing#dynamic-statements-logging Once you h

Re: Hot, large row

2014-07-24 Thread Keith Wright
"events" & "sessions". It may help or not since the tracing is done once every 1000. "Any way to get the server to blacklist these wide rows automatically?" --> No On Thu, Jul 24, 2014 at 8:48 PM, Keith Wright <kwri...@nanigans.com> wrote: Hi al

Hot, large row

2014-07-24 Thread Keith Wright
Hi all, We are seeing an issue where basically daily one of our nodes spikes in load and is churning in CMS heap pressure. It appears that reads are backing up and my guess is that our application is reading a large row repeatedly. Our write structure can lend itself to wide rows very infr

Re: Cassandra use cases/Strengths/Weakness

2014-07-14 Thread Keith Freeman
We've struggled getting consistent write latency & linear write scalability with a pretty heavy insert load (1000's of records/second), and our records are about 1k-2k of data (mix of integer/string columns and a blob). Wondering if you have any rough numbers for your "small to medium write si

Re: possible to have TTL on individual collection values?

2014-07-12 Thread Keith Wright
Yes each item in the set can have a different TTL so long as they are upserted with commands having differing TTLs. You should read about how collections/maps work in CQL3 in terms of their CQL2 structure. On Jul 12, 2014 7:32 PM, Kevin Burton wrote: If I have a set.. can I have the individual

Re: Different TTLs for different columns requires two INSERTs?

2014-07-12 Thread Keith Wright
My understanding is that yes you need two inserts which you could do as part of a batch. On Jul 12, 2014 7:24 PM, Kevin Burton wrote: I have a column that I would like to have a different TTL than the whole row. because of the cassandra USING TTL clause I don't think this is possible to do wit

Changing default_time_to_live

2014-05-19 Thread Keith Wright
Hi all, we are using C* 2.0.6 and have set the default_time_to_live parameter on a number of our LCS column families. I was wondering what would happen if we were to decrease this value via a table alter. Would subsequent compactions of data written before that alter honor the new value and re

Re: Tombstones

2014-05-16 Thread Keith Wright
Note that Cassandra will not compact away some tombstones if you have differing column TTLs. See the following jira and resolution I filed for this: https://issues.apache.org/jira/browse/CASSANDRA-6654 On May 16, 2014 4:49 PM, Chris Lohfink wrote: It will delete them after gc_grace_seconds (se

any way to REALLY turn off commitlog?

2014-03-27 Thread Keith Freeman
We're running an insert-heavy use-case and have set "durable_writes = false" for all of our keyspaces. While inserts are coming in (about 2000 1k-records per second), we are still seeing 50Mb written to files in the commitlog directory every 6-10 seconds (using iostat). Anybody know why so mu

Re: How expensive are additional keyspaces?

2014-03-11 Thread Keith Wright
Does this hold true for the native protocol? I’ve noticed that you can create a session object in the datastax driver without specifying a keyspace and so long as you include the keyspace in all queries instead of just table name, it works fine. In that case, I assume there’s only one connect

Re: need help with Cassandra 1.2 Full GCing -- output of jmap histogram

2014-03-10 Thread Keith Wright
I also want to point out an issue I filed that was closed as not an issue: CASSANDRA-6654 Basically if you’re using mixed TTLs on columns in a row, the “oldest” TTL is used to determine if tombstones of other columns can be removed. In other words, if you have a column with a 1 day TTL and a

Compaction does not remove tombstones if column has higher TTL

2014-02-28 Thread Keith Wright
FYI – I recently filed https://issues.apache.org/jira/browse/CASSANDRA-6654 and wanted to let everyone know the result as it was not what I expected. I am using C* 1.2.12 and found that my droppable tombstone ratio kept increasing on an LCS table (currently > .3). Documentation states that co

Re: Intermittent long application pauses on nodes

2014-02-13 Thread Keith Wright
C* to 2.0.x and Java to 1.7 seems to have helped our issues. 2014-02-13 Keith Wright <kwri...@nanigans.com>: Frank did you ever file a ticket for this issue or find the root cause? I believe we are seeing the same issues when attempting to bootstrap. Thanks From: R

Re: Intermittent long application pauses on nodes

2014-02-12 Thread Keith Wright
Frank did you ever file a ticket for this issue or find the root cause? I believe we are seeing the same issues when attempting to bootstrap. Thanks From: Robert Coli <rc...@eventbrite.com> Reply-To: "user@cassandra.apache.org" <user@cassandra.apa

RE: Recommended OS

2014-02-10 Thread Keith Wright
t; wrote: We’re running C* 2.0.5 under CentOS 6.5 and have not noticed anything like you describe. We have just a couple of pre-production rings (Dev and Test) meaning nothing we have has received particularly intense utilization. Corwin From: Keith Wright [mailto:kwri...@nanigans.com] Se

Re: Recommended OS

2014-02-10 Thread Keith Wright
: Don Jackson <djack...@nanigans.com>, Dave Carroll <dcarr...@nanigans.com> Subject: Re: Recommended OS What issues are you running into with CentOS 6.4/5? I’m running 1.2.8 on CentOS 6.3 and Java 1.7.0-25, and about to test with 1.7.latest. -- Josh Sholes Fr

Recommended OS

2014-02-10 Thread Keith Wright
Hi all, I was wondering what operating systems and versions people are running with success in production environments? We are using C* 1.2.13 and have had issues using CentOS 6.4/6.5. Are others using that OS? What would people recommend? What about Java 6 vs 7 (specific versions?!)?

Re: Auto-Bootstrap not Auto-Bootstrapping?

2014-02-06 Thread Keith Wright
Is it a seed node? My understanding is that they do not bootstrap On Feb 6, 2014 4:23 PM, Thunder Stumpges wrote: Hi all, We recently needed/wanted to reconfigure the disks for our 3-node C*2.0.4 Cassandra setup and rebuild the server at the same time. Upon adding the newly rebuilt server int

Re: Move to smaller nodes

2014-02-05 Thread Keith Wright
ocker for us. Thanks On Feb 5, 2014 7:04 PM, Robert Coli wrote: On Wed, Feb 5, 2014 at 11:18 AM, Keith Wright <kwri...@nanigans.com> wrote: Hi Rob, thanks for the response! Interestingly if we run a repair we don’t see the bootstrap issue so I am considering doing the empty

Re: Move to smaller nodes

2014-02-05 Thread Keith Wright
Also there is one more option which is we could upgrade to 2.0 in the hopes that our issue is fixed as part of the streaming overhaul. But seeing as this is a production cluster and 2.0 does not yet appear production ready, that makes me nervous. From: Keith Wright <kwri...@nanigans.com

Re: Move to smaller nodes

2014-02-05 Thread Keith Wright
Hi Rob, thanks for the response! Interestingly if we run a repair we don’t see the bootstrap issue so I am considering doing the empty node repair methodology. It's just that it usually takes a week for that to work. As I see it, we could try the following to fix the bootstrap issue: * Upd

Move to smaller nodes

2014-02-05 Thread Keith Wright
Hi all, Earlier today I emailed about issues we’re having bootstrapping nodes into our existing cluster. One theory we have is that our nodes are simply too large and are considering moving to more, smaller nodes. However, because we cannot bootstrap it makes it difficult. As I see it, w

Re: Bootstrap failure

2014-02-05 Thread Keith Wright
) } Total time for which application threads were stopped: 0.0280980 seconds Application time: 0.5525030 seconds Total time for which application threads were stopped: 0.0547220 seconds Application time: 0.0009410 seconds {Heap before GC invocations=520 (full 3): From: Keith Wright mailto:k

Bootstrap failure

2014-02-05 Thread Keith Wright
Hi all, We have been struggling with the inability to bootstrap nodes into our 1.2.13 environment with Vnodes using centos 6.4 with Java 7. We have an 8 node cluster (32 GB RAM, dual hex core, SSDs, 8 GB heap with 1200 MB eden space, RF3) with around 1 TB per node using murmur3. When we

Re: Automatic tombstone compaction

2014-02-03 Thread Keith Wright
Hi all, I am running C* 1.2.13 with Vnodes at around 1 TB / node. I just noticed that one of my larger LCS CFs (300-400 GB/Node) is showing a droppable tombstone ratio of between 23 and 28% on my nodes. I did not indicate a value to be used in my table creation so I assume it's using the def

Re: com.datastax.driver.core.exceptions.InvalidTypeException: Invalid type for value 1 of CQL type text, expecting class java.lang.String but class [Ljava.lang.Object; provided

2013-12-07 Thread Keith Wright
As the comment in your code suggests, you need to cast the array passed to the bind method as Object[]. This is true anytime you pass an array to a varargs method. On Dec 7, 2013 4:01 PM, Techy Teck wrote: I am trying to insert into Cassandra database using Datastax Java driver. But everytime

Re: cleanup failure; FileNotFoundException deleting (wrong?) db file

2013-11-06 Thread Keith Freeman
Is it possible that the keyspace was dropped then re-created ( https://issues.apache.org/jira/browse/CASSANDRA-4857)? I've seen similar stack traces in that case. On 11/05/2013 10:47 PM, Elias Ross wrote: I'm seeing the following: Caused by: java.lang.RuntimeException: java.io.FileNotFoundEx

Re: CQL & selecting individual items from a map

2013-10-29 Thread Keith Freeman
There's some rationale here: http://mail-archives.apache.org/mod_mbox/cassandra-user/201305.mbox/%3CCAENxBwx6pcSA=cWn=dkw_52k5odw5f3xigj-zn_4bwfth+4...@mail.gmail.com%3E And I'm sure part of the reason is the 64k size limit: maps (and sets and lists) are limited to 64k total size (http://wiki.

Re: Disappearing index data.

2013-10-07 Thread Keith Freeman
We use Jmxterm: http://wiki.cyclopsgroup.org/jmxterm On 10/07/2013 07:53 AM, Tom van den Berge wrote: Thanks, I'll give that a try. Is there a way to do this without JMX? I wouldn't know how to run a JMX console on my production servers without a graphical interface. On Mon, Oct 7, 2013 at

Re: Unable to bootstrap new node

2013-10-03 Thread Keith Wright
ly-To: "user@cassandra.apache.org" <user@cassandra.apache.org> Date: Wednesday, October 2, 2013 1:55 PM To: "user@cassandra.apache.org" <user@cassandra.apache.org> Subject: Re: U

Unable to bootstrap new node

2013-10-02 Thread Keith Wright
Hi all, We are running C* 1.2.8 with Vnodes enabled and are attempting to bootstrap a new node and are having issues. When we add the node we see it bootstrap and we see data start to stream over from other nodes however we are seeing one of the other nodes get stuck in full GCs to the poin

BigTable-like Versioned Cells, Importing PostgreSQL Data

2013-09-19 Thread Keith Bogs
struggling because most of the documentation revolves around Java. I'm most comfortable with Ruby and (increasingly) Go. I'd appreciate any insights, would really like to get Cassandra going for real. It's been such a pleasure to setup vs. HBase and whatnot. Keith

commitlog partition

2013-09-16 Thread Keith Freeman
I'm spec'ing out some hardware for a small cassandra cluster. I know the recommendation (v1.2+) on spinning media is to have the commitlog on a separate physical disk from the data, but is it considered ok for performance to put the commitlog on a partition of the OS's disk?

Re: heavy insert load overloads CPUs, with MutationStage pending

2013-09-13 Thread Keith Freeman
Paul- Sorry to go off-list but I'm diving pretty far into details here. Ignore if you wish. Thanks a lot for the example, definitely very helpful. I'm surprised that the Cassandra experts aren't more interested-in/alarmed-by our results, it seems like we've proved that insert performance fo

Re: heavy insert load overloads CPUs, with MutationStage pending

2013-09-12 Thread Keith Freeman
ster. However, nothing worked remotely as well as the batch_mutate when it came to writing a full wide-row at once. I think Cassandra 2.0 makes CQL work better for these cases (CASSANDRA-4693), but I haven't tested it yet. -Paul -Original Message- From: Keith Freeman [mailto:8fo...

Re: heavy insert load overloads CPUs, with MutationStage pending

2013-09-11 Thread Keith Freeman
efault) and see what happens. However, given that there are no entries in 'All time blocked' for such, they may be something else. How are you inserting the data? On Tue, Sep 10, 2013 at 12:40 PM, Keith Freeman <8fo...@gmail.com <mailto:8fo...@gmail.com

Re: heavy insert load overloads CPUs, with MutationStage pending

2013-09-11 Thread Keith Freeman
stions/18522191/using-cassandra-and-cql3-how-do-you-insert-an-entire-wide-row-in-a-single-reque . I was able to solve my issue by switching to using the thrift batch_mutate to write a full wide-row at once instead of using many CQL INSERT statements. -Paul -Original Message----- From: Keith

Re: FileNotFoundException while inserting (1.2.8)

2013-09-11 Thread Keith Freeman
n testing, this is the first time I've been able to follow-up and report it to the mailing list. On 09/11/2013 10:55 AM, Robert Coli wrote: On Wed, Sep 11, 2013 at 6:49 AM, Keith Freeman <8fo...@gmail.com <mailto:8fo...@gmail.com>> wrote: Yes, I started with a fresh keyspace

Re: heavy insert load overloads CPUs, with MutationStage pending

2013-09-11 Thread Keith Freeman
On 09/10/2013 11:42 AM, Nate McCall wrote: With SSDs, you can turn up memtable_flush_writers - try 3 initially (1 by default) and see what happens. However, given that there are no entries in 'All time blocked' for such, they may be something else. Tried that, it seems to have reduced the loads

Re: heavy insert load overloads CPUs, with MutationStage pending

2013-09-11 Thread Keith Freeman
I have RF=2 On 09/10/2013 11:18 AM, Robert Coli wrote: On Tue, Sep 10, 2013 at 10:17 AM, Robert Coli <rc...@eventbrite.com> wrote: On Tue, Sep 10, 2013 at 7:55 AM, Keith Freeman <8fo...@gmail.com <mailto:8fo...@gmail.com>> wrote: On my 3-node cl

Re: FileNotFoundException while inserting (1.2.8)

2013-09-11 Thread Keith Freeman
Yes, I started with a fresh keyspace (dropped and re-created) to run this test. On 09/10/2013 02:01 PM, sankalp kohli wrote: Have you dropped and recreated a keyspace with the same name recently? On Tue, Sep 10, 2013 at 8:40 AM, Keith Freeman <8fo...@gmail.com <mailto:8fo...@gma

heavy insert load overloads CPUs, with MutationStage pending

2013-09-10 Thread Keith Freeman
On my 3-node cluster (v1.2.8) with 4-cores each and SSDs for commitlog and data, I get high CPU loads during a heavy-ish wide-row insert load into a single CF (5000 1k inserts/sec), e.g. uptime load avg for last minute 18/11/10. Checking tpstats, I see MutationStage pending on all the nodes, e

Re: heavy insert load overloads CPUs, with MutationStage pending

2013-09-10 Thread Keith Freeman
On 09/10/2013 11:17 AM, Robert Coli wrote: On Tue, Sep 10, 2013 at 7:55 AM, Keith Freeman <8fo...@gmail.com <mailto:8fo...@gmail.com>> wrote: On my 3-node cluster (v1.2.8) with 4-cores each and SSDs for commitlog and data On SSD, you don't need to separate commit

FileNotFoundException while inserting (1.2.8)

2013-09-10 Thread Keith Freeman
While running a heavy insert load, one of my nodes started throwing this exception when trying a compaction: INFO [CompactionExecutor:23] 2013-09-09 16:08:07,528 CompactionTask.java (line 105) Compacting [SSTableReader(path='/var/lib/cassandra/data/smdb/tracedata/smdb-tracedata-ic-6-Data.db')

Re: Does collection in CQL3 have certain limits?

2013-09-02 Thread Keith Wright
I know that the size is limited to max short (~32k) because when deserializing the response from the server, the first item returned is the number of items and it's a short. That being said you could likely handle this by looking for the overflow and allowing double max short. Vikas Goyal wrot
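The "look for the overflow" idea in the snippet above can be sketched roughly as follows. This is only an illustration, not driver code: the 2-byte big-endian layout and the helper name `read_count` are assumptions made for the example.

```python
import struct

def read_count(buf):
    """Read a 2-byte element count, tolerating signed-short overflow.

    Assumption for this sketch: the count is the first two bytes,
    big-endian. `read_count` is a hypothetical helper name.
    """
    (n,) = struct.unpack(">h", buf[:2])  # decode as a signed short first
    if n < 0:
        # A negative value means the writer exceeded 32767, so the same
        # bits are reinterpreted as an unsigned short (up to 65535).
        n += 65536
    return n
```

Reading the same two bytes as unsigned is what doubles the usable range; counts beyond 65535 still cannot be represented in two bytes.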

Re: insert performance (1.2.8)

2013-08-26 Thread Keith Freeman
experience that shuffling the inserts across rows for wide rows gave us "normal" insert rates. When you mutate an entire wide row at once it hits a bottleneck. On Mon, Aug 26, 2013 at 4:49 PM, Keith Freeman <8fo...@gmail.com <mailto:8fo...@gmail.com>> wrote: I

Re: how-to scan a table using CQL3

2013-08-26 Thread Keith Wright
New Zealand @aaronmorton http://www.thelastpickle.com On 11/05/2013, at 9:23 AM, Thorsten von Eicken <t...@rightscale.com> wrote: Thanks, this is interesting, but if I'm not mistaken, Astyanax uses CQL2. I'm trying to find a CQL3 solution on top of the binary protocol. There has

Re: insert performance (1.2.8)

2013-08-21 Thread Keith Freeman
tstruction. The increase will be doing a lot more in the same payload message. Otherwise CQL is more efficient. If you do build those giant strings, yes you should see a performance improvement. On Tue, Aug 20, 2013 at 8:03 PM, Keith Freeman <8fo...@gmail.com <mailto:8fo...@gmail.com

Re: Nodes get stuck

2013-08-20 Thread Keith Wright
nodetool stop cleanup the java task for stopping cleanup itself is at 1500% and has not returned after 2 minutes. This is VERY odd behavior. Any ideas? Hardware failure? Network? We are not seeing anything there but wanted to get ideas. Thanks From: Keith Wright mailto:kwri

Re: insert performance (1.2.8)

2013-08-20 Thread Keith Freeman
ke in your case). Again, apologies, I would not have recommended that route if I knew it was only in 2.0. I would be willing to bet you could hit those insert numbers pretty easily with thrift given the shape of your mutation. On Tue, Aug 20, 2013 at 5:00 PM, Keith Freeman <8fo...@gmail.com

Nodes get stuck

2013-08-20 Thread Keith Wright
Hi all, We are using C* 1.2.4 with Vnodes and SSD. We have seen behavior recently where 3 of our nodes get locked up in high load in what appears to be a GC spiral while the rest of the cluster (7 total nodes) appears fine. When I run a tpstats, I see the following (assuming tpstats retur

Re: insert performance (1.2.8)

2013-08-20 Thread Keith Freeman
at what RF and consistency level are you writing? On Tue, Aug 20, 2013 at 8:56 AM, Keith Freeman <8fo...@gmail.com <mailto:8fo...@gmail.com>> wrote: Ok, I'll try prepared statements. But while sending my statements async might speed up my client, it wouldn'

Re: insert performance (1.2.8)

2013-08-20 Thread Keith Freeman
ssion.executeAsync coupled with Google Guava Futures API to get better throughput on the client side. On Mon, Aug 19, 2013 at 10:14 PM, Keith Freeman <8fo...@gmail.com <mailto:8fo...@gmail.com>> wrote: Sure, I've tried different numbers for batches and threads, but gen

Re: insert performance (1.2.8)

2013-08-19 Thread Keith Freeman
bove, not much else to suggest without seeing some example code (on pastebin, gist or similar, ideally). On Mon, Aug 19, 2013 at 5:49 PM, Keith Freeman <8fo...@gmail.com <mailto:8fo...@gmail.com>> wrote: I've got a 3-node cassandra cluster (16G/4-core VMs ESXi v5 on 2.5Gh

insert performance (1.2.8)

2013-08-19 Thread Keith Freeman
I've got a 3-node cassandra cluster (16G/4-core VMs ESXi v5 on 2.5Ghz machines not shared with any other VMs). I'm inserting time-series data into a single column-family using "wide rows" (timeuuids) and have a 3-part partition key so my primary key is something like ((a, b, day), in-time-uuid

write load while idle?

2013-08-16 Thread Keith Freeman
I have a 3-node cluster running 1.2.8, and with no clients connected (for about an hour) opscenter is showing a heartbeat-like pattern for total writes in the "Cluster Reads & Writes" panel on the dashboard ranging from about 10/sec to 26/sec. Total reads on the other hand are showing a straig

Re: token(), limit and wide rows

2013-08-16 Thread Keith Freeman
I've run into the same problem, surprised nobody's responded to you. Any time someone asks "how do I page through all the rows of a table in CQL3?", the standard answer is token() and limit. But as you point out, this method will often miss some data from wide rows. Maybe a Cassandra expert
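One way to see how token() plus limit can drop data from wide rows is a small in-memory simulation. This is a sketch only, not Cassandra or driver code: `token` uses `zlib.crc32` as a stand-in for the real partitioner, and `select_page` and `naive_scan` are hypothetical helper names. The assumption is that LIMIT counts individual CQL3 rows, so a page can end in the middle of a partition, and resuming with `token(pk) > token(last_pk)` then skips the rest of that partition.

```python
import zlib

def token(pk):
    # Stand-in for the partitioner hash (assumption for this sketch).
    return zlib.crc32(pk.encode("utf-8"))

def select_page(rows, last_pk, limit):
    """Mimic: SELECT ... WHERE token(pk) > token(last_pk) LIMIT n."""
    # Rows come back ordered by partition token, then by clustering column.
    ordered = sorted(rows, key=lambda r: (token(r[0]), r[1]))
    if last_pk is not None:
        ordered = [r for r in ordered if token(r[0]) > token(last_pk)]
    return ordered[:limit]

def naive_scan(rows, limit):
    """Page with token() and LIMIT only, resuming after the last partition seen."""
    seen, last_pk = [], None
    while True:
        page = select_page(rows, last_pk, limit)
        if not page:
            return seen
        seen.extend(page)
        last_pk = page[-1][0]

# Three partitions with five clustered cells each.
rows = [(pk, c) for pk in ("a", "b", "c") for c in range(5)]
missed = set(rows) - set(naive_scan(rows, limit=3))
```

With a page size of 3 and partitions of 5 rows, every page stops partway through a partition and the remaining two rows of each partition are never returned; when the limit is at least as large as the widest partition, nothing is missed in this model.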

Re: HsHa

2013-08-14 Thread Keith Wright
Same here on 1.2.4. From: Romain HARDOUIN <romain.hardo...@urssaf.fr> Reply-To: "user@cassandra.apache.org" <user@cassandra.apache.org> Date: Wednesday, August 14, 2013 3:36 AM To: "user@cassandra.apache.org" mail

Re: Any good GUI based tool to manage data in Casandra?

2013-08-09 Thread Keith Freeman
Sounds like a good tool, but isn't it odd to only have Windows & Mac versions? My impression has been that most users run Cassandra on Linux. Is a Linux version coming (please!)? On 08/09/2013 01:27 PM, Alex Popescu wrote: On Fri, Aug 9, 2013 at 10:12 AM, David McNelis

Re: Unable to bootstrap node

2013-08-06 Thread Keith Wright
eption. Once this is up, again restart the bootstrapping node with bootstrap argument. Now it will successfully stream the data. On Mon, Aug 5, 2013 at 6:08 PM, Keith Wright <kwri...@nanigans.com> wrote: Yes we likely dropped and recreated tables. If we stop the sending node, wha

CQL3 select between is broken?

2013-08-06 Thread Keith Freeman
I've been looking at examples about modeling series data in Cassandra, and in one experiment created a table like this: create table vvv (k text, t bigint, value text, primary key (k, t)); After inserting some data with identical k values and differing t values, I tried this query (which is nea

Re: clarification of token() in CQL3

2013-08-06 Thread Keith Freeman
keys might fall sequentially after token(last_processed_row)) On 08/06/2013 08:18 AM, Richard Low wrote: On 6 August 2013 15:12, Keith Freeman <8fo...@gmail.com <mailto:8fo...@gmail.com>> wrote: I've seen in several places the advice to use queries like to this

clarification of token() in CQL3

2013-08-06 Thread Keith Freeman
I've seen in several places the advice to use queries like this to page through lots of rows: select id from mytable where token(id) > token(last_id) But it's hard to find detailed information about how this works (at least that I can understand -- the description in the Cassandra manual is p
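A rough way to picture the query quoted above: partitions are returned in order of the token (hash) of the key rather than the key itself, so each page resumes strictly after the last token seen. The following is a pure-Python simulation, not driver code; `token` uses `zlib.crc32` as a stand-in for the real partitioner, and `select_page` and `scan_all` are hypothetical names introduced for the example.

```python
import zlib

def token(key):
    # Stand-in for the partitioner hash (assumption for this sketch).
    return zlib.crc32(key.encode("utf-8"))

def select_page(table, last_id, limit):
    """Mimic: SELECT id FROM mytable WHERE token(id) > token(last_id) LIMIT n."""
    ordered = sorted(table, key=token)  # ids come back in token order
    if last_id is not None:
        ordered = [k for k in ordered if token(k) > token(last_id)]
    return ordered[:limit]

def scan_all(table, limit):
    """Walk the whole table one page at a time."""
    seen, last_id = [], None
    while True:
        page = select_page(table, last_id, limit)
        if not page:
            return seen
        seen.extend(page)
        last_id = page[-1]  # resume after the last id of this page

ids = [f"id{i:02d}" for i in range(25)]
visited = scan_all(ids, limit=4)
```

In this model every id is visited exactly once, in token order rather than alphabetical order. The sketch assumes one row per id; it does not cover tables where a single partition holds many rows.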

Re: Unable to bootstrap node

2013-08-05 Thread Keith Wright
problem. Are you seeing a lot of FileNotFoundExceptions? Did you do any schema change recently? Sankalp On Mon, Aug 5, 2013 at 5:39 PM, Keith Wright <kwri...@nanigans.com> wrote: Hi all, I have been trying to bootstrap a new node into my 7 node 1.2.4 C* cluster with Vnodes RF3 w

Unable to bootstrap node

2013-08-05 Thread Keith Wright
Hi all, I have been trying to bootstrap a new node into my 7 node 1.2.4 C* cluster with Vnodes RF3 with no luck. It gets close to completing and then the streaming just stalls with streaming at 99% from 1 or 2 nodes. Nodetool netstats shows the items that have yet to stream but the logs o

Re: CQL and undefined columns

2013-08-05 Thread Keith Freeman
From the Cassandra 1.2 Manual: Using the compact storage directive prevents you from adding more than one column that is not part of the PRIMARY KEY. At this time, updates to data in a table created with compact storage are not allowed. The table with compact storage that uses a compound prim

Re: org.apache.cassandra.io.sstable.CorruptSSTableException

2013-08-05 Thread Keith Wright
This type of corruption usually happens because of a failed or failing disk/memory. On Mon, Aug 5, 2013 at 8:44 AM, Jason Wee <peich...@gmail.com> wrote: you can try nodetool scrub. if it does not work, try repair then cleanup. had this issue a few weeks back but our version is

Re: org.apache.cassandra.io.sstable.CorruptSSTableException

2013-08-04 Thread Keith Wright
Re-sending hoping to get some help. Any ideas would be much appreciated! From: Keith Wright <kwri...@nanigans.com> Date: Friday, August 2, 2013 3:01 PM To: "user@cassandra.apache.org" <user@cassandr

Re: Choosing Java Driver for using Cassandra with Java EE7

2013-08-03 Thread Keith Wright
I definitely recommend the datastax driver so long as you plan on using cql3. Its binary protocol allows multiplexing, is vnode token aware, and does not require serialize/deserialize to thrift. We used astyanax before and it did work well and has support (aka recipes) for standard usage cases

org.apache.cassandra.io.sstable.CorruptSSTableException

2013-08-02 Thread Keith Wright
Hi all, We just added a node to our cluster (1.2.4 Vnodes) and it appears to be running well, except I see that the new node is not making any progress compacting one of the CFs. The exception below is generated. My assumption is that the only way to handle this is to stop the node, dele

Re: key cache hit rate and BF false positive

2013-07-31 Thread Keith Wright
uld check nodetool compactionstats to see if compaction can keep up. >You've changed the sstable_size_mb i'd also wait to see what it looks >like when that change has gone through. I'd like to see if the sstable >count goes down. > >Hope that helps. > >---

Re: sstable size change

2013-07-31 Thread Keith Wright
Created https://issues.apache.org/jira/browse/CASSANDRA-5834 From: aaron morton <aa...@thelastpickle.com> Reply-To: "user@cassandra.apache.org" <user@cassandra.apache.org> Date: Wednesday, July 31, 2013 5:11 AM To: "user@cassandra.apache.org

Re: key cache hit rate and BF false positive

2013-07-29 Thread Keith Wright
Re-sending hoping to get some assistance. Thanks! From: Keith Wright <kwri...@nanigans.com> Reply-To: "user@cassandra.apache.org" <user@cassandra.apache.org> Date: Friday, July 26, 2013 11:25 PM To: "user

Re: sstable size change

2013-07-29 Thread Keith Wright
ribe the version and query you first used. Thanks for taking the time to update the thread. Cheers - Aaron Morton Cassandra Consultant New Zealand @aaronmorton http://www.thelastpickle.com On 27/07/2013, at 12:57 AM, Keith Wright <kwri...@nanigans.com> wrote: FYI. It

Re: key cache hit rate and BF false positive

2013-07-26 Thread Keith Wright
0 0 0 0 0 20924300 0 0 2 0 0 25109160 0 0 1 0 0 From: Keith Wright mailto:kwri

key cache hit rate and BF false positive

2013-07-26 Thread Keith Wright
Hi all, I am experiencing VERY poor key cache hit rate on my 6 node C* 1.2.4 with Vnode cluster. I am using CQL3 with LCS and yesterday increased my SSTable size from default 5 MB to 256 MB although I did not force a major compaction and am instead letting the new size take effect organica

Re: sstable size change

2013-07-26 Thread Keith Wright
hrown an error. I assume I should open a bug for this? alter table shard_user_lookup with compaction_strategy_options = {'sstable_size_in_mb':256}; From: Keith Wright <kwri...@nanigans.com> Reply-To: "user@cassandra.apache.org<mailto:user@cassandra.apache.org>

Re: sstable size change

2013-07-25 Thread Keith Wright
Zhu <wz1...@yahoo.com> Date: Wednesday, July 24, 2013 8:49 PM To: "user@cassandra.apache.org" <user@cassandra.apache.org> Subject: Re: sstable size change what is output of show keyspaces from cassandra-cli, did you see the new value? Com

Re: sstable size change

2013-07-24 Thread Keith Wright
ng as compactions are naturally triggered" Yes. Let it compact and increase in size. On Tue, Jul 23, 2013 at 9:38 AM, Robert Coli <rc...@eventbrite.com> wrote: On Tue, Jul 23, 2013 at 6:48 AM, Keith Wright <kwri...@nanigans.com> wrote: Can you elaborate on what you

Re: sstable size change

2013-07-23 Thread Keith Wright
es a level might work? Since they would overflow the new level, they would compact soon, but the impact might be less drastic than just deleting the .json file (which takes everything to L0)... /Janne On 22 Jul 2013, at 16:02, Keith Wright <kwri...@nanigans.com> wrote: Hi all,

sstable size change

2013-07-22 Thread Keith Wright
Hi all, I know there have been several threads recently on this but I wanted to make sure I got a clear answer: we are looking to increase our SSTable size for a couple of our LCS tables as well as chunk size (to match the SSD block size). The largest table is at 500 GB across 6 nodes (RF

Re: Bulk loader with Cassandra 1.2.5

2013-06-07 Thread Keith Wright
Looking into it further, I believe your issue is that you did not define the table with compact storage. Without that, CQL3 will treat every column as a composite (as is hinted in your stack trace where you see AbstractCompositeType is the cause of the error). Try changing your table definitio

Re: Bulk loader with Cassandra 1.2.5

2013-06-06 Thread Keith Wright
Could it be because you are writing age as a long but have it defined as an integer in the table definition? Davide Anastasia wrote: Hi, I am currently using Cassandra 1.2.5 on RHEL6 with Oracle JVM. I want to try to build a bulk loader for stock prices that I have available in CSV format. H

Re: Bulk loading into CQL3 Composite Columns

2013-05-30 Thread Keith Wright
ache.org" <user@cassandra.apache.org> Subject: Re: Bulk loading into CQL3 Composite Columns Hi Keith... Thanks for the help. I'm presently not importing the Hector library (Which is where classes like CompositeSerializer and Stri

Re: Bulk loading into CQL3 Composite Columns

2013-05-30 Thread Keith Wright
.addColumn( CompositeSerializer.get().toByteBuffer(columnComposite), null, System.currentTimeMillis() ); From: Keith Wright <kwri...@nanigans.com> Date: Thursday, May 30, 2013 3:32 PM To: "user@cassandra.apache.org" <user@ca

Re: Bulk loading into CQL3 Composite Columns

2013-05-30 Thread Keith Wright
You do not want to repeat the first item of your primary key again. If you recall, in CQL3 a primary key as defined below indicates that the row key is the first item (key) and then the column names are composites of val1,val2. Although I don't see why you need val2 as part of the primary key

write time of CQL3 set items

2013-05-23 Thread Keith Wright
Hi all, I am using C* 1.2.4 with CQL3 and am taking advantage of the new collection support. One usage case I have is that I want a set of text and I need to know the time when each item in the set was written. If I understand CQL3 correctly, the underlying data engine utilizes composites

Re: SSTable size versus read performance

2013-05-16 Thread Keith Wright
Are your new fancy SSD drives using large sectors? If your read-ahead is really reading 512 x 4KB per random IO, then that 2 MB per read seems like a lot of extra overhead. -Bryan On Thu, May 16, 2013 at 12:35 PM, Keith Wright <kwri...@nanigans.com> wrote: We act

Re: SSTable size versus read performance

2013-05-16 Thread Keith Wright
PM To: "user@cassandra.apache.org" <user@cassandra.apache.org> Subject: Re: SSTable size versus read performance My 5 cents: I'd check blockdev --getra for data drives - too high values for readahead (default to 256 for debian) can

Re: SSTable size versus read performance

2013-05-16 Thread Keith Wright
" <user@cassandra.apache.org> Subject: Re: SSTable size versus read performance When you use compression you should play with your block size. I believe the default may be 32K but I had more success with 8K, nearly same compression ratio, less young gen memory pressure. On T
