Re: Wide Rows - Data Model Design

2014-09-19 Thread Jonathan Lacefield
Hello, Yes, this is a wide row table design. The first col is your Partition Key. The remaining 2 cols are clustering cols. You will receive ordered result sets based on client_name, record_date when running that query. Jonathan Jonathan Lacefield Solution
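To make the ordering behaviour described above concrete, here is a minimal in-memory sketch in Python. It is not Cassandra code: the class name `WideRowTable`, the `value` field, and the sample keys are hypothetical, and only `client_name` and `record_date` come from the message. It assumes rows inside one partition are kept sorted by the clustering columns, in that order.

```python
class WideRowTable:
    """Toy model: partition key -> rows ordered by (client_name, record_date)."""

    def __init__(self):
        # Maps partition key -> {(client_name, record_date): value}
        self._partitions = {}

    def insert(self, partition_key, client_name, record_date, value):
        # Writing the same clustering key twice overwrites the earlier value.
        rows = self._partitions.setdefault(partition_key, {})
        rows[(client_name, record_date)] = value

    def select(self, partition_key):
        # Rows come back sorted by the clustering columns, in declaration order.
        rows = self._partitions.get(partition_key, {})
        return [(key, rows[key]) for key in sorted(rows)]


table = WideRowTable()
table.insert("p1", "beta", "2014-08-30", "c")
table.insert("p1", "acme", "2014-09-02", "b")
table.insert("p1", "acme", "2014-09-01", "a")
ordered = table.select("p1")
```

Whatever the insert order, `select` returns the rows of a partition sorted first by `client_name` and then by `record_date`, which is the property the reply relies on.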

Re: Load balancing in C* Cluster

2014-10-28 Thread Jonathan Lacefield
2.1/common/drivers/introduction/introArchOverview_c.html [2] https://github.com/Netflix/astyanax Thanks, Jonathan Jonathan Lacefield Solution Architect | (404) 822 3487 | jlacefi...@datastax.com

Re: Better option to load data to cassandra

2014-11-13 Thread Jonathan Lacefield
Here's another post which is pretty comprehensive for this topic. http://informationsurvival.blogspot.com/2014/02/cassandra-cql3-integration.html

Re: 100% CPU utilization, ParNew and never completing compactions

2014-12-16 Thread Jonathan Lacefield
hy do you think you have a lot of tombstones in that one particular table? [1] https://issues.apache.org/jira/browse/CASSANDRA-8447 [2] https://issues.apache.org/jira/browse/CASSANDRA-8485 Jonathan

Re: High read latency after data volume increased

2015-01-12 Thread Jonathan Lacefield
There are likely 2 things occurring: 1) the cfhistograms error is due to https://issues.apache.org/jira/browse/CASSANDRA-8028, which is resolved in 2.1.3. Looks like voting is under way for 2.1.3. As rcoli mentioned, you are running the latest open source of C* which should be treated as beta until a

Re: Dynamic Columns

2015-01-20 Thread Jonathan Lacefield
Hello, Have you looked at solving this challenge with clustering columns? Also, please describe the problem set details for more specific advice from this group. Starting new projects on Thrift isn't the recommended approach. Jonathan

Re: Dynamic Columns

2015-01-20 Thread Jonathan Lacefield
to look in the archives for a similar discussion topic. Believe this item was asked a few months ago as well.

Re: Re: Dynamic Columns

2015-01-21 Thread Jonathan Lacefield
problem. Or, rethink the design approach and leverage CQL. Please let the mailing list know the direction you choose. Jonathan

Re: Adding new node to cluster

2015-02-18 Thread Jonathan Lacefield

Re: Way to Cassandra File System

2015-03-24 Thread Jonathan Lacefield
e. Hope this provides clarity for you. Thanks, Jonathan Jonathan Lacefield Director - Consulting, Americas | (404) 822 3487 | jlacefi...@datastax.com

Re: Tracking word frequencies

2014-01-17 Thread Jonathan Lacefield
query reads with Cassandra. This can be done through tuning, data modeling, and/or scaling. Please post a cfhistograms for this table as well as provide some details on the specific queries you are running. Thanks, Jonathan Jonathan Lacefield Solutions Architect, DataStax (404) 822 3487

Re: Help me on Cassandra Data Modelling

2014-01-27 Thread Jonathan Lacefield
plicate data as you will store 1 tag combination in every Tag partition, i.e. if a tag combination has 2 parts, then you will have 2 rows. Hope this helps.

Re: GC eden filled instantly (any size). Dropping messages.

2014-01-27 Thread Jonathan Lacefield
ssues are the result of the Data Model or access patterns based on the results of CFHistograms 3) What does tpstats look like? Jonathan

Re: Worse perf after Row Caching version 1.2.5:

2014-02-11 Thread Jonathan Lacefield
Hello, Please paste the output of cfhistograms for these tables. Also, what does your environment look like: number of nodes, disk drive configs, memory, C* version, etc.? Thanks, Jonathan

Re: Cassandra Version History

2014-02-24 Thread Jonathan Lacefield
Hello, Check out the full version list here: https://issues.apache.org/jira/browse/CASSANDRA?selectedTab=com.atlassian.jira.plugin.system.project:versions-panel&subset=-1 Jonathan

Re: Reverse a delete operation

2014-02-25 Thread Jonathan Lacefield
A couple of options: 1) Do you have a snapshot? If so, you could recover from the snapshot. 2) Look in the sstable via SSTable2JSON and find the pre-deleted row state, then reinsert it. There are probably other options as well.

Re: Getting the most-recent version from time-series data

2014-02-25 Thread Jonathan Lacefield
Clint One approach would be to create a copy of this table and switch the clustering columns around so version precedes family. This way you could easily grab the 1st, 2nd, N version rows. Would this help you in your situation? Jonathan > On Feb 25, 2014, at 7:49 PM, Clint Kelly wrote: > >
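The effect of putting version ahead of family in the clustering order can be sketched in plain Python. This is an illustration only: the function name `first_n_versions`, the row dictionaries, and the `newest_first` flag are hypothetical, and the real table would rely on the clustering order rather than on sorting in the client.

```python
def first_n_versions(rows, n, newest_first=True):
    """Order rows by version, then family, and return the first n of them."""
    # Sort by family first; the second sort is stable, so family order
    # is preserved among rows that share a version.
    by_family = sorted(rows, key=lambda r: r["family"])
    by_version = sorted(by_family, key=lambda r: r["version"], reverse=newest_first)
    return by_version[:n]


rows = [
    {"family": "a", "version": 1},
    {"family": "b", "version": 2},
    {"family": "a", "version": 2},
]
latest_two = first_n_versions(rows, 2)
```

With version leading the order, taking the 1st, 2nd, or Nth version rows becomes a slice from the front of the partition instead of a scan across families.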

Re: Cassandra nodetool status result after restoring snapshot

2014-02-26 Thread Jonathan Lacefield
e you using Opscenter? A quick comparison of cfstats between clusters may help you analyze your situation and help you pinpoint if you are missing any data for a particular keyspace, etc as well. Thanks, Jonathan

Re: Backup/Restore in Cassandra

2014-03-07 Thread Jonathan Lacefield
Here's the 2.0 documentation for backup and restore - http://www.datastax.com/documentation/cassandra/2.0/cassandra/operations/ops_backup_restore_c.html Thanks, Jonathan

Re: replication_factor: ?

2014-03-07 Thread Jonathan Lacefield
had a 3 node cluster with an RF 2, then each piece of data would be stored on 2 of the 3 nodes, i.e. you would have 2 copies of data in your cluster. Hope that helps.
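The replication factor arithmetic can be checked with a small Python sketch. It assumes a simplified ring placement in which the replicas for a key are the owning node plus the next nodes in ring order; the function name `replica_nodes` and the node labels are hypothetical, and real placement depends on the replication strategy and snitch in use.

```python
def replica_nodes(primary_index, nodes, rf):
    """Return the rf nodes holding a key whose primary owner is nodes[primary_index]."""
    if rf > len(nodes):
        raise ValueError("replication factor cannot exceed the number of nodes")
    # Walk the ring clockwise from the primary owner, wrapping at the end.
    return [nodes[(primary_index + i) % len(nodes)] for i in range(rf)]


nodes = ["n1", "n2", "n3"]
placements = [replica_nodes(i, nodes, rf=2) for i in range(len(nodes))]
# Count how many of the three token ranges each node stores.
ranges_per_node = {n: sum(n in p for p in placements) for n in nodes}
```

In this model every key has exactly 2 copies in the cluster, and each of the 3 nodes stores 2 of the 3 ranges.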

Re: read one -- internal behavior

2014-03-07 Thread Jonathan Lacefield
B is the answer > On Mar 7, 2014, at 7:35 PM, James Lyons wrote: > > I'm wondering about the following scenario. > > Consider a cluster of nodes with replication say 3. > When performing a read at "read one" consistency and lets say my client isn't > smart enough to route the request to the Cass

Re: read one -- internal behavior

2014-03-07 Thread Jonathan Lacefield
ng message to [ip address]' for each of the replicas. On Fri, Mar 7, 2014 at 5:44 PM, Jonathan Lacefield wrote: > B is the answer > > > On Mar 7, 2014, at 7:35 PM, James Lyons wrote: > > > > I'm wondering about the following scenario. > > > > Conside

Re: need help with Cassandra 1.2 Full GCing -- output of jmap histogram

2014-03-10 Thread Jonathan Lacefield
o, you may be running into this with the older version of Cassandra: https://issues.apache.org/jira/browse/CASSANDRA-6541 Hope this helps. Jonathan

Re: DSE Hadoop support for provisioning hardware

2014-03-11 Thread Jonathan Lacefield
Hello, Not sure this question is appropriate for the Open Source C* users group. If you would like, please email me directly to discuss DataStax specific items. Thanks, Jonathan jlacefi...@datastax.com

Re: Relation between Atomic Batches and Consistency Level

2014-03-17 Thread Jonathan Lacefield
ading records contained in the batch, you will read results of partially updated batches. Taking this into account for your second question, you should expect that your read CL will perform as it would for any individual row mutation. Hope this helps. Jonathan

Re: Multi-site Active-Active replication - Preparing Sites - Cluster Name and Snitch

2014-03-17 Thread Jonathan Lacefield
actually. 4) Yes, all Keyspaces should be updated as a part of this process. Hope that helps.

Re: Relation between Atomic Batches and Consistency Level

2014-03-17 Thread Jonathan Lacefield
lient fails, etc. Hope that helps. Thanks, Jonathan On Mon, Mar 17, 2014 at 1:38 PM, D

Re: Relation between Atomic Batches and Consistency Level

2014-03-18 Thread Jonathan Lacefield
e batchlog table for execution. Hope this helps. I believe this is the best I can do for you at the moment. Thanks,

Re: How to extract information from commit log?

2014-03-18 Thread Jonathan Lacefield
cular versions of partitions. Jonathan On Tue, Mar 18, 2014 at 2:25 PM, Han,Meng wrote

Re: No output.log is ever generated

2014-03-24 Thread Jonathan Lacefield
. Jonathan On Mon, Mar 24, 2014 at 8:26 AM, user 01 wrote: > Hints please, anyone ? >

Re: need help with Cassandra 1.2 Full GCing -- output of jmap histogram

2014-03-25 Thread Jonathan Lacefield
Sorry to hear about the frustration. How often are you deleting data/what are you setting for ttl on cols?

Re: Question about how compaction and partition keys interact

2014-03-26 Thread Jonathan Lacefield
FILTERING. Did you ever consider partitioning simply on customer and using date and type as clustering keys? Hope that helps. Jonathan

Re: Question about how compaction and partition keys interact

2014-03-27 Thread Jonathan Lacefield
it will provide a significant advantage to your query access times. Hope that helps and if not, maybe someone else can provide the answer to your specific question regarding the impacts of your model on compaction. Thanks, Jonathan

Re: Auto-Bootstrap not Auto-Bootstrapping?

2014-04-07 Thread Jonathan Lacefield
Hello Not sure I follow the auto bootstrap question, but seeds are only used on startup. Also, what do you mean by convert the node to a seed node? You could simply add the 4th node IP address to the seed list of the other nodes in the .yaml file. Hope that helps Jonathan > On Apr 7, 201

Re: Apache cassandra not joining cluster ring

2014-04-09 Thread Jonathan Lacefield
Hello The nodetool status that you mentioned, was that executed on the 4th node itself? Also, what does netstat display? Are the correct ports listening on that node? Regarding opscenter, what version of opscenter are you using? Are you able to manually start the agents on the nodes themselves

Re: Per-keyspace partitioners?

2014-04-09 Thread Jonathan Lacefield
Hello, Partitioner is per cluster. We have seen users create separate clusters for items like this, but that's an edge case. Jonathan

Re: Point in Time Recovery

2014-04-10 Thread Jonathan Lacefield
Hello, Have you tried the procedure documented here: http://www.datastax.com/documentation/cassandra/1.2/cassandra/configuration/configLogArchive_t.html Thanks, Jonathan

Re: List and Cancel running queries

2014-04-11 Thread Jonathan Lacefield
No. This is not possible today > On Apr 11, 2014, at 1:19 AM, Richard Jennings > wrote: > > Is it possible to list all running queries on a Cassandra cluster ? > Is it possible to cancel a running query on a Cassandra cluster? > > Regards

Re: How safe is "nodetool move" in 1.2 ?

2014-04-16 Thread Jonathan Lacefield
Assuming you have enough nodes not undergoing "move" to meet your CL requirements, then yes, your cluster will still accept reads and writes. However, it's always good to test this before doing it in production to ensure your cluster and app will function as designed.
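The "enough nodes to meet your CL" condition can be expressed as a small check. The Python sketch below is an illustration, not driver code: the function names are hypothetical, only a handful of consistency levels are covered, and counting a moving node as unavailable is a deliberately conservative assumption of this sketch rather than something the message states.

```python
def required_replicas(consistency_level, rf):
    """Number of replica responses needed for a given consistency level."""
    levels = {
        "ONE": 1,
        "TWO": 2,
        "THREE": 3,
        "QUORUM": rf // 2 + 1,  # a majority of the replicas
        "ALL": rf,
    }
    return levels[consistency_level]


def can_serve(consistency_level, rf, replicas_available):
    """True if the available replicas for a key can satisfy the consistency level."""
    return replicas_available >= required_replicas(consistency_level, rf)


# RF 3 with one replica treated as unavailable during the move.
quorum_ok = can_serve("QUORUM", rf=3, replicas_available=2)
all_ok = can_serve("ALL", rf=3, replicas_available=2)
```

Under these assumptions, QUORUM requests still succeed with one of three replicas out of action, while ALL requests do not, which is why testing against the application's actual consistency levels is worthwhile.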

Re: row caching for frequently updated column

2014-04-29 Thread Jonathan Lacefield
the ability to "preheat" key and page cache, but I don't believe this is possible for row cache. Hope that helps. Jonathan

Re: cassandra snapshots

2014-05-06 Thread Jonathan Lacefield
apshot configurations which could have been set for your cluster. http://www.datastax.com/documentation/cassandra/2.0/cassandra/configuration/configCassandra_yaml_r.html

Re: Query first 1 columns for each partitioning keys in CQL?

2014-05-16 Thread Jonathan Lacefield
On Fri, May 16, 2014 at 12:23 AM, Matope Ono wrote: > Hi, I'm modeling some queries in CQL3. > > I'd like to query first

Re: Cannot query secondary index

2014-06-09 Thread Jonathan Lacefield
his helps. Jonathan On Mon, Jun 9, 2014 at 5:18 PM, Redmumba wrote: > I have a table with a timestamp column on it; however, wh

Re: Cannot query secondary index

2014-06-09 Thread Jonathan Lacefield
Hello, Will you please describe the use case and what you are trying to model? What are some questions/queries that you would like to serve via Cassandra? This will help the community help you a little better.

Re: Cannot query secondary index

2014-06-13 Thread Jonathan Lacefield
te when TTLs aren't keeping up with your load. You have such a special use case for this functionality that the little, infrequent, performance hit outweighs the complexity of implementing options 1 and 3. Thanks, Jonathan

Re: restarting node makes cpu load of the entire cluster to raise

2014-06-18 Thread Jonathan Lacefield
Hello Have you checked the log file to see what's happening during startup? What caused the rolling restart? Did you perform an upgrade or change a config? > On Jun 18, 2014, at 5:40 AM, Alain RODRIGUEZ wrote: > > Hi guys > > Using 1.2.11, when I try to rolling restart the cluster, any nod

Re: Configuring all nodes as seeds

2014-06-18 Thread Jonathan Lacefield
/ops_replace_seed_node.html Take this into consideration, coupled with the fact that nodes will require replacing along the way, when determining the right number of seeds to use per cluster.

Re: restarting node makes cpu load of the entire cluster to raise

2014-06-18 Thread Jonathan Lacefield
a flush or drain before stopping and starting Cassandra. This will flush memtables and clear your log segments. On Wed, Jun 18,

Re: restarting node makes cpu load of the entire cluster to raise

2014-06-26 Thread Jonathan Lacefield
y I can't be of more help with this one. It may take some hands-on investigation or maybe someone else in the community has experienced this issue and can provide feedback. Thanks, Jonathan

Re: keyspace with hundreds of columnfamilies

2014-07-02 Thread Jonathan Lacefield
Hello There is overhead for memory with each col family. This type of configuration could cause heap issues. What is driving the requirement for so many Cfs? > On Jul 2, 2014, at 4:14 AM, tommaso barbugli wrote: > > Hi, > Are there any known issues, shortcomings about organising data in hundr

Re: horizontal query scaling issues follow on

2014-07-21 Thread Jonathan Lacefield
ross partitions from this right? Does it mean for 1800 (the >>> number of unique keys) that each has 8 cells? >>> >> >> Yes, your interpretation is correct. Each of your 1800 partitions >> has 8 cells (taking up 1109 bytes). >> >> >> -- >> Tyler Hobbs >> DataStax <http://datastax.com/> >> > > -- Jonathan Lacefield

Re: Barman equivalent for Cassandra?

2021-03-12 Thread Jonathan Lacefield
There is a community delivered tool named Medusa that may have what you're looking for as well - https://cassandra.tools/medusa Jonathan Lacefield