Re: Efficient Paging Option in Wide Rows

2016-04-24 Thread Carlos Alonso
n would fit in the good practice by applying the formulae from this video: Cheers! Carlos Alonso | Software Engineer | @calonso <> On 23 April 2016 at 20:25, Anuj Wadehra wrote:

Re: Unable to reliably count keys on a thrift CF

2016-04-25 Thread Carlos Alonso
Hi Jens. Thanks for your response but my idea is to count different keys, so, if I understood correctly selecting WHERE key = #{key} won't give me any new key, right? Thanks! Carlos Alonso | Software Engineer | @calonso <> On 25 April 2016 at 09:22,

Re: Inconsistent Reads after Restoring Snapshot

2016-04-29 Thread Carlos Alonso
one with the highest timestamp (last write wins) if they don't agree, and running a read repair (repair that particular record) if the nodes didn't agree. Hope this helps. Carlos Alonso | Software Engineer | @calonso <> On 29 April 2016 at 04:33, Anuj Wade
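The reconciliation described in this snippet can be sketched in a few lines. This is only an illustration, not Cassandra's actual read path: `ReplicaResponse` and `reconcile` are hypothetical names, timestamps are plain integers, and tie-breaking between equal timestamps is ignored.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ReplicaResponse:
    node: str       # hypothetical replica name
    value: str      # value returned by that replica
    timestamp: int  # write timestamp attached to the value


def reconcile(responses):
    """Pick the response with the highest timestamp (last write wins)
    and list the replicas whose answer differs, i.e. the read-repair candidates."""
    winner = max(responses, key=lambda r: r.timestamp)
    stale = [
        r.node
        for r in responses
        if (r.value, r.timestamp) != (winner.value, winner.timestamp)
    ]
    return winner, stale


responses = [
    ReplicaResponse("node1", "old", 100),
    ReplicaResponse("node2", "new", 200),
    ReplicaResponse("node3", "new", 200),
]
winner, stale = reconcile(responses)
print(winner.value, stale)
```

Here `node1` holds the older write, so it is the only replica that would need to be repaired for this record.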

Re: OutOfMemoryError when initializing a secondary index

2016-08-09 Thread Carlos Alonso
If you're deleting all traces of the index you probably want to look at the commit log as they are probably being recreated from there. Hope it helps. Carlos Alonso | Software Engineer | @calonso <> On 5 August 2016 at 23:05, Charlie Moad wrote: > Run

Re: Cassandra Exception

2016-08-16 Thread Carlos Alonso
Are you running the migration to create the flapalarmcache table from more than one client concurrently? Carlos Alonso | Software Engineer | @calonso <> On 16 August 2016 at 10:51, Kamal C wrote: > Hi all, > > I'm using cassandra-3.7 version,

Re: Cassandra Exception

2016-08-17 Thread Carlos Alonso
Ok cool, glad you found the issue. Carlos Alonso | Software Engineer | @calonso <> On 17 August 2016 at 09:48, Kamal C wrote: > Carlos, > Yes, I'm running multiple clients simultaneously. Each one of them tries > to create table if it doesn'

Re: lack of space for replicas

2016-08-18 Thread Carlos Alonso
/architectureDataDistributeReplication_c.html Carlos Alonso | Software Engineer | @calonso <> On 18 August 2016 at 12:15, jean paul wrote: > Hi, > > > The distributed replicas take space. Is there a scenario when a new > replicas cannot be stored on a node due to

Re: A question to updatesstables

2016-08-18 Thread Carlos Alonso
Replies inline Carlos Alonso | Software Engineer | @calonso <> On 18 August 2016 at 11:56, Lu, Boying wrote: > Hi, All, > > > > We use Cassandra in our product. In our early release we use Cassandra > 1.2.10 whose SSTable is ‘ic’ format. >

Re: How to configure cassandra in a multi cluster mode?

2016-08-25 Thread Carlos Alonso
as a seed. Finally run nodetool status to check that they see each other. Hope it helps Carlos Alonso | Software Engineer | @calonso <> On 25 August 2016 at 10:19, Alexandr Porunov wrote: > Hello, > > I am little bit confusing about cassandra

Re: cassandra database design

2016-08-31 Thread Carlos Alonso
Maybe a good question could be: Which is your access pattern to this data? Carlos Alonso | Software Engineer | @calonso <> On 31 August 2016 at 11:47, Stone Fang wrote: > Hi all, > have some questions on how to define clustering key. > > have

Re: cassandra database design

2016-09-01 Thread Carlos Alonso
e sweet spot for your use case. Regards Carlos Alonso | Software Engineer | @calonso <> On 1 September 2016 at 02:58, Stone Fang wrote: > access pattern is > > select *from datacenter where datacentername = '' and publish>$time and >

Re: large system hint partition

2016-09-18 Thread Carlos Alonso
By inspecting the contents of your system.hints table, specifically the host_id column, you can see which host those hints are destined for and check whether it is one of the alive or dead ones. Carlos Alonso | Software Engineer | @calonso <> On 18 September 2016

Re: Cassandra data model right definition

2016-09-30 Thread Carlos Alonso
Cassandra is a Wide Column Store Carlos Alonso | Software Engineer | @calonso <> On 30 September 2016 at 18:24, Mehdi Bada wrote: > Hi all, > > I have a theoretical question: > - Is Apache Cassandra rea

Re: Property file snitch and unbalanced cluster

2016-10-28 Thread Carlos Alonso
cluster is back to 1 datacenter with 3 nodes and then add them sequentially again making sure the configuration in the Snitch is the proper one. Regards. Carlos Alonso | Software Engineer | @calonso <> On 28 October 2016 at 10:06, Robert Sicoie wrote: > Hi

Re: Designing a table in cassandra

2016-11-07 Thread Carlos Alonso
ggested primary key will work for queries like "Give me the version(s) of this particular device_name in this particular time range" Hope it helps. Regards Carlos Alonso | Software Engineer | @calonso <> On 7 November 2016 at 01:23, sat wrote: > Hi

Re: failing bootstraps with OOM

2016-11-07 Thread Carlos Alonso
ation) from one node to another and stop the old node and start the new one. As the configuration is all the same (just the ip will change) it joins the ring as if it was the old one. And there's no need for any bootstrapping. BTW, are you using vnodes? Regards Carlos Alonso | Software Engine

Re: store individual inventory items in a table, how to assign them correctly

2016-11-08 Thread Carlos Alonso
Bear in mind that LWT will, under certain circumstances, fail too. See Chris Batey's amazing talk about it at Cassandra Summit: Carlos Alonso | Software Engineer | @calonso <> On 7 November 2016 at 22:22, Justin Cam

Re: 答复: 答复: A difficult data model with C*

2016-11-10 Thread Carlos Alonso
task (every night maybe?) that deletes the extra rows that are not needed anymore. The query pattern for this is quite efficient as something like SELECT * FROM user_views WHERE user_name = ? LIMIT 10; Regards Carlos Alonso | Software Engineer | @calonso <>

Re: 答复: 答复: A difficult data model with C*

2016-11-10 Thread Carlos Alonso
Hi Ben, you're right, but in my example the last_time timestamp field is actually part of the primary key. Regards Carlos Alonso | Software Engineer | @calonso <> On 10 November 2016 at 11:50, Benjamin Roth wrote: > I pretty much guess the CQL you pos

Re: Reading Commit log files

2016-11-23 Thread Carlos Alonso
Hi Kamesh. Flushing memtables to disk causes the corresponding commitlog segments to be deleted. Once the data is flushed into SSTables it can be considered durable (in case of a node crash, the data won't be lost), and therefore there's no point in keeping it in the commitlog as well. Try withou

Re: Reading Commit log files

2016-11-23 Thread Carlos Alonso
> flushing. Still not able to read the commit logs. > However, I am able to read the commit logs of *system* and > *system_schema* key spaces but not able to read the application key space > (key space created by me). > > Thanks & Regards > > Kamesh. > > On Wed, No

Re: Any use-case about a migration from SQL Server to Cassandra?

2015-06-24 Thread Carlos Alonso
This article from Spotify Labs is a really nice write up of migrating SQL (Postgres in this case) to Cassandra Carlos Alonso | Software Engineer | @calonso <> On 23 June 2015 at 20:23, Alex Popescu wrote: > > On Tue, Jun 23, 2015 at 12:13 PM, Marcos

Re: Insert (and delete) data loss?

2015-06-30 Thread Carlos Alonso
Hi Moreno, Which consistency level are you using? If you're using ONE, that may make sense, as, depending on the partitioning and the cluster coordinating the query, different values may be received. Hope it helps. Regards Carlos Alonso | Software Engineer | @calonso <https://twi

Re: Example Data Modelling

2015-07-06 Thread Carlos Alonso
ic integer, flexible_allowance float, PRIMARY KEY(EmpID, month) ) That way the salaries will be partitioned by EmpID and clustered by month, which I guess is the natural sorting you want. Hope it helps, Cheers! Carlos Alonso | Software Engineer | @calonso <> O

Re: Example Data Modelling

2015-07-07 Thread Carlos Alonso
estion for getting a salary by employee ID and I'd also have Peter's one to run the end of the month query. Does it make sense? Cheers! Carlos Alonso | Software Engineer | @calonso <> On 7 July 2015 at 09:07, Srinivasa T N wrote: > Thanks for the

Re: Example Data Modelling

2015-07-07 Thread Carlos Alonso
Hi Jerome, Good point!! Really a nice usage of static columns! BTW, wouldn't the EmpID be static as well? Cheers Carlos Alonso | Software Engineer | @calonso <> On 7 July 2015 at 14:42, Jérôme Mainaud wrote: > Hello, > > You can slightly ad

Re: Schema questions for data structures with recently-modified access patterns

2015-07-21 Thread Carlos Alonso
clustering order, the latest modification will always be first record in the row. Hope it helps. Carlos Alonso | Software Engineer | @calonso <> On 21 July 2015 at 05:59, Robert Wille wrote: > Data structures that have a recently-modified access pattern see

Re: Schema questions for data structures with recently-modified access patterns

2015-07-22 Thread Carlos Alonso
IMARY KEY(date, docId), so that all docs modified on the same day are together in the same partition and, on the other hand, two updates on the same date won't generate two rows as the primary key would be exactly the same. Does it make sense? Carlos Alonso | Software Engineer | @calo
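The upsert behaviour mentioned here (a second write with the same primary key overwrites instead of adding a row) can be mimicked with nested dictionaries. This is a minimal sketch with hypothetical names (`upsert`, `table`, the `title` column); it models only the partition key `date` and the clustering key `docId`, nothing else about Cassandra.

```python
def upsert(table, date, doc_id, **columns):
    """Write columns under the primary key (date, doc_id).
    A second write with the same key updates the existing row."""
    partition = table.setdefault(date, {})            # partition key: date
    partition.setdefault(doc_id, {}).update(columns)  # clustering key: doc_id


table = {}
upsert(table, "2015-07-22", "doc-1", title="draft")
upsert(table, "2015-07-22", "doc-1", title="final")  # same key: overwrites
upsert(table, "2015-07-22", "doc-2", title="other")  # new key: new row

rows_for_day = table["2015-07-22"]
print(len(rows_for_day))
```

All documents modified on the same day land in one partition, and the two writes to `doc-1` collapse into a single row.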

Re: Tag filtering data model

2015-09-15 Thread Carlos Alonso
Really interesting question Artur. Have you gone any further? I think, based on my experience and recalling Cassandra's good practices, that full denormalisation is the Cassandra way to go. Cheers Carlos Alonso | Software Engineer | @calonso <> On 11 Sep

Re: Cassandra Summit 2015 Roll Call!

2015-09-22 Thread Carlos Alonso
Hi guys. I'm already here and I'll be here for the whole Summit. I'll be doing a live demo on Thursday on troubleshooting Cassandra production issues as a developer. This is me!! Carlos Alonso | Software Engineer | @calonso <h

DC's versions compatibility

2015-09-28 Thread Carlos Alonso
Hi guys. I have a very old cassandra cluster 1.2.19 and I'm looking to add a new datacenter to it for analytics purposes in a newer version, let's say 2.1.8. Will those DC's communicate properly? Regards Carlos Alonso | Software Engineer | @calonso <>

Re: DC's versions compatibility

2015-09-29 Thread Carlos Alonso
Hi guys. Thanks for your reply. Is this because the versions are very far from each other (e.g. Different major versions) Or as a rule of thumb, all nodes in a cluster, even in different DC's MUST have the same version? Regards Carlos Alonso | Software Engineer | @calonso <https://twi

How is the schema applied?

2015-09-29 Thread Carlos Alonso
can think of that header being a reference to the specific record of that column in the system.schema_columns table and some programming logic to match data types and fill missing fields before replying to the client. Is that right? Cheers! Carlos Alonso | Software Engineer | @calonso <ht

Summit Recap

2015-09-29 Thread Carlos Alonso
Here is my particular Summit Recap: Thanks to everyone for such great content and thanks DataStax as well for the great organisation. Cheers! Carlos Alonso | Software Engineer | @calonso <>

DataStax Cassandra MVP 2015 roster

2015-09-30 Thread Carlos Alonso
So glad to be one of the awarded and big congratulations to everyone else in that list!! Thanks DataStax and let's keep pushing this product/community for more!! Carlos Alonso | Software Engineer | @calonso <>

Re: Consistency Issues

2015-10-01 Thread Carlos Alonso
Hi Stephen. The UnknownColumnFamilyException made me think of a possible schema disagreement, in which one of your nodes has a different schema version and therefore you cannot reach quorum. Can you run nodetool describecluster and see if all nodes have the same schema versions? Cheers! Carlos

Re: Consistency Issues

2015-10-01 Thread Carlos Alonso
And that's a stupid one, I know, but does the column you're trying to access actually exist? Carlos Alonso | Software Engineer | @calonso <> On 1 October 2015 at 11:09, Walsh, Stephen wrote: > I did think of that and they are all the same version

Re: Consistency Issues

2015-10-01 Thread Carlos Alonso
Which versions of Cassandra and OpsCenter are you using? Because probably opscenter and your app are using cql and cqlsh is using thrift or vice versa and that's why depending on where you access from you see different things? Carlos Alonso | Software Engineer | @calonso <https://twi

Re: Consistency Issues

2015-10-01 Thread Carlos Alonso
Well... I wasn't expecting that, as OpsCenter 5.2.1 and cqlsh in Cassandra 2.1.x both use the native protocol. I was expecting them to use different protocols, so I have no further ideas :( Carlos Alonso | Software Engineer | @calonso <> On 1 October 20

OpsCenter detecting decommissioning node

2015-10-08 Thread Carlos Alonso
While decommissioning nodes...Why doesn't OpsCenter show it as an activity or something in the status of the node? That would be helpful to monitor the progress, as when upgrading sstables or compacting for example. Regards Carlos Alonso | Software Engineer | @calonso <https://twi

Stale node decommission?

2015-10-08 Thread Carlos Alonso
node being decommissioned is on a different version from the others. Any ideas? Carlos Alonso | Software Engineer | @calonso <>

Re: Stale node decommission?

2015-10-08 Thread Carlos Alonso
:] at org.apache.cassandra.streaming.ConnectionHandler$ ~[cassandra-all-] at [na:1.7.0_75] Carlos Alonso | Software Engineer | @calonso <>

Re: Node won't go away

2015-10-09 Thread Carlos Alonso
:// Carlos Alonso | Software Engineer | @calonso <> On 8 October 2015 at 19:24, sai krishnam raju potturi wrote: > the below solution should work. > > For each node in the cluster : > a : Stop cassandra service

Re: Cassandra query degradation with high frequency updated tables.

2015-10-09 Thread Carlos Alonso
Yeah, I was about to suggest the compaction strategy too. Leveled compaction sounds like a better fit when records are being updated Carlos Alonso | Software Engineer | @calonso <> On 8 October 2015 at 22:35, Tyler Hobbs wrote: > Upgrade to 2.2.2. Your sst

Re: Is replication possible with already existing data?

2015-10-12 Thread Carlos Alonso
Yes Ajay, in your particular scenario, after all hints are delivered, both CAS11 and CAS12 will have the exact same data. Cheers! Carlos Alonso | Software Engineer | @calonso <> On 11 October 2015 at 05:21, Ajay Garg wrote: > Thanks a ton Anuja for

Re: management and monitoring nodetool repair

2015-10-19 Thread Carlos Alonso
! Carlos Alonso | Software Engineer | @calonso <> On 16 October 2015 at 14:09, wrote: > Hi, > > I'm looking for a tool for management and monitoring of the status of > nodetool repair. > > Currently I am trying to test cassandra-reaper, but if y

Re: Read query taking a long time

2015-10-19 Thread Carlos Alonso
Could you send cfhistograms and cfstats relevant to the read column family? That could help Carlos Alonso | Software Engineer | @calonso <> On 17 October 2015 at 16:15, Brice Figureau <> wrote: > Hi, > > I'v

Re: management and monitoring nodetool repair

2015-10-19 Thread Carlos Alonso
ibly, java stack traces. Carlos Alonso | Software Engineer | @calonso <> On 19 October 2015 at 14:00, wrote: > Thx Carlos, > > > > How can I get information on error during repair ? > > > > Thx. > > *De :* Carlos Alonso [mailto:i...@

Re: Read query taking a long time

2015-10-20 Thread Carlos Alonso
atency. Can you check that command in all nodes? Regards Carlos Alonso | Software Engineer | @calonso <> On 20 October 2015 at 13:59, Brice Figureau <> wrote: > Hi, > > Thanks for your answer. Unfortunately since I

Re: Read query taking a long time

2015-10-21 Thread Carlos Alonso
very wide rows), but very heavy weighting (lots of data). Does that make sense? Carlos Alonso | Software Engineer | @calonso <> On 21 October 2015 at 08:35, Brice Figureau <> wrote: > Hi, > > On 20/10/2015 19:48, C

Weird replace_address issue in 1.2

2015-10-29 Thread Carlos Alonso
ience with this? Regards Carlos Alonso | Software Engineer | @calonso <>

Re: Weird replace_address issue in 1.2

2015-10-30 Thread Carlos Alonso
Well, everything was fine. The streams finished and after that the node joined the ring nicely and everyone removed the old one. :) Thanks! Carlos Alonso | Software Engineer | @calonso <> On 29 October 2015 at 21:45, Robert Coli wrote: > On Thu, Oct 29,

Re: Cassandra Data Model with Narrow partition

2015-10-30 Thread Carlos Alonso
actions I'd use DTCS or LCS, but given that you will have a big amount of tombstones due to TTLs I'd never go with STCS. Hope it helps! Carlos Alonso | Software Engineer | @calonso <> On 30 October 2015 at 10:55, wrote: > Hi, > > > > Cou

Re: scylladb

2015-11-11 Thread Carlos Alonso
Any update about this? @Carlos Rolo, did you try it? Thoughts? Carlos Alonso | Software Engineer | @calonso <> On 5 November 2015 at 14:07, Carlos Rolo wrote: > Something to do on a expected rainy weekend. Thanks for the information. > > Rega

Re: Getting code=2200 [Invalid query] message=Invalid column name ... while executing ALTER statement

2015-11-13 Thread Carlos Alonso
Maybe schema disagreement? Run nodetool describecluster to discover Carlos Alonso | Software Engineer | @calonso <> On 13 November 2015 at 11:14, Rajesh Radhakrishnan <> wrote: > > Hi, > > I am using Cassandra 2

Re: No query results while expecting results

2015-11-23 Thread Carlos Alonso
Ok, so by default the CONSISTENCY LEVEL of CQLSH is ONE. Therefore it could be that the node you're connected to doesn't actually contain any of this data. Try raising the CL for that read and see what happens. Regards. Carlos Alonso | Software Engineer | @calonso <https:

Re: No query results while expecting results

2015-11-23 Thread Carlos Alonso
so use the thrift client to list it and see how it looks on disk and see if there's something wrong. If the data is there and looks fine, probably there's a problem managing varints somewhere in the read path. Regards Carlos Alonso | Software Engineer | @calonso <https://twit

Re: Hints timed out

2015-11-23 Thread Carlos Alonso
Are both nodes using the same Cassandra version? Carlos Alonso | Software Engineer | @calonso <> On 20 November 2015 at 23:00, Arun Sandu wrote: > Hi, > > 1. What would be the reason for hints getting timed out? None of the nodes > went down. usin

Re: No query results while expecting results

2015-11-23 Thread Carlos Alonso
Did you try to observe it using cassandra-cli? (the thrift client) It shows the 'disk-layout' of the data and may help as well. Otherwise, if you can reproduce it having a varint as the last part of the partition key (or at any other location), this may well be a bug. Carlos Alonso

Re: Cassandra Cleanup and disk space

2015-11-26 Thread Carlos Alonso
Could it be that a SizeTieredCompaction of big SSTables just finished and freed some space? Carlos Alonso | Software Engineer | @calonso <> On 26 November 2015 at 08:55, Luigi Tagliamonte wrote: > Hi Everyone, > I'd like to understand what cleanup does on

Re: Are write-write conflicts possible in Cassandra atomic batches when updating multiple tables?

2015-12-18 Thread Carlos Alonso
Hi Mukil. As batches aren't isolated, I think write conflicts are effectively possible unless, somehow, you prevent them with LWT. Carlos Alonso | Software Engineer | @calonso <> On 17 December 2015 at 00:08, Mukil Kesavan wrote: > Hello, > >

Re: Would data be lost by nodetool removenode force

2015-12-21 Thread Carlos Alonso
Why is the old node not able to restart? If you're about to bring a new one to replace the old dead one, it may be simpler to just replace it Hope it helps. Carlos Alonso | Software Engineer | @ca

Re: Modeling contact list, plain table or List

2016-01-11 Thread Carlos Alonso
I have never used Materialized Views so maybe this suggestion is not possible, but in this case, wouldn't it make sense to define the materialized view as is_favourite IS TRUE instead of is_favourite IS NOT NULL? Carlos Alonso | Software Engineer | @calonso <

Re: Sorting & pagination in apache cassandra 2.1

2016-01-12 Thread Carlos Alonso
. Regards Carlos Alonso | Software Engineer | @calonso <> On 12 January 2016 at 07:30, anuja jain wrote: > 1 more question, what does it mean by "cassandra inherently sorts data"? > For eg: > I have a table with schema > > CREATE TABLE users

Re: [Typo correction] Is it good for performance to put rows that are of different types but are always queried together in the same table partition?

2016-01-12 Thread Carlos Alonso
Why can't you have something like this? CREATE TABLE t ( p INT, q1 INT, q2 UUID, c1 INT, c2 TEXT, PRIMARY KEY (p, q1, q2) ) Sounds the simplest solution. Carlos Alonso | Software Engineer | @calonso <> On 12 January 2016 at 18:27, Bamoqi wrote

Re: Cassandra is consuming a lot of disk space

2016-01-13 Thread Carlos Alonso
I'd have a look also at possible running compactions. If you have big column families with STCS then large compactions may be happening. Check it with nodetool compactionstats Carlos Alonso | Software Engineer | @calonso <> On 13 January 2016 at 05:22, Kev

Re: Node stuck when joining a Cassandra 2.2.0 cluster

2016-01-13 Thread Carlos Alonso
Hi Robert. I'm thinking of upgrading hardware in place. Can you please elaborate a bit more on how to use the auto_bootstrap=false + hibernate repair technique? Cheers! Carlos Alonso | Software Engineer | @calonso <> On 6 January 2016 at 11:10, Herbert Fis

Re: Sorting & pagination in apache cassandra 2.1

2016-01-15 Thread Carlos Alonso
than write just once and have expensive reads. Carlos Alonso | Software Engineer | @calonso <> On 15 January 2016 at 05:57, anuja jain wrote: > @Jonathan > what do you mean by "you'll need to maintain your own materialized view > tables"?

Re: Connection error 61 for cassandra

2016-01-19 Thread Carlos Alonso
I ran into those issues a while ago. It was on Ubuntu rather than OSX but probably the same. I compiled my steps here: Cheers! Carlos Alonso | Software Engineer | @calonso <>

Re: Getting error while issuing Cassandra stress

2016-01-24 Thread Carlos Alonso
Before closing this. Would the reason possibly be that with SimpleStrategy and RF=1 the data is only on one node on a single DC, and when the driver was connecting to the other DC, LOCAL_ONE could not be achieved? Regards. Carlos Alonso | Software Engineer | @calonso <https://twitter.

Re: Modeling nested collection with C* 2.0

2016-01-28 Thread Carlos Alonso
Hi Ahmed, I think modelling them as a map where you can 'label' your emails or addresses sounds like a good option. More info here: Regards Carlos Alonso | Software Engineer | @calonso <

Re: Session timeout

2016-01-29 Thread Carlos Alonso
Hi Oleg. The drivers have built-in, configurable timeout functionality. Hope it helps. Carlos Alonso | Software Engineer | @calonso <> On 28 January 2016 at 22:18, oleg yusim wrote: > Greetings, > > Does Cassandra support session timeout? If so, w

Re: Session timeout

2016-01-29 Thread Carlos Alonso
I personally don't use the Java but the Ruby driver, but I'm pretty sure you'll be able to find it in the docs: Carlos Alonso | Software Engineer | @calonso <> On 29 January 2016 at 13:15, oleg yusim

Re: Session timeout

2016-01-29 Thread Carlos Alonso
Oh, I thought you meant read/write timeout, not session timeout due to inactivity... Not sure there's such option. Sorry Carlos Alonso | Software Engineer | @calonso <> On 29 January 2016 at 13:35, oleg yusim wrote: > Carlos, > > I went thr

Re: Session timeout

2016-01-29 Thread Carlos Alonso
t has quite a lot of activity and it's easy sometimes to miss emails. About this session timeout thing, could you please reply to this thread if you find a solution? I'm curious about it. Cheers! Carlos Alonso | Software Engineer | @calonso <> On 29 January 2

Re: Atomic Batch: Maintaining consistency between tables

2016-02-04 Thread Carlos Alonso
Hi, The coordinator will send mutations to all required replicas and wait for required acknowledgements to fulfil consistency level. Regards Carlos Alonso | Software Engineer | @calonso <> On 4 February 2016 at 11:56, wrote: > Hello, > >

Re: Scenarios which need Repair

2016-02-09 Thread Carlos Alonso
ired wherever there are writes/updates. Regards Carlos Alonso | Software Engineer | @calonso <> On 9 February 2016 at 11:10, Anuj Wadehra wrote: > Hi, > > Can someone take this? > > > Thanks > Anuj > > > > On Mon, 8 Feb, 2016 at 11:

Re: Schema Versioning

2016-02-11 Thread Carlos Alonso
Here we use the Cassanity gem: This one suggests using schema migration files that are then registered in a column family to keep track of the version. Carlos Alonso | Software Engineer | @calonso <> On 10 February 2016 at

Re: Modeling Master Tables in Cassandra

2016-02-12 Thread Carlos Alonso
way, you avoid client side joins. Hope this helps! Carlos Alonso | Software Engineer | @calonso <> On 12 February 2016 at 09:25, Harikrishnan A wrote: > Hello, > I have a scenario where I need to create a customer master table in > cassandra which

Re: Reenable data access after temporarily moving data out of data directory

2016-02-24 Thread Carlos Alonso
Hi Jason Try this: Carlos Alonso | Software Engineer | @calonso <> On 24 February 2016 at 07:07, Jason Kania wrote: > Hi, > > I encountered an error in Cassandra or the la

Re: Migrating from single node to cluster

2016-02-26 Thread Carlos Alonso
vided for that (you probably saw a warning note at the last line of nodetool status' output). Hope this helps. Regards Carlos Alonso | Software Engineer | @calonso <> On 26 February 2016 at 03:53, Jason Kania wrote: > Hi, > > I am wondering if ther

Re: Consistent read timeouts for bursts of reads

2016-03-01 Thread Carlos Alonso
retrieved range) is too big is why the nodes get overloaded and end up dropping the read requests. If you see GC pressure that would point towards my hypothesis too. Hope this helps. Carlos Alonso | Software Engineer | @calonso <> On 25 February 2016 at 16:34,

Re: Strategy for dividing wide rows beyond just adding to the partition key

2016-03-11 Thread Carlos Alonso
reading was taken. This works for a >> number of queries but for scanning all the readings against a particular >> sensorUnitId and sensorId combination, we seem to be stuck. >> >> We won't know the range of valid values of the timeShardId for a given >> sensorUnitId and sensorId combination so would have to write to an >> additional table to track the valid timeShardId. We suspect this would >> create tombstone accumulation problems given the number of updates required >> to the same row so haven't tried this option. >> >> Alternatively, we hit a different bottleneck in the form of SELECT >> DISTINCT in trying to directly access the partition keys. Since SELECT >> DISTINCT does not allow for a where clause to filter on the partition key >> values, we have to filter several hundred thousand partition keys just to >> find those related to the relevant sensorUnitId and sensorId. This problem >> will only grow worse for us. >> >> Are there any other approaches that can be suggested? We have been >> looking around, but haven't found any references beyond the initial >> suggestion to add some sort of shard id to the partition key to handle wide >> rows. >> >> Thanks, >> >> Jason -- Carlos Alonso | Software Engineer | @calonso <>

Re: Strategy for dividing wide rows beyond just adding to the partition key

2016-03-14 Thread Carlos Alonso
pdate the 'last partition' entry for a sensor if we're really one week after the latest saved. That will generate one single tombstone per sensor and that doesn't sound scary I think. On the other hand, did you consider offloading the historical data to a better data warehouse

Re: If a cluster column has the same value over many rows, is the value duplicated in memory?

2016-03-14 Thread Carlos Alonso
They'll be duplicated. The only way of having one single value per partition is to declare it as static. Carlos Alonso | Software Engineer | @calonso <> On 8 March 2016 at 11:49, X. F. Li wrote: > Hi, > > If a cluster column has the same value ov


2016-03-15 Thread Carlos Alonso
You're inserting just a set {'', ''} under the key 'emails' and another set {'', ''} for the key 'unverifiedEmails' for your loginIds hash instead of the UDT. Carlos A

Re: Strategies for avoiding corrupted duplicate data?

2016-03-19 Thread Carlos Alonso
ion = 2 where ... IF version = 1 (version number previously read, of course) Hope this helps. Carlos Alonso | Software Engineer | @calonso <> On 17 March 2016 at 07:22, Max C wrote: > Hello, > > What are your best practices for avoiding collisions when
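The conditional update hinted at in this snippet (`... IF version = 1`) is an optimistic, compare-and-set style write. The sketch below mimics it in memory only; `VersionedRow` and `update_if_version` are hypothetical names, and nothing here talks to Cassandra or reproduces its lightweight-transaction protocol.

```python
class VersionedRow:
    """In-memory stand-in for a row carrying a version column."""

    def __init__(self, payload, version=1):
        self.payload = payload
        self.version = version


def update_if_version(row, expected_version, new_payload):
    """Apply the write only if the row still has the version the writer read,
    loosely mirroring `UPDATE ... SET version = N+1 ... IF version = N`."""
    if row.version != expected_version:
        return False  # someone else updated the row first
    row.payload = new_payload
    row.version = expected_version + 1
    return True


row = VersionedRow("original")
read_by_a = row.version  # both writers read version 1
read_by_b = row.version

applied_a = update_if_version(row, read_by_a, "written by A")
applied_b = update_if_version(row, read_by_b, "written by B")
print(applied_a, applied_b, row.version, row.payload)
```

The second writer is rejected because the version it read is no longer current, so it would have to re-read the row and retry.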

Re: Question about SELECT command

2016-03-20 Thread Carlos Alonso
Yes, they could. Carlos Alonso | Software Engineer | @calonso <> On 17 March 2016 at 16:10, Thouraya TH wrote: > Hi all; > > Please, i have a question about the architecure behind SELECT command. > Given this table: > > c1 c2 c3

Re: disk space used vs nodetool status

2016-03-22 Thread Carlos Alonso
vely using disk space. Hope this helps. Carlos Alonso | Software Engineer | @calonso <> On 22 March 2016 at 07:57, Anishek Agarwal wrote: > Hello, > > Using cassandra 2.0.17 on one of the 7 nodes i see that the "Load" column > from nodetool

Re: Migration from 2.0.10 to 2.1.12

2016-03-30 Thread Carlos Alonso
Try surrounding it with quotes. Didn't know about the reserved keyword full though. Regards Carlos Alonso | Software Engineer | @calonso <> On 30 March 2016 at 10:36, Jean Carlo wrote: > Hi! > > I am creating my new tables on cassandra 2.1.12

Re: Migration from 2.0.10 to 2.1.12

2016-03-30 Thread Carlos Alonso
Well... I guess that if full is a reserved word there's nothing you can do to change that but then, checking which are the keywords for that version it turns out that full is not there!! Maybe a bug? Regards Carlos A

Re: Runtime exception during repair job task

2016-03-31 Thread Carlos Alonso
This is probably due to corrupt data or a cassandra upgrade where you didn't run upgradesstables. I'd then suggest scrubbing the column family (or upgrading it). Hope it helps. Carlos Alonso | Software Engineer | @calonso <> On 31 March 2016 at 12:1

Re: auto_boorstrap when a node is down

2016-03-31 Thread Carlos Alonso
/cassandra-instantaneous-in-place-node-replacement/ We've been using it for a while and works nicely and avoids the time, resources and baby-sitting consumption of streaming data across nodes. Regards Carlos Alonso | Software Engineer | @calonso <> On 31 March

Re: auto_boorstrap when a node is down

2016-03-31 Thread Carlos Alonso
Mmm ok, then I think you may need to follow the standard dead node replacement procedure: Cheers! Carlos Alonso | Software Engineer | @calonso <> On 31 March 2016 at 16:34, Peddi, P

Re: *** How to bring up one of the Nodes which is down ***

2016-04-12 Thread Carlos Alonso
Hi Lokesh, This may sound a bit silly but... What about starting the Cassandra process on that box? Regards, Carlos Alonso | Software Engineer | @calonso <> On 11 April 2016 at 19:16, Lokesh Ceeba - Vendor wrote: > Team, > > Can you

Re: Bootstrapping Question for Cassandra 2.0

2016-04-19 Thread Carlos Alonso
It will be possible after the token reassignment has finished. Actually Cassandra itself will tell you. If you try the remove command and it is not possible, it will fail with an error indicating that the operation is not possible while there are nodes in Joining status. Regards Carlos Alonso

Unable to reliably count keys on a thrift CF

2016-04-21 Thread Carlos Alonso
OM tbl WHERE token(key) = #{token}") else results = connection.execute("SELECT token(key), key FROM tbl WHERE token(key) >= #{token} LIMIT 1") end last_token = token end puts keys.keys.count What am I missing? Thanks! Carlos Alonso | Software Engineer | @calonso <>
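The snippet above walks a table by `token(key)` to count distinct keys. As a rough illustration of that general idea only (it does not reproduce or diagnose the truncated code), the sketch below pages through an in-memory key list in token order. The `token` function is a made-up stand-in based on MD5, not the cluster's partitioner, and `fetch_page` and `count_keys` are hypothetical helpers; token collisions between different keys are ignored.

```python
import hashlib


def token(key):
    # Stand-in for the partitioner's token function (an assumption for this sketch).
    return int.from_bytes(hashlib.md5(key.encode()).digest()[:8], "big", signed=True)


def fetch_page(keys, last_token, page_size):
    """Mimic `SELECT token(key), key FROM tbl WHERE token(key) > ? LIMIT ?`."""
    matching = sorted(
        (token(k), k) for k in keys if last_token is None or token(k) > last_token
    )
    return matching[:page_size]


def count_keys(keys, page_size=3):
    """Walk the whole token range page by page and count distinct keys."""
    seen = set()
    last_token = None
    while True:
        page = fetch_page(keys, last_token, page_size)
        if not page:
            break
        seen.update(k for _, k in page)
        last_token = page[-1][0]  # resume strictly after the last token seen
    return len(seen)


keys = [f"key-{i}" for i in range(10)]
total = count_keys(keys)
print(total)
```

Resuming with a strict "greater than the last token seen" comparison is what keeps the walk from looping on the same page or skipping ahead, as long as each token maps to a single key.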

Log Slow Queries

2015-06-18 Thread Carlos Alonso
er log or saved to a particular column family to analyse them later. I've googled about this and the only 'easy' solution available out there seems to be DataStax Enterprise. What are you guys using? Thanks, Carlos Alonso | Software Engineer | @calonso <>

Re: Log Slow Queries

2015-06-18 Thread Carlos Alonso
the driver otherwise (hopefully it is not very hard). Regards Carlos Alonso | Software Engineer | @calonso <> On 18 June 2015 at 12:21, DE VITO Dominique <> wrote: > Hi Carlos, > > > > Different possibi