Efficient model for a sorting

2016-10-04 Thread Benjamin Roth
o be done anyway) and sorting by a field would perfectly do the trick. But from the docs, this is not possible right now. Does anyone see a better solution or are all my assumptions correct? -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49

Re: Efficient model for a sorting

2016-10-04 Thread Benjamin Roth
're at least >>> guaranteed to hit a single node every time >>> >>> On Tue, Oct 4, 2016 at 1:56 PM, Dorian Hoxha >>> wrote: >>> >>>> Can you use the lucene index https://github.com/Stratio/cassandra-lucene-index ?

Re: Cassandra listen port

2016-10-04 Thread Benjamin Roth
fr/dbi-services-et-ses-collaborateurs/offres-emplois-opportunites-carrieres/>* > -- Benjamin Roth Prokurist Jaumo GmbH · www.jaumo.com Wehrstraße 46 · 73035 Göppingen · Germany Phone +49 7161 304880-6 · Fax +49 7161 304880-1 AG Ulm · HRB 731058 · Managing Director: Jens Kammerer

Re: Efficient model for a sorting

2016-10-04 Thread Benjamin Roth
andra > on Azure and SoftLayer.Launch your cluster in minutes.* > > > On Tue, 04 Oct 2016 07:27:54 -0400 *Benjamin Roth > >* wrote > > Hi! > > I have a frequently used pattern which seems to be quite costly in CS. The > pattern is always the same: I have a uni

Re: Efficient model for a sorting

2016-10-04 Thread Benjamin Roth
mir Yudovin, > > > *Winguzone Inc <https://winguzone.com?from=list> - Hosted Cloud Cassandra > on Azure and SoftLayer.Launch your cluster in minutes.* > > > On Tue, 04 Oct 2016 12:19:13 -0400 *Benjamin > Roth>* wrote > > I use the self-compiled m

Re: Efficient model for a sorting

2016-10-04 Thread Benjamin Roth
nobody will > be able to help you with. > On Tue, Oct 4, 2016 at 12:40 PM Benjamin Roth > wrote: > >> I have the impression, that not the tick-tock is the real problem but MVs >> are not really battle-tested yet. >> Depending on the model, they put much more complexity

Re: Cassandra listen port

2016-10-04 Thread Benjamin Roth
Launch your cluster in minutes. > > > > > On Tue, 04 Oct 2016 11:36:04 -0400 Benjamin > Roth<benjamin.r...@jaumo.com> wrote > > There are several ports for several services. They are all set in > cassandra.yaml > > See here for complete documentation:

Re: Cassandra listen port

2016-10-04 Thread Benjamin Roth
L Server experts ! – Join the team > > > ----- Original Message - > From: Benjamin Roth > To: user@cassandra.apache.org > Sent: Tue, 04 Oct 2016 20:36:49 +0200 (CEST) > Subject: Re: Cassandra listen port > > As far as I can see, these ports are also used for outgoing connecti

dtests jolokia fails to attach

2016-10-05 Thread Benjamin Roth
ND PID USER FD TYPE DEVICE SIZE/OFF NODE NAME
java 20376 br 72u IPv4 124151830 0t0 TCP *:font-service (LISTEN)
Any ideas? I just want to be able to close CASSANDRA-12689

Re: dtests jolokia fails to attach

2016-10-05 Thread Benjamin Roth
Maybe additional information, this is the CS command line for ccm node1: br 20376 3.2 8.6 2331136 708308 pts/5 Sl 06:10 0:30 java -Xloggc:/home/br/.ccm/test/node1/logs/gc.log -ea -XX:+UseThreadPriorities -XX:ThreadPriorityPolicy=42 -XX:+HeapDumpOnOutOfMemoryError -Xss256k -XX:StringTa

Re: dtests jolokia fails to attach

2016-10-06 Thread Benjamin Roth
t you need to do > "remove_perf_disable_shared_mem(node1)" before starting the node > > /Marcus > > On Thu, Oct 6, 2016 at 8:30 AM, Benjamin Roth > wrote: > >> Maybe additional information, this is the CS command line for ccm node1: >> >> br

Re: Rationale for using Hazelcast in front of Cassandra?

2016-10-07 Thread Benjamin Roth
t; 3(replication) cassandra nodes ? >> >> Example: why does facebook use memcache ? They certainly have things >> distributed on thousands of servers. >> >> >> >> On Thu, Oct 6, 2016 at 11:40 PM, KARR, DAVID wrote: >> >> I've seen use cases

Re: which one of the following choices is more efficient?

2016-10-26 Thread Benjamin Roth
runtime and >>> space complexity for CRUD operations the same in either case) which one >>> would you choose and why?

Re: Cannot mix counter and non counter columns in the same table

2016-11-01 Thread Benjamin Roth
Big Fat lol!!! On 01.11.2016 19:02, "Ali Akhtar" wrote: > ^ Stockholm syndrome :) > > On Tue, Nov 1, 2016 at 10:54 PM, Robert Wille wrote: > >> I used to think it was terrible as well. But it really isn’t. Just put >> your non-counter columns in a separate table with the same primary key. If >

Re: commit log on NFS volume

2016-11-01 Thread Benjamin Roth
Using NFS for a distributed system like Cassandra is like putting a Ferrari on a truck and going for a race with the truck. It is simply nonsense. On 01.11.2016 19:39, "Vladimir Yudovin" wrote: > Hi, > > it's not only a performance issue. In case of a network problem the writer thread > can be blocked,

Re: commit log on NFS volume

2016-11-01 Thread Benjamin Roth
ng what has > already been said. I don't even know that NFS is to blame for the > CommitLogReplayException that I cited. > > On Tue, Nov 1, 2016 at 2:43 PM, Benjamin Roth > wrote: > >> Using NFS for a distributed system like Cassandra is like putting a >> Ferra

Re: Introducing Cassandra 3.7 LTS

2016-11-02 Thread Benjamin Roth
; submitted back to the community). >>> >>> *Why release it?* >>> A number of our customers and people in the community have asked if we >>> would make this available, which we are more than happy to do so. This >>> repository represents what Instaclustr runs in pr

Re: large number of pending compactions, sstables steadily increasing

2016-11-07 Thread Benjamin Roth
> SSTables in each level: [11/4, 20/10, 213/100, 1356/1000, 306, 0, 0, 0, 0]
> Space used (live): 301894591442
> Space used (total): 301894591442
>
> Problematic node
> Keyspace: mykeyspace
> Read Count: 0
> Read Latency: NaN ms.
> Write Count: 30520190
> Write Latency: 0.05171286705620116 ms.
> Pending Flushes: 0
> Table: mytable
> SSTable count: 14105
> SSTables in each level: [13039/4, 21/10, 206/100, 831, 0, 0, 0, 0, 0]
> Space used (live): 561143255289
> Space used (total): 561143255289
>
> Thanks,
> Ezra
>
> --
> Jens Rantil
> Backend engineer
> Tink AB
> Email: jens.ran...@tink.se
> Phone: +46 708 84 18 32
> Web: www.tink.se

Log traces of debug logs

2016-11-09 Thread Benjamin Roth
Hi! Is there a way to tell logback to log the trace of a debug log? The background is that I'd like to know where a table flush is triggered from. Thanks guys!

Re: Log traces of debug logs

2016-11-09 Thread Benjamin Roth
On Wed, 09 Nov 2016 10:17:37 -0500 *Benjamin Roth* wrote: > Hi! > > Is there a way to tell logback to log the trace of a debug log? The > background is that I'd like to know from where a table flush is triggered. > > Thanks guys!

Re: 答复: 答复: A difficult data model with C*

2016-11-10 Thread Benjamin Roth
> On Mon, 07 Nov 2016 21:54:08 -0500 *ben ben* wrote:
>> Hi guys,
>> We are maintaining a system for an online video service. All users' viewing records of every movie are stored in C*, so a user can continue a movie from the last position next time. The table is designed as below:
>> CREATE TABLE recent (
>>   user_name text,
>>   vedio_id text,
>>   position int,
>>   last_time timestamp,
>>   PRIMARY KEY (user_name, vedio_id)
>> )
>> It worked well before. However, the records increase every day, and the last ten items may be adequate for the business. The current model uses vedio_id as the clustering key to keep one row per movie, but the business prefers to order by last_time desc. If we use last_time as the clustering key, there will be many records for a single movie, while only the most recent one is actually desired. So how to model that? Do you have any suggestions?
>> Thanks!
>> BRs,
>> BEN

Re: 答复: 答复: A difficult data model with C*

2016-11-10 Thread Benjamin Roth
os Alonso | Software Engineer | @calonso <https://twitter.com/calonso> On 10 November 2016 at 11:50, Benjamin Roth wrote: > I pretty much guess the CQL you posted is invalid. You cannot set a > clustering column that is not part of the primary key. > But you can use a materialized

Re: 答复: 答复: A difficult data model with C*

2016-11-10 Thread Benjamin Roth
This is the reason why one would like to use an MV for it. An MV handles this: it adds a clustering key while preserving the uniqueness of the original PK. On 11.11.2016 02:33, "Gang Liu" wrote: > I guess the original design is to keep one record for one video per user. maybe > their app will report many p
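To make the materialized-view argument concrete, here is a minimal Python model of the behaviour described in this reply. It is not Cassandra code. It assumes a view on the `recent` table from this thread whose key is (user_name, last_time, vedio_id); the class and method names are hypothetical and exist only for this sketch.

```python
from bisect import insort


class RecentWithView:
    """Toy model of a base table plus a materialized view (illustrative only)."""

    def __init__(self):
        # Base table: primary key (user_name, vedio_id) -> (position, last_time)
        self.base = {}
        # View: per user, rows kept sorted by (last_time, vedio_id)
        self.view = {}

    def upsert(self, user_name, vedio_id, position, last_time):
        rows = self.view.setdefault(user_name, [])
        old = self.base.get((user_name, vedio_id))
        if old is not None:
            # Drop the view row written for the previous last_time, so the
            # view never holds two rows for the same base primary key.
            rows.remove((old[1], vedio_id, old[0]))
        self.base[(user_name, vedio_id)] = (position, last_time)
        insort(rows, (last_time, vedio_id, position))

    def latest(self, user_name, limit=10):
        # Comparable in spirit to ORDER BY last_time DESC LIMIT n
        return list(reversed(self.view.get(user_name, [])))[:limit]


store = RecentWithView()
store.upsert("ben", "movie-a", 120, 1000)
store.upsert("ben", "movie-b", 30, 2000)
store.upsert("ben", "movie-a", 480, 3000)  # same video, newer timestamp
print(store.latest("ben"))
```

The third write replaces the earlier view row for `movie-a` instead of adding a second one, which is the "clustering key plus preserved uniqueness" property the reply refers to.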

Re: Priority for cassandra nodes in cluster

2016-11-12 Thread Benjamin Roth
1. From 15 years of experience running distributed services: don't mix services on machines if you don't have to. Dedicate each server to a single task if you can afford it. It is easier to manage and reduces risk in case of overload or failure.
2. You can assign a different number of tokens for ea
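The second point is cut off in the archive. Assuming it refers to giving nodes different `num_tokens` values in cassandra.yaml, the following rough sketch shows the expected effect: with randomly placed vnode tokens, a node's average share of the ring is proportional to its token count. The node names are hypothetical, and replication is ignored.

```python
def expected_ownership(num_tokens_by_node):
    """Return each node's expected fraction of the ring.

    Assumes randomly placed vnode tokens, so ownership is proportional
    to a node's token count on average (replication is ignored).
    """
    total = sum(num_tokens_by_node.values())
    return {node: tokens / total for node, tokens in num_tokens_by_node.items()}


# Hypothetical cluster: one larger machine and two smaller ones.
shares = expected_ownership({"big-node": 256, "small-node": 128, "tiny-node": 128})
for node, share in shares.items():
    print(f"{node}: {share:.0%}")
```

Actual ownership in a real cluster varies around these averages, so this is only a planning estimate.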

Re: Storing videos in cassandra

2016-11-14 Thread Benjamin Roth
Just wanted to know How does hulu or netflix store videos in cassandra. > >> > >> Do they just use references to the video files in the form of URL's and > store in the DB?? > >> > >> could someone please me on this. > >> > >> > >&g

Re: Storing videos in cassandra

2016-11-20 Thread Benjamin Roth
:03 PM, "raghavendra vutti" < >>> raghu9raghaven...@gmail.com> wrote: >>> >>>> Hi, >>>> >>>> Just wanted to know How does hulu or netflix store videos in cassandra. >>>> >>>> Do they just use referen

Re: Cassandra Config as per server hardware for heavy write

2016-11-22 Thread Benjamin Roth

Re: Cassandra Config as per server hardware for heavy write

2016-11-23 Thread Benjamin Roth
; > AND dclocal_read_repair_chance = 0.1 > > AND default_time_to_live = 0 > > AND gc_grace_seconds = 864000 > > AND max_index_interval = 2048 > > AND memtable_flush_period_in_ms = 0 > > AND min_index_interval = 128 > > AND read_repair_

Re: Cassandra Config as per server hardware for heavy write

2016-11-23 Thread Benjamin Roth
paid int, > > position text, > > pcid bigint, > > refurl text, > > sec text, > > siid bigint, > > tmpid bigint, > > xforwardedfor text, > > PRIMARY KEY (date, userid, time, xid) > > ) WITH CLUSTERING ORDER BY (userid ASC, time ASC, xid A

Re: repair -pr in crontab

2016-11-24 Thread Benjamin Roth
I recommend using cassandra-reaper. Using cron jobs without proper monitoring will most likely not work as expected. There are some reaper forks on GitHub. You have to check which one works with your Cassandra version. The original one from Spotify only works on 2.x, not on 3.x. Am 25.11.2016 07:31 schr

Re: repair -pr in crontab

2016-11-25 Thread Benjamin Roth
>> Hi Artur, >> When I asked similar questions, someone pointed me to the links below >> and they are helpful. >> >> See http://www.datastax.com/dev/blog/repair-in-cassandra >> https://lostechies.com/ryansvihla/2015/09/25/cassandras-repair-should-be-calle

Bootstrap fails on 3.10

2016-11-25 Thread Benjamin Roth
node with auto_bootstrap=false to get it up and running and I ran repair afterwards. I'd like to avoid that repair and all the inconsistencies this time.

Re: Bootstrap fails on 3.10

2016-11-25 Thread Benjamin Roth
e/CASSANDRA-12905. I will bump its priority to critical > since it can prevent or complicate bootstrap. > > Did you try resuming bootstrap with "nodetool bootstrap resume" after the > failure? It may eventually succeed, since this is an MV lock contention > problem. >

Re: Bootstrap fails on 3.10

2016-11-25 Thread Benjamin Roth
I proposed a quite simple fix for https://issues.apache.org/jira/browse/CASSANDRA-12905. Sorry that I'm not supplying a patch. I am good at analysing code but totally inexperienced with the workflows here. 2016-11-25 19:57 GMT+01:00 Benjamin Roth : > Yes, I have MVs. > > Interesting i

Re: Java GC pauses, reality check

2016-11-25 Thread Benjamin Roth
Lol. The counter-proof is to use another memory model like Arc. That's why I personally think Java is NOT the first choice for server applications. But that's a philosophical discussion. On 25.11.2016 23:38, "Kant Kodali" wrote: > +1 Chris Lohfink response > > I would also restate the following sen

Re: Java GC pauses, reality check

2016-11-25 Thread Benjamin Roth
This sounds amazing but also expensive - I don't see pricing on their page. Are you able and allowed to share a rough price range? On 26.11.2016 04:33, "Harikrishnan Pillai" wrote: > We are running Azul Zing in prod with 1 million reads/s and 100 K writes/s > with Azul. We never had a major gc

Re: Java GC pauses, reality check

2016-11-25 Thread Benjamin Roth
ing is on a sliding scale depending upon > utilization, and not ridiculous. > > - James > > Sent from my iPhone > > On Nov 25, 2016, at 10:40 PM, Benjamin Roth > wrote: > > This sounds amazing but also expensive - I don't see pricing on their > page. Are you a

Re: Java GC pauses, reality check

2016-11-26 Thread Benjamin Roth
with it. On 27.11.2016 05:28, "Kant Kodali" wrote: > Benjamin Roth: How do you know Arc eliminates GC pauses completely? By > completely I mean no GC pauses whatsoever. > > When you say Java is NOT the First choice for Server Applications you are > generalizing it too

Re: Java GC pauses, reality check

2016-11-26 Thread Benjamin Roth
't any lesser > pain :) > > Not only Cassandra that runs on JVM. Majority of Apache projects do run on > JVM for a reason. > > Bottom line. My point here is there are pros and cons of every language. > It doesn't make much sense to target one language.

Re: Java GC pauses, reality check

2016-11-27 Thread Benjamin Roth
> I do know for the fact lot of people are having trouble to reach those > benchmarks). > > SEDA to TPC is a very big change. Let's see how long it would take for > Apache C* > > https://issues.apache.org/jira/browse/CASSANDRA-10989 > > > > > On Sat, Nov 26, 20

Re: Java GC pauses, reality check

2016-11-27 Thread Benjamin Roth
run time, not compile time. It's not free > either. Every time a reference is added, there's overhead in tracking it. > It also doesn't catch cycles. You still need garbage collection to avoid > memory leaks. > > On Sun, Nov 27, 2016 at 12:31 AM Benjamin Roth > wrote: >

Re: What is the size of each Virtual Node token range?

2016-11-28 Thread Benjamin Roth
e > maximum total number of rows in a server is also predefined? > > maximum total number of rows in a server = num_tokens_in_vnode_1 + > num_tokens_in_vnode_2 + num_tokens_in_vnode_3 + ... + > num_tokens_in_vnode_256

Re: Which version is stable enough for production environment?

2016-11-29 Thread Benjamin Roth
y, November 29, 2016 5:42 PM > *To:* user > *Subject:* Which version is stable enough for production environment? > > Hi Cassandra Experts, > > We prepare to deploy Cassandra in production env, but we > can not confirm which version is stable and recommended, could someone in &g

Re: Which version is stable enough for production environment?

2016-11-29 Thread Benjamin Roth
> On 30 November 2016 at 18:20, Benjamin Roth > wrote: > >> What are the compaction issues / hint corruptions you encountered?

Re: Which version is stable enough for production environment?

2016-11-30 Thread Benjamin Roth
; and let us know. We're well aware of the issues affecting MVs, but they > haven't really been solved anywhere yet. > > On 30 November 2016 at 07:54, Benjamin Roth > wrote: > >> Hi Brooke, >> >> Just had a quick look on your code and I will promise that you

Re: Which version is stable enough for production environment?

2016-11-30 Thread Benjamin Roth
> Also when I tested some of our write-heavy workload, Leveled Compaction was > not keeping up. With the same system settings, 2.1.16 performs better and all > levels were properly aligned. > -- > *From:* Benjamin Roth > *Sent:* Tuesday, November 29, 2016 11:20:1

Re: Why does `now()` produce different times within the same query?

2016-11-30 Thread Benjamin Roth
Great comment. +1 Am 01.12.2016 06:29 schrieb "Ben Bromhead" : > tl;dr +1 yup raise a jira to discuss how now() should behave in a single > statement (and possible extend to batch statements). > > The values of now should be the same if you assume that now() works like > it does in relational dat

Re: Sanity checks to run post restore data?

2016-12-01 Thread Benjamin Roth
to learn, what sanity > checks should be performed after restoring them? > > Thanks, > Varun

Re: Cassandra 2.x Stability

2016-12-01 Thread Benjamin Roth

Re: Cassandra: maximum size of collection list type

2016-12-01 Thread Benjamin Roth
You can read it in the docs, but I think it was 2^16, aka 64k. On 01.12.2016 18:00, "Selvam Raman" wrote: > Hi, > > What is the maximum size which can be stored into collection list(in a row > ) in cassandra. > > -- > Selvam Raman > "லஞ்சம் தவிர்த்து நெஞ்சம் நிமிர்த்து" >

Re: Cassandra 2.x Stability

2016-12-02 Thread Benjamin Roth
-12-01 19:14 GMT+01:00 Kai Wang : > Ben, I just read through those two tickets. It's scarier than I thought. > Thank you for all the investigations and comments. > > On Thu, Dec 1, 2016 at 10:31 AM, Benjamin Roth > wrote: > >> A little experience report on MVs: &g

Re: Which version is stable enough for production environment?

2016-12-02 Thread Benjamin Roth
Feedback; Export >>> https://issues.apache.org/jira/browse/CASSANDRA-12844 >>> >>> >>> Also when I tested some of our write-heavy workload, Leveled Compaction >>> was not keeping up. With the same system settings, 2.1.16 performs better and all >>>

Re: node decommission throttled

2016-12-07 Thread Benjamin Roth
Maybe your system cannot stream faster. Is your CPU or HDD/SSD fully utilized? On 07.12.2016 16:07, "Eric Evans" wrote: > On Tue, Dec 6, 2016 at 9:54 AM, Aleksandr Ivanov wrote: > > I'm trying to decommission one C* node from 6 nodes cluster and see that > > outbound network traffic on this nod

Re: Batch size warnings

2016-12-07 Thread Benjamin Roth
Could you please be more specific? On 07.12.2016 17:10, "Voytek Jarnot" wrote: > Should've mentioned - running 3.9. Also - please do not recommend MVs: I > tried, they're broken, we punted. > > On Wed, Dec 7, 2016 at 10:06 AM, Voytek Jarnot > wrote: > >> The low default value for batch_size_w

Re: Batch size warnings

2016-12-07 Thread Benjamin Roth
guarantee atomicity. > > On Wed, Dec 7, 2016 at 10:13 AM, Benjamin Roth > wrote: > >> Could you please be more specific? >> >> Am 07.12.2016 17:10 schrieb "Voytek Jarnot" : >> >>> Should've mentioned - running 3.9. Also - please do not rec

Re: Batch size warnings

2016-12-07 Thread Benjamin Roth
ch related to > the stuff you're dealing with ... Basically Cassandra just stepping on its > own er, tripping over its own feet streaming MVs. > > On Dec 7, 2016 10:45 AM, "Benjamin Roth" wrote: > >> I meant the mv thing >> >> Am 07.12.2016

Re: node decommission throttled

2016-12-08 Thread Benjamin Roth
Just an educated guess: do you have materialized views? They are known to stream very slowly. On 08.12.2016 10:28, "Aleksandr Ivanov" wrote: > Yes, I use compression. > Tried without and it gave ~15% increase in speed, but is still too low > (~35Mbps) > > On sending side no high CPU/IO/etc utilizatio

Re: Are Materialized views persisted on disk?

2016-12-13 Thread Benjamin Roth
The word "materialized" implies that. 2016-12-13 20:34 GMT+01:00 Carl Yeksigian : > Yes, they are stored on disk like a normal table. > > On Tue, Dec 13, 2016 at 2:31 PM, Kant Kodali wrote: > >> Are Materialized views persisted on disk? sorry for the naive question

Re: Are Materialized views persisted on disk?

2016-12-13 Thread Benjamin Roth
> People should be able to ask legit questions here without getting snarky > answers, please don't do that. Not everyone has the same background or > knowledge that you do. > > On Tue, Dec 13, 2016 at 11:49 AM Benjamin Roth > wrote: > >> The word "materialized

Re: Cassandra MV insert Warning

2016-12-14 Thread Benjamin Roth
11) ~[na:1.8.0_65]
> at org.apache.cassandra.concurrent.AbstractLocalAwareExecutorService$FutureTask.run(AbstractLocalAwareExecutorService.java:164) ~[apache-cassandra-3.7.0.jar:3.7.0]
> at org.apache.cassandra.concurrent.SEPWorker.run(SEPWorker.java:105) [apache-cassandra-3.7

Re: All nodes hosting replicas down

2016-12-18 Thread Benjamin Roth
jean paul >> wrote: >> >>> Hi, >>> >>> Please, if we choose a replication factor =2 (simple strategy), so, we >>> have two replicas of data on the ring. >>> >>> What happen in the case of all nodes containing replicas are down ? &g

Re: Materialised view for sets of UUID

2016-12-21 Thread Benjamin Roth
the second variant has to go through many rows to collect > all non-deleted clustering key values. Write performance is > slightly better for the first variant because one table + two > materialised views is more expensive than two tables. > > What would you prefer? > > Bye, > Torsten. > > -- > Torsten Bronger

Re: Why does Cassandra recommends Oracle JVM instead of OpenJDK?

2017-01-02 Thread Benjamin Roth
Does this discussion really make sense any more? To me it seems to have turned opinionated and religious. From my point of view, everything that had to be said has been said. On 02.01.2017 21:27, "Edward Capriolo" wrote: > > > On Mon, Jan 2, 2017 at 11:56 AM, Eric Evans > wrote: > >> On Fri, Dec 23, 2016 a

Re: Trying to identify the cause of these errors.

2017-01-10 Thread Benjamin Roth
-cassandra-3.3.0.jar:3.3.0]
> at org.apache.cassandra.concurrent.SEPWorker.run(SEPWorker.java:105) [apache-cassandra-3.3.0.jar:3.3.0]
> at java.lang.Thread.run(Thread.java:745) [na:1.8.0_111]
> Caused by: java.lang.NullPointerException: null
> WARN [SharedPool-Worker-2] 2017-01-09 15:04:41,678 AbstractLocalAwareExecutorService.java:169 - Uncaught exception on thread Thread[SharedPool-Worker-2,5,main]: {}
> java.lang.RuntimeException: java.lang.NullPointerException
> at org.apache.cassandra.service.StorageProxy$DroppableRunnable.run(StorageProxy.java:2461) ~[apache-cassandra-3.3.0.jar:3.3.0]
> at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) ~[na:1.8.0_111]
> at org.apache.cassandra.concurrent.AbstractLocalAwareExecutorService$FutureTask.run(AbstractLocalAwareExecutorService.java:164) ~[apache-cassandra-3.3.0.jar:3.3.0]
> at org.apache.cassandra.concurrent.AbstractLocalAwareExecutorService$LocalSessionFutureTask.run(AbstractLocalAwareExecutorService.java:136) [apache-cassandra-3.3.0.jar:3.3.0]
> at org.apache.cassandra.concurrent.SEPWorker.run(SEPWorker.java:105) [apache-cassandra-3.3.0.jar:3.3.0]
> at java.lang.Thread.run(Thread.java:745) [na:1.8.0_111]
>
> Regards,
> DHRUVA GOPAL
> sr. MANAGER, ENGINEERING, REPORTING, ANALYTICS AND BIG DATA

Change primary key from int to bigint

2017-01-11 Thread Benjamin Roth
or does the SSTable serialization and maybe the token generation require the tables to be completely reread+rewritten?

Re: Change primary key from int to bigint

2017-01-11 Thread Benjamin Roth
ompatible serialization types, so that won't work. > However, changing to 'varint' will work fine. > > Hope that helps. > > Tom > > > > On Wed, Jan 11, 2017 at 9:21 AM, Benjamin Roth > wrote: > >> Hi there, >> >> Does anyone know if there is
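The serialization point in this reply can be illustrated with a short Python sketch. It assumes the usual CQL wire encodings (int as 4-byte and bigint as 8-byte big-endian two's complement, varint as a variable-length big-endian two's complement value); the helper names are hypothetical and are not part of any Cassandra driver.

```python
import struct


def encode_int(value):
    # CQL int: 4-byte big-endian two's complement
    return struct.pack(">i", value)


def encode_bigint(value):
    # CQL bigint: 8-byte big-endian two's complement
    return struct.pack(">q", value)


def decode_varint(raw):
    # CQL varint: variable-length big-endian two's complement
    return int.from_bytes(raw, byteorder="big", signed=True)


for value in (1, 128, 256, 16777215, 16777216, 2147483647, -1):
    as_int = encode_int(value)
    # The stored 4 bytes decode to the same number when read as a varint ...
    assert decode_varint(as_int) == value
    # ... but they are not the 8 bytes a bigint column expects.
    assert as_int != encode_bigint(value)
    assert len(as_int) == 4 and len(encode_bigint(value)) == 8
```

Under these assumptions, existing 4-byte values remain readable as varint, while a bigint column would need every stored value rewritten to 8 bytes.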

Re: Change primary key from int to bigint

2017-01-11 Thread Benjamin Roth
16777216 | test > 2147483647 | test > > (3 rows) > cqlsh> select * from foo; > > id | value > +--- > 128 | test >16777216 | test > 1 | test > 2147483647 | test >16777215 | test > 256 | t

Re: Change primary key from int to bigint

2017-01-11 Thread Benjamin Roth
>> For my understanding: >> When creating a new table, is bigint or varint a better choice for >> storing (up to) 64bit ints? Is there a difference in performance? >> >> 2017-01-11 9:39 GMT+01:00 Tom van der Woerdt < >> tom.vanderwoe...@booking.com>: >>

Re: Strange issue wherein cassandra not being started from cron

2017-01-11 Thread Benjamin Roth
I think you should take a look at supervisord or something similar. This is a much more reliable solution than using cron jobs. On 12.01.2017 06:12, "Ajay Garg" wrote: On Wed, Jan 11, 2017 at 8:29 PM, Martin Schröder wrote: > 2017-01-11 15:42 GMT+01:00 Ajay Garg : > > Tried everything. > > Then try >

Re: Strange issue wherein cassandra not being started from cron

2017-01-11 Thread Benjamin Roth
's why you have a cluster of them. > > Best > Martin

Re: Queries execution time

2017-01-12 Thread Benjamin Roth
> Hi, > Does anyone know if there is a way to record in a log file the queries > total or partial execution time? I am interested in something similar to > the tracing option but on file. > > Thanks > Best Regards > Salvatore

Re: implementing a 'sorted set' on top of cassandra

2017-01-13 Thread Benjamin Roth
Not if you want to sort by score (a counter). On 14.01.2017 08:33, "DuyHai Doan" wrote: > Clustering column can be seen as sorted set > > Table abstraction == Map> > > > On Sat, Jan 14, 2017 at 2:28 AM, Edward Capriolo > wrote: > >> >> >> On Fri, Jan 13, 2017 at 8:14 PM, Jonathan Haddad >> wro

Re: implementing a 'sorted set' on top of cassandra

2017-01-13 Thread Benjamin Roth
inutes or so) > 3. Write back to a different CF with the ordered keys I care about > > Does this seem crazy? Is there a simpler way to do this in cassandra?

Re: implementing a 'sorted set' on top of cassandra

2017-01-14 Thread Benjamin Roth
ince counter >> cannot be used as type for clustering column (which allows sort) >> >> Or use simple numeric type on clustering column but then to increment the >> value *concurrently* and *safely* it's prohibitive (SELECT to fetch current >> value + UPDATE ... IF

Re: implementing a 'sorted set' on top of cassandra

2017-01-17 Thread Benjamin Roth
rmance hit there? My thought was to make a >> tradeoff by using more cassandra disk space (ie pre calculating all sets), >> in exchange for faster reads when requests actually come in that need this >> data. >> >> From: Benjamin Roth >> Reply-To: "user@

Re: parallel processing - splitting data

2017-01-19 Thread Benjamin Roth
the replicated data. > > Many thanks for any help and guidance, > > Frank Hughes

Re: parallel processing - splitting data

2017-01-19 Thread Benjamin Roth
ow to > do the split. > > e.g. from node 1 > > Token ranges - start:-5144720537407094184 end:-5129226025397315327 > > This token range isn't returned by node 2, 3 or 4. > > Thanks again > > Frank > > On 19 January 2017 at 12:19, Benjamin Roth > wrote: &

Re: Huge size of system.batches table after dropping an incomplete Materialized View

2017-01-22 Thread Benjamin Roth
me as when I tried to drop the MV. > > Some insight and suggestions regarding it will be very helpful. I will > like to know if i can safely truncate the table, rm the files or any other > approach to clean it up? > > Thanks.

Re: Getting Error while Writing in Multi DC mode when Remote Dc is Down.

2017-01-23 Thread Benjamin Roth
08%2005591> (Mobile)* > > Times Internet Ltd. | A Times of India Group Company > > FC - 6, Sector 16A, Film City, Noida, U.P. 201301 | INDIA > > *P** Please do not print this email unless it is absolutely necessary. > Spread environmental awareness.* > > > We th

Re: Getting Error while Writing in Multi DC mode when Remote Dc is Down.

2017-01-23 Thread Benjamin Roth
FC - 6, Sector 16A, Film City, Noida, U.P. 201301 | INDIA > > *P** Please do not print this email unless it is absolutely necessary. > Spread environmental awareness.* > > > > *From:* Benjamin Roth [mailto:benjamin.r...@jaumo.com] > *Sent:* Monday, January 23, 2017 5:05 PM > *T

Re: Huge size of system.batches table after dropping an incomplete Materialized View

2017-01-23 Thread Benjamin Roth
top these errors. >> >> Now I see huge size of system.batches table on one of the nodes. It seems >> related to issues mentioned above since last modification timestamps of the >> sstable files inside system/batches is same as when I tried to drop the MV. >> >

Re: [Multi DC] Old Data Not syncing from Existing cluster to new Cluster

2017-01-24 Thread Benjamin Roth
uted > > > > Cassandra version : 3.0.9 > > > > > > *Thanks & Regards,* > *Abhishek Kumar Maheshwari* > *+91- 805591 <+91%208%2005591> (Mobile)* > > Times Internet Ltd. | A Times of India Group Company > > FC - 6, Sector 16A, Film City, N

Re: [Multi DC] Old Data Not syncing from Existing cluster to new Cluster

2017-01-24 Thread Benjamin Roth
- 6, Sector 16A, Film City, Noida, U.P. 201301 | INDIA > > *P** Please do not print this email unless it is absolutely necessary. > Spread environmental awareness.* > > > > *From:* Benjamin Roth [mailto:benjamin.r...@jaumo.com] > *Sent:* Tuesday, January 24, 2017 8:55 PM > *

Re: [Multi DC] Old Data Not syncing from Existing cluster to new Cluster

2017-01-24 Thread Benjamin Roth
es of India Group Company > > FC - 6, Sector 16A, Film City, Noida, U.P. 201301 | INDIA > > *P** Please do not print this email unless it is absolutely necessary. > Spread environmental awareness.* > > > > *From:* Benjamin Roth [mailto:benjamin.r...@jaumo.com] > *Sent:* Tues

CS process killed by kernel OOM

2017-01-25 Thread Benjamin Roth
anyone encountered something like this before?

Re: Disc size for cluster

2017-01-26 Thread Benjamin Roth
Strategy. > Is it correct, that SizeTiered Compaction Strategy needs (in the worst > case) 50% free disc space during compaction? > > So this would then result in a cluster of 3TB x 3 x 2 == 18 TB of raw > storage? > > Thanks and Regards > Raphael Vogel
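The sizing arithmetic in the question above can be sketched as follows. This is only an illustration of the poster's own numbers (3 TB of data, three replicas, 50% of each disk kept free for compaction); the function name and the `compaction_headroom` parameter are hypothetical, and the 50% worst case is the assumption stated in the question, not a confirmed figure.

```python
def raw_storage_tb(data_tb, replication_factor, compaction_headroom=0.5):
    """Raw capacity needed so that `compaction_headroom` of the disk stays free.

    data_tb: size of one copy of the data, in TB
    replication_factor: number of replicas kept by the cluster
    compaction_headroom: fraction of disk reserved for compaction (assumed 0.5)
    """
    if not 0.0 <= compaction_headroom < 1.0:
        raise ValueError("compaction_headroom must be in [0, 1)")
    replicated_tb = data_tb * replication_factor
    # Only (1 - headroom) of the raw capacity may hold live data.
    return replicated_tb / (1.0 - compaction_headroom)


# The numbers from the question: 3 TB x 3 replicas x 2 (50% headroom).
print(raw_storage_tb(3, 3))
```

With a smaller headroom the multiplier shrinks accordingly, which is why the choice of compaction strategy matters for the estimate.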

Re: Time series data model and tombstones

2017-01-28 Thread Benjamin Roth
Maybe trace your queries to see what's happening in detail. On 28.01.2017 21:32, "John Sanda" wrote: Thanks for the response. This version of the code is using STCS. gc_grace_seconds was set to one day and then I changed it to zero since RF = 1. I understand that expired data will still generat

Re: Is it possible to have a column which can hold any data type (for inserting as json)

2017-01-31 Thread Benjamin Roth
n I try to insert the data as JSON I got the error : *Error decoding > JSON value for value: Expected a UTF-8 string, but got a Double: 1.390692*. > The message is clear that a double value cannot be inserted to text column. > The real issue is that the value can be of any data type, so

RE: Is it possible to have a column which can hold any data type (for inserting as json)

2017-02-01 Thread Benjamin Roth
: 1, > >"address":"", > >"datatype":"DOUBLE", > >"name":"Longitude", > >"attributes":{ > > "ID":"1" > > }, > >"category":"REAL", >

RE: Is it possible to have a column which can hold any data type (for inserting as json)

2017-02-01 Thread Benjamin Roth
02.02.2017 05:30, "Rajeswari Menon" wrote: > Yes. Is there any way to define value to accept any data type as the json > value data may vary? Or is there any way to do the same without defining a > schema? > > > > Regards, > > Rajeswari > > > > *Fro

Re: CS process killed by kernel OOM

2017-02-06 Thread Benjamin Roth
or not. It also was my first thought but in the end the main thing is, it works again and it does with more min_free_kbytes 2017-02-06 11:53 GMT+01:00 Avi Kivity : > > On 01/26/2017 07:36 AM, Benjamin Roth wrote: > > Hi there, > > We installed 2 new nodes these days. They run on

Re: CS process killed by kernel OOM

2017-02-06 Thread Benjamin Roth
d reporting it, it will help others avoid encountering the > same problem if it gets fixed. > > On 02/06/2017 03:07 PM, Benjamin Roth wrote: > > Thanks for the reply. We got rid of the OOMs by increasing > vm.min_free_kbytes, its default of approx. 90 MB is maybe a bit low for

Re: Why does CockroachDB github website say Cassandra has no Availability on datacenter failure?

2017-02-07 Thread Benjamin Roth
Ask for forgiveness, not for permission, if you do marketing ;) On 07.02.2017 13:11, "Kant Kodali" wrote: > lol. But seriously are they even allowed to say something that is not true > about another product? > > On Tue, Feb 7, 2017 at 4:05 AM, kurt greaves wrote: > >> Marketing never lies. Ever

DELETE/SELECT with multi-column PK and IN

2017-02-09 Thread Benjamin Roth
lp to have it for daily business, IMHO it's a waste of resources to run multiple queries just to fetch a bunch of records by a PK. Thanks in advance for any reply

Re: DELETE/SELECT with multi-column PK and IN

2017-02-09 Thread Benjamin Roth
ent to (PK1=1 AND PK2=2) or are you > looking for (PK1 IN (1,2) AND PK2 IN (1,2)) or something else? > > Cheers > Ben > > On Thu, 9 Feb 2017 at 20:09 Benjamin Roth wrote: > >> Hi Guys, >> >> CQL says this is not allowed: >> >> DELETE FROM ks.cf WH

Re: DELETE/SELECT with multi-column PK and IN

2017-02-09 Thread Benjamin Roth
ment per PK tuple would be roughly equivalent? And probably > no more complex to generate in the client? > > On Thu, 9 Feb 2017 at 20:22 Benjamin Roth wrote: > >> Maybe that makes it clear: >> >> DELETE FROM ks.cf WHERE (partitionkey1, partitionkey2) IN ((1, 2), (1,
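The suggestion quoted above, one statement per PK tuple generated in the client, might look roughly like this. It is a minimal sketch that only builds statement strings and parameter tuples; it does not talk to a cluster. The table and column names (`ks.cf`, `partitionkey1`, `partitionkey2`) come from the thread, while the helper name and the key values `(1, 2)` and `(3, 4)` are hypothetical.

```python
def delete_statements(table, key_columns, key_tuples):
    """Build one parameterised DELETE per primary-key tuple.

    Returns a list of (cql, params) pairs; the same CQL text is reused for
    every tuple, so a client could prepare it once and bind each params tuple.
    """
    where = " AND ".join(f"{column} = ?" for column in key_columns)
    cql = f"DELETE FROM {table} WHERE {where}"
    statements = []
    for values in key_tuples:
        if len(values) != len(key_columns):
            raise ValueError(f"expected {len(key_columns)} key values, got {values!r}")
        statements.append((cql, tuple(values)))
    return statements


# Hypothetical key values, for illustration only.
for cql, params in delete_statements(
    "ks.cf", ["partitionkey1", "partitionkey2"], [(1, 2), (3, 4)]
):
    print(cql, params)
```

Whether the resulting statements are sent individually or grouped in a batch is a separate decision that the thread goes on to discuss.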

Re: DELETE/SELECT with multi-column PK and IN

2017-02-09 Thread Benjamin Roth
around with a batch if you need atomicity. > > Which is not saying it will never be and shouldn't be supported btw, there > is something to be said for the consistency of the CQL language in general. > But it's why no-one took time to do it so far. > > On Thu,

Re: DELETE/SELECT with multi-column PK and IN

2017-02-09 Thread Benjamin Roth
id quite a bit of > benchmarking on this topic for his talk at last year’s C* summit: > http://www.slideshare.net/DataStax/microbatching-highperformance-writes-adam-zegelin-instaclustr-cassandra-summit-2016 > > On Thu, 9 Feb 2017 at 20:52 Benjamin Roth wrote: > >> Ok g

Re: DELETE/SELECT with multi-column PK and IN

2017-02-09 Thread Benjamin Roth
Ok now I REALLY got it :) Thanks Sylvain! 2017-02-09 11:42 GMT+01:00 Sylvain Lebresne : > On Thu, Feb 9, 2017 at 10:52 AM, Benjamin Roth > wrote: > >> Ok got it. >> >> But it's interesting that this is supported: >> DELETE/SELECT FROM ks.cf WHERE (

Re: cassandra user request log

2017-02-10 Thread Benjamin Roth
17-02-10 9:42 GMT+01:00 vincent gromakowski : > Hi all, > Is there any way to trace user activity at the server level to see which > user is accessing which data? Do you think it would be simple to implement? > Tx
