Exception when setting tokens for the cassandra nodes

2013-04-29 Thread Rahul
Source) at javax.management.remote.rmi.RMIConnector$RemoteMBeanServerConnection.getAttribute(RMIConnector.java:901) at javax.management.MBeanServerInvocationHandler.invoke(MBeanServerInvocationHandler.java:280) Any suggestions towards solving this problem would be deeply appreciated. thanks, rahul --

RE: Exception when setting tokens for the cassandra nodes

2013-04-30 Thread Rahul
63) for i in range(2)]' > > ** ** > > which gives the following values: > > ['-9223372036854775808', '0'] > > ** ** > > *From:* Rahul [mailto:rahule...@gmail.com] > *Sent:* Monday, April 29, 2013 7:23 PM > *To:* user@cassan

Re: Best way to Drop Tombstones/after GC Grace

2018-03-15 Thread Rahul Singh
continue. It may be a very simple fix on their end to save from potential risks, extra ongoing work, and bad practices. This kind of “system debt” catches up. Better to nip it now. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Mar 14, 2018, 7:31 PM -0400, Madhu B , wrote: > Thank

Re: Delete System_Traces Table

2018-03-19 Thread Rahul Singh
I think he just wants to delete the test table not the whole keyspace. Is that correct? -- Rahul Singh rahul.si...@anant.us Anant Corporation On Mar 19, 2018, 9:08 AM -0500, Chris Lohfink , wrote: > No. > > Why do you want to? If you don't use tracing they will be empty, and if w

Re: Is this SSTable restore merging scenario possible ?

2018-03-21 Thread Rahul Singh
If its not on the same “cluster” and you are not using something like OpsCenter, the snapshotted files will have a diferent schema UUID for each entity. If you rename the files to have the matching UUID in the file names, then you should be able to do what you are talking about. On Mar 21, 2018

Re: How to Protect Tracing Requests From Client Side

2018-03-22 Thread Rahul Singh
Execute ‘nodetool settraceprobability 0’ on all nodes. It does zero percentage of he tracing. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Mar 22, 2018, 11:10 AM -0500, shalom sagges , wrote: > Hi All, > > Is there a way to protect C* on the server side from tracing comm

Re: Using Spark to delete from Transactional Cluster

2018-03-22 Thread Rahul Singh
which will eventually get cleansed. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Mar 22, 2018, 2:19 PM -0500, Charulata Sharma (charshar) , wrote: > Hi, >    Wanted to know the community’s experiences and feedback on using Apache > Spark to delete data from C* transactiona

Re: Using Spark to delete from Transactional Cluster

2018-03-23 Thread Rahul Singh
really old data expire .. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Mar 23, 2018, 11:38 AM -0700, Charulata Sharma (charshar) , wrote: > Hi Rahul, > Thanks for your answer. Why do you say that deleting from spark is > not elegant?? This is the exact feedba

Re: Cassandra certification

2018-03-24 Thread Rahul Singh
. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Mar 24, 2018, 5:38 AM -0700, Chandan Goel , wrote: > Hi, > > Is the professional certification on Cassandra given by Datastax recognized ? > Oreilly has stopped giving any certifications , it seems. Is there any other > a

Re: Can "data_file_directories" make use of multiple disks?

2018-03-27 Thread Rahul Singh
Yes you can have multiple entries from multiple disks. No guarantee as I can see of even distribution. If you want even distribution there are better mechanisms for this at the filesystem later. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Mar 27, 2018, 8:05 AM -0700, Venkata Hari

RE: RE: 答复: A node down every day in a 6 nodes cluster

2018-03-27 Thread Rahul Singh
It may be that the wife partition is bombarded more than other partitions. What’s your RF on that keyspace? If if it’s greater than 1 I’d expect other nodes to get the same type of load. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Mar 27, 2018, 5:56 AM -0700, Kenneth Brotman

Re: Is Cassandra used in Medical industry?

2018-03-29 Thread Rahul Singh
Is that an encryption related policy? If you can clarify — maybe able to get better answers. There are products like Vormetrics (?) which can encrypt data at rest. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Mar 29, 2018, 12:23 AM -0400, Sudhakar Ganesan , wrote: > Hi, >

Re: datastax cassandra minimum hardware recommendation

2018-04-04 Thread Rahul Singh
Agree with Alain. Remember that DSE is not Cassandra. It includes Cassandra, SolR, Spark, and Graph. So if you run all of some , it’s more than just Cassandra. OpsCenter is another thing altogether. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Apr 4, 2018, 5:42 AM -0400, Alain

Re: Urgent Problem - Disk full

2018-04-04 Thread Rahul Singh
Nothing a full repair won’t be able to fix. On Apr 4, 2018, 7:32 AM -0400, Jürgen Albersdorfer , wrote: > Hi, > > I have an urgent Problem. - I will run out of disk space in near future. > Largest Table is a Time-Series Table with TimeWindowCompactionStrategy (TWCS) > and default_time_to_live =

Re: How to restrict users to specific DC.

2018-04-10 Thread Rahul Singh
That seems to be more of a network segmentation issue. Protect the other nodes behind a firewall / security group. Each node in the different DCs would be able to talk to each other but the user client machine can only access the traffic only DC -- Rahul Singh rahul.si...@anant.us Anant

Re: Nodetool repair multiple dc

2018-04-13 Thread Rahul Singh
Makes sense it takes a long time since it has to reconcile against replicas in all DCs. I leverage commercial tools for production clusters, but I’m pretty sure Reaper is the best open source option. Otherwise you’ll waste a lot of time trying to figure it out own your own. No need to reinvent t

Re: Cassandra datastax cerrification

2018-04-14 Thread Rahul Singh
It’s good for a certification... there was a discussion in this list two weeks ago on the same subject. Summary: 1. Yes it’s fine. 2. You need experience or talent to get hired. 3. Talent can be either proven experience or your activity on the Cassandra project. -- Rahul Singh rahul.si

Re: copy from one table to another

2018-04-17 Thread Rahul Singh
1. Make a new table with the same schema. For each node 2. Shutdown node 3. Copy data from Source sstable dir to new sstable dir. This will do what you want. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Apr 16, 2018, 4:21 PM -0500, Kyrylo Lebediev , wrote: > Thanks,  Ali. >

Re: where does c* store the schema?

2018-04-17 Thread Rahul Singh
It uses a “everywhere” replication strategy and its recommended to do all alter / create / drop statements with consistency level all — meaning it wouldn’t make the change to the schema if the nodes are up. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Apr 17, 2018, 12:31 AM -0500

Re: Cassandra read process

2018-04-17 Thread Rahul Singh
Did you look at the answer the guy gave? -- Rahul Singh rahul.si...@anant.us Anant Corporation On Apr 17, 2018, 5:12 AM -0500, vishal1.sha...@ril.com, wrote: > Dear Community, > > Can you please help in answering the question below: > > https://stackoverflow.com/questions/497

Re: multiple table directories for system_schema keyspace

2018-04-17 Thread Rahul Singh
it reinitialized the system. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Apr 17, 2018, 2:25 PM -0500, John Sanda , wrote: > On a couple different occasions I have run into this exception at start up: > > Exception (org.apache.cassandra.exceptions.InvalidRequest

Re: where does c* store the schema?

2018-04-18 Thread Rahul Singh
, It should catch up but every now and then if the changes are too great, it’s easier to run nodetool resetlocalschema https://docs.datastax.com/en/cassandra/2.1/cassandra/tools/toolsResetLocalSchema.html -- Rahul Singh rahul.si...@anant.us Anant Corporation On Apr 18, 2018, 1:17 AM -0500, Jinhua

Re: Phantom growth resulting automatically node shutdown

2018-04-19 Thread Rahul Singh
data growth. What does your cfstats / tablestats day? Are you monitoring your key tables data via cfstats metrics like SpaceUsedLive or SpaceUsedTotal. What is your snapshottjng / backup process doing? -- Rahul Singh rahul.si...@anant.us Anant Corporation On Apr 19, 2018, 7:01 AM -0500, horschi

Re: copy from one table to another

2018-04-19 Thread Rahul Singh
Each table has a different Guid — doing a hard link may work as long as the sstable dir’s guid is he same as the newly created table in the system schema. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Apr 19, 2018, 10:41 AM -0500, Kyrylo Lebediev , wrote: > The table is too la

Re: copy from one table to another

2018-04-21 Thread Rahul Singh
; and > table_name='usr'; > >  id > -- >  ea2f6da0-f931-11e7-8224-43ca70555242 > > > Directory name: > ./data/test/usr-ea2f6da0f93111e7822443ca70555242 > > Correct? > > Regards, > Kyrill > From: Rahul Singh &

Re: read repair with consistency one

2018-04-21 Thread Rahul Singh
Read repairs are one anti-entropy measure. Continuous repairs is another. If you do repairs via Reaper or your own method it will resolve your discrepencies. On Apr 21, 2018, 3:16 AM -0400, Grzegorz Pietrusza , wrote: > Hi all > > I'm a bit confused with how read repair works in my case, which i

Re: GUI clients for Cassandra

2018-04-23 Thread Rahul Singh
Zeppelin and Dbeaver EE are both good. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Apr 23, 2018, 12:53 AM -0400, Eunsu Kim , wrote: > I am now writing dbeaver EE, but I’m waiting for TeamSQL (https://teamsql.io) > to support cassandra. > > > On 23 Apr 2018, at 7:5

Re: Determining active sstables and table- dir

2018-05-01 Thread Rahul Singh
Schema column families is the most authoritative. You may have different data directories. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Apr 27, 2018, 1:24 PM -0700, Carl Mueller , wrote: > IN cases where a table was dropped and re-added, there are now two table > directorie

Re: How to measure time to execute joinToCassandraTable

2018-05-13 Thread Rahul Singh
and saved them into Cassandra. I could then later get time aggregates and average times per operation. -- Rahul Singh rahul.si...@anant.us Anant Corporation On May 13, 2018, 4:14 PM -0500, Guillermo Ortiz , wrote: > I'm using the driver from Cassandra-Spark, I would like to know if th

Re: estimated number of keys vs ttl

2018-05-23 Thread Rahul Singh
If the TTL actually reduces the key count , should. It’s possible to TTL a row from a partition but not the whole partition. 1 key = 1 partition != 1 row != 1 cell -- Rahul Singh rahul.si...@anant.us Anant Corporation On May 23, 2018, 6:07 AM -0500, Grzegorz Pietrusza , wrote: > Hi >

Re: EXT: Cassandra Monitoring tool

2018-05-25 Thread Rahul Singh
Good article about it on LI https://www.linkedin.com/pulse/snap-cassandra-s3-tablesnap-vijaya-kumar-hosamani/ On May 25, 2018, 2:52 PM -0500, Joaquin Casares , wrote: > Hello Aneesh, > > While this doesn't provide a GUI, tablesnap is a community tool that does a > great job at handling backups:

Re: cassandra update vs insert + delete

2018-05-27 Thread Rahul Singh
Deletes create tombstones — not really something to consider. Better to add / update or insert data and do a soft delete on old data and apply a TTL to remove it at a future time. -- Rahul Singh rahul.si...@anant.us Anant Corporation On May 27, 2018, 5:36 AM -0400, onmstester onmstester

Re: Fwd: Re: cassandra update vs insert + delete

2018-05-30 Thread Rahul Singh
> > My 2 cents, if you want to update some information just update it. There’s > > no need to overthink it. > > > > Batches are good if they’re constrained to a single partition, not so hot > > otherwise. > > > > > > On Sun, May 27, 2018 a

Re: Certified Cassandra for Enterprise use

2018-05-31 Thread Rahul Singh
DataStax services partner. -- Rahul Singh rahul.si...@anant.us Anant Corporation On May 29, 2018, 4:01 AM -0400, Ben Slater , wrote: > Hi Pranay > > We (Instaclustr) provide enterprise support for Cassandra > (https://www.instaclustr.com/services/cassandra-support/) which may cover &g

Re: nodetool repair -pr

2018-06-08 Thread Rahul Singh
>From DS dox : "Do not use -pr with this option to repair only a local data >center." On Jun 8, 2018, 10:42 AM -0400, user@cassandra.apache.org, wrote: > > nodetool repair -pr

Re: Options to replace hardware of the cluster

2018-06-14 Thread Rahul Singh
For no downtime and no lost data, I would make a new DC in the same cluster, and wait for the data / MVs to stream over. Otherwise, the best way is to snapshot everything and bring up the nodes all at once. On Jun 14, 2018, 4:11 AM -0400, Christian Lorenz , wrote: > Hi, > > we need to move our e

Re: Options to replace hardware of the cluster

2018-06-14 Thread Rahul Singh
How much daa do you have and what is the timeline? If you can manage with a maintenance window the snapshot / move and restore method may be the fastest. Streaming data can take a long time to sync two DCs if there is a lot of data. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Jun

RE: how to avoid lightwieght transactions

2018-06-21 Thread Rahul Singh
application such as an event queue — basically mitigates / hedged performance loss in doing LWT. You can always use CQRS without LWT. Rahul On Jun 21, 2018, 4:38 AM -0400, Jacques-Henri Berthemet , wrote: > Hi, > > Another way would be to make your PK a clustering key with Id as PK and t

RE: [EXTERNAL] Re: Tombstone

2018-06-21 Thread Rahul Singh
. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Jun 19, 2018, 12:39 PM -0400, Durity, Sean R , wrote: > This sounds like a queue pattern, which is typically an anti-pattern for > Cassandra. I would say that it is very difficult to get the access patterns, > tombstones, and everyt

Re: How do you monitoring Cassandra Cluster?

2018-06-21 Thread Rahul Singh
I’ve collected a bunch at http://leaves.anant.us/#!/?tag=cassandra,monitoring I reommend Grafana / Prometheus if you don’t have DSE (which has OpsCenter) -- Rahul Singh rahul.si...@anant.us Anant Corporation On Jun 19, 2018, 1:06 PM -0400, Romain Gérard , wrote: > Hi Felipe, > > Yo

Re: C* in multiple AWS AZ's

2018-06-28 Thread Rahul Singh
and at the same time add to the new one. Rahul On Jun 28, 2018, 9:03 AM -0500, Randy Lynn , wrote: > I have a 6-node cluster I'm migrating to the new i3 types. > But at the same time I want to migrate to a different AZ. > > What happens if I do the "running node replace method&

Re: C* in multiple AWS AZ's

2018-06-28 Thread Rahul Singh
replicas in the old DC. Then you can decommissioned. This way you are hundred percent sure that you aren’t missing any new data. No need for a DC to DC repair but a repair is always healthy. Rahul On Jun 28, 2018, 9:15 AM -0500, Randy Lynn , wrote: > Already running with Ec2. > > My

Re: Check Cluster Health

2018-06-28 Thread Rahul Singh
When you run TPstats or Tablestats subcommands in nodetool you are actually accessing data inside Cassandra via JMX. You can start there at first. Rahul On Jun 28, 2018, 10:55 AM -0500, Thouraya TH , wrote: > Hi, > > Please, how can check the health of my cluster / data center using

Re: C* in multiple AWS AZ's

2018-06-29 Thread Rahul Singh
Totally agree. GPFS for the win. EC2 multi region snitch is an automation tool like Ansible or Puppet. Unless you have two orders of magnitude more servers than you do now, you don’t need it. Rahul On Jun 29, 2018, 6:18 AM -0400, kurt greaves , wrote: > Yes. You would just end up with a r

Resources for Monitoring Cassandra, Spark, Solr

2018-07-02 Thread Rahul Singh
/ This is a work in progress and I'll update this with screenshots as well as with links from other contributors. -- Rahul Singh rahul.si...@anant.us Anant Corporation

Re: Is there a plan for Feature like this in C* ?

2018-07-03 Thread Rahul Singh
Kafka Connect. Theoretically you should be able to use the Smart Cat Labs CDC Kafka producer and then use that with Kafka Connect to write to else where. Rahul On Jul 3, 2018, 11:48 AM -0400, Joshua Galbraith , wrote: > There is more info and background context on CDC here: >

Re: Is there a plan for Feature like this in C* ?

2018-07-03 Thread Rahul Singh
Some of my links related to Kafka and Cassandra http://leaves.anant.us/#!/leaf/10767?tag=cassandra,kafka Rahul On Jul 3, 2018, 11:48 AM -0400, Joshua Galbraith , wrote: > There is more info and background context on CDC here: > https://issues.apache.org/jira/browse/CASSANDRA-8844 > &

Re: Jmx_exporter CPU spike

2018-07-08 Thread Rahul Singh
How often are you polling the JMX? How much of a spike are you seeing in CPU? -- Rahul Singh rahul.si...@anant.us Anant Corporation On Jul 5, 2018, 2:45 PM -0500, rajpal reddy , wrote: > > we have Qualys security scan running causing the cpu spike. We are seeing the > CPU spike only

Re: Installation

2018-07-10 Thread Rahul Singh
turn on the new binaries, one node at a time. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Jul 9, 2018, 6:35 PM -0500, rajpal reddy , wrote: > We have our infrastructure in cloud so opted for adding new dc with tar.gz > then removed the old dc with package installation > &

Re: Jmx_exporter CPU spike

2018-07-10 Thread Rahul Singh
Nice find, Ben. I added this to my list of c* monitoring tools. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Jul 9, 2018, 8:20 PM -0500, rajpal reddy , wrote: > Thanks Ben!. will look into it > > On Jul 9, 2018, at 10:42 AM, Ben Bromhead wrote: > > > > Hi Rajp

Clarification needed on how triggers execute on batch mutations

2018-07-12 Thread Rahul Singh
the coordinator does the write management , I am expecting that regardless of whether I'm doing a logged or unlogged batch, the trigger on any given table will only be triggered once per mutated partition. Is my assumption correct? Rahul Singh Chief Executive Officer | Internet Architecture

Re: Cassandra recommended server uptime?

2018-07-17 Thread Rahul Singh
It’s likely that if you have server stability issues its because of data model or compaction strategy configurations which lead to out of memory issues or massive GC pauses. Rebooting wouldn’t solve those issues. -- Rahul Singh rahul.si...@anant.us Anant Corporation On Jul 17, 2018, 7:28 AM

Re: Bind keyspace to specific data directory

2018-07-17 Thread Rahul Singh
What’s the goal, Abdul? Is it for security reasons or for organizational reasons. You could try prefixing / suffixing the keyspace names if its for organizational reasons (For now) if you don’t want to do the manual management of mounts as Anthony suggested . -- Rahul Singh rahul.si

Re: Cassandra Repair

2018-07-17 Thread Rahul Singh
less than your shortest GC grace seconds. So if you have a GC of 10 days, you want to complete your repairs in 9 days… -- Rahul Singh rahul.si...@anant.us Anant Corporation On Jul 16, 2018, 5:15 PM -0400, rajasekhar kommineni , wrote: > Hello All, > > > I have all cluster nodes i

RE: [EXTERNAL] New cluster vs Increasing nodes to already existed cluster

2018-07-17 Thread Rahul Singh
You can make new clusters or you can isolate with datacenters that don’t have a keyspace replicated. On Jul 16, 2018, 10:41 AM -0400, Durity, Sean R , wrote: > In most cases, we separate clusters by application. This does help with > isolating problems. A bad query in one application won’t affec

Re: Cassandra node RAM amount vs data-per-node/total data?

2018-07-17 Thread Rahul Singh
~ 128GB. The lowest I’ve gone is 16GB but that’s for dev purposes only. -- Rahul Singh rahul.si...@anant.us https://www.anant.us/datastax Anant Corporation On Jul 17, 2018, 8:26 AM -0400, Vsevolod Filaretov , wrote: > What are general community and/or your personal experience viewpoints

Re: Cassandra Repair

2018-07-17 Thread Rahul Singh
17, 2018, at 4:45 AM, Rahul Singh > > wrote: > > > > Have you considered looking into reaper project — could save you time in > > figuring out your own strategy.  > > https://github.com/thelastpickle/cassandra-reaper > > > > Otherwise you can always do a

Re: Cassandra node RAM amount vs data-per-node/total data?

2018-07-17 Thread Rahul Singh
heapspace, so unncessary GC pressure even with G1GC … which has STW pauses … eventually. Non-response was generally due to GC pauses… (considering that Data model was good all around) On Jul 17, 2018, 10:39 AM -0400, Vsevolod Filaretov , wrote: > @Rahul Singh thank you for the answer! > >

Re: apache cassandra development process and future

2018-07-18 Thread Rahul Singh
acle now has a Datastax offering 3. Mesosphere offers supported versions of Cassandra and Datastax 4. Kubernetes and related purveyors use Cassandra as prime example as a part of a Kubernetes backed cloud agnostic orchestration framework 5. What Alain mentioned earlier. -- Rahul Singh rahul.si...@

Re: cassandro nodes restarts

2018-07-26 Thread Rahul Singh
Do the same nodes reboot or is it arbitrary? I’m wondering if it’s an isolated incident related to dat / traffic skew or could happen on any coordinator Rahul On Jul 26, 2018, 12:31 AM -0400, Jeff Jirsa , wrote: > It’s a warning, but probably not causing you problems > > A 20kB batch

Re: Infinite loop of single SSTable compactions

2018-07-26 Thread Rahul Singh
Few questions What is your maximumcompactedbytes across the cluster for this table ? What’s your TTL ? What does your data model look like as in what’s your PK? Rahul On Jul 25, 2018, 1:07 PM -0400, James Shaw , wrote: > nodetool compactionstats  --- see compacting which table > no

Work in Progress - Bringing it all together in one "Awesome Cassandra" README

2018-07-26 Thread Rahul Singh
computing (e.g. Kafka, Spark, Akka, Kubernetes, etc.) . I've got about ~120 or so resources organized in this Readme, and I have a queue of another 100 or so. Please feel free to send me any focused Cassandra blogs related to development, architecture, or devops. Thanks, Rahul Singh Chi

Re: Cassandra crashes after loading data with sstableloader

2018-07-29 Thread Rahul Singh
What does “hash” Data look like? Rahul On Jul 24, 2018, 11:30 AM -0400, Arpan Khandelwal , wrote: > I need to clone data from one keyspace to another keyspace. > We do it by taking snapshot of keyspace1 and restoring in keyspace2 using > sstableloader. > > Suppose we have follo

Re: cassandro nodes restarts

2018-07-29 Thread Rahul Singh
-spark-solr-performance/ I assembled that list recently — I would even add that getting system logs into ELK or Splunk could also show some patterns otherwise not detected tailing and gripping. Rahul On Jul 26, 2018, 10:20 AM -0400, R1 J1 , wrote: > Thanks for your prompt replies. No the same n

Re: optimization to cassandra-env.sh

2018-07-29 Thread Rahul Singh
://product.hubspot.com/blog/g1gc-fundamentals-lessons-from-taming-garbage-collection Rahul On Jul 26, 2018, 1:27 PM -0400, R1 J1 , wrote: > Any one has tried to optimize or change cassandra-env.sh in an server > installation to make it use more heap size for garbage collection ? > Any ideas

Re: Data model storage optimization

2018-07-29 Thread Rahul Singh
a storage optimization problem - it’s a data architecture problem. Rahul On Jul 28, 2018, 3:11 AM -0400, onmstester onmstester , wrote: > The current data model described as table name: > ((partition_key),cluster_key),other_column1,other_column2,... > > user_by_name: ((time_bucket, u

Secure data

2018-07-31 Thread Rahul Reddy
Hello, I'm trying to find a good document on to enable encryption for Apache Cassandra (not on dse) tables and commilogs and store the keystore in kms or vault. If any of you already configured please direct me to documentation for it.

Re: Secure data

2018-08-01 Thread Rahul Reddy
Hello, Any one tried aws ec2 volume encryption for Cassandra instances? On Tue, Jul 31, 2018, 12:25 PM Rahul Reddy wrote: > Hello, > > I'm trying to find a good document on to enable encryption for Apache > Cassandra (not on dse) tables and commilogs and store the keystore

Re: Secure data

2018-08-01 Thread Rahul Reddy
;> >>> On Wed, Aug 1, 2018 at 12:36 PM Jeff Jirsa wrote: >>> >>>> EBS encryption worked well on gp2 volumes (never tried it on any others) >>>> >>>> -- >>>> Jeff Jirsa >>>> >>>> >>>> On Aug 1, 2018, at

Re: Huge daily outbound network traffic

2018-08-07 Thread Rahul Singh
Are you sure you don’t have an outside process that is doing an export , Spark job, non AWS managed backup process ? Is this network out from Cassandra or from the network? Rahul On Aug 7, 2018, 4:09 AM -0400, Behnam B.Marandi , wrote: > Hi, > I have a 3 node Cassandra cluster (version

Re: Hinted Handoff

2018-08-07 Thread Rahul Singh
need for multi-dc global replication and redundancy not quite available at the same level of uptime as in dist. Coaches like REDIS. Rahul On Aug 7, 2018, 1:19 AM -0400, kurt greaves , wrote: > > Does Cassandra TTL out the hints after max_hint_window_in_ms? From my > > understandin

Re: ETL options from Hive/Presto/s3 to cassandra

2018-08-07 Thread Rahul Singh
think about leveraging Kafka as the transport layer and using Kafka Connect. It brings other tooling to get data into Cassandra from a variety of sources. Rahul On Aug 6, 2018, 3:16 PM -0400, srimugunthan dhandapani , wrote: > Hi all, > We have data that gets filled into Hive/ presto  eve

Re: Repair daily refreshed table

2018-08-18 Thread Rahul Singh
Are you loading using a batch process? What’s the frequency of the data Ingest and does it have to very fast. If not too frequent and can be a little slower, you may consider a higher consistency to ensure data is on replicas. Rahul On Aug 18, 2018, 2:29 AM -0700, Maxim Parkachov , wrote: >

Re: Repair daily refreshed table

2018-08-18 Thread Rahul Singh
deleting and then loading? That’s the only way you should see tombstones — or maybe you are setting nulls? Rahul On Aug 18, 2018, 11:16 PM -0700, Maxim Parkachov , wrote: > Hi Rahul, > > I'm already using LOCAL_QUORUM in batch process and it runs every day. As far > as I unders

Re: Work in Progress - Awesome Cassandra Resources w/ Outline

2018-08-22 Thread Rahul Singh
se. Thanks for the suggestion! Rahul On Aug 9, 2018, 3:55 AM -0500, Horia Mocioi , wrote: > Hello Rahul, > > Great compilation of resources. > > Maybe add this one on the Blogs category? https://lostechies.com/ryansv > ihla/tags > > This one is also quite

Re: A blog about Cassandra in the IoT arena

2018-08-23 Thread Rahul Singh
collocates with C*) or DataStax (index in Solr collocated with C*) I personally haven’t used SnappyData but that’s another Spark based DB that could be leveraged for performance real-time queries on the OLTP side. Rahul On Aug 23, 2018, 2:48 AM -0500, Affan Syed , wrote: > Hi, > > w

Re: A blog about Cassandra in the IoT arena

2018-08-23 Thread Rahul Singh
tombstones all the time because they don’t know better. Rahul On Aug 23, 2018, 11:50 AM -0500, DuyHai Doan , wrote: > As I used to tell some people, the day we make : > > 1. partition size unlimited, or at least huge partition easily manageable > (compaction, repair, streaming, partitio

Re: 90million reads

2018-08-23 Thread Rahul Singh
Agreed. If your data model is good and no major read latencies due to little or no data skew, wide partitions, or tombstones, you can literally scale linearly. You could also consider having a plan in which you ramp up as the traffic increases. Rahul Singh Chief Executive Officer m

Re: Cassandra 2.2.7 Compaction after Truncate issue

2018-08-23 Thread Rahul Singh
David , What CL do you set when running this command? Rahul Singh Chief Executive Officer m 202.905.2818 Anant Corporation 1010 Wisconsin Ave NW, Suite 250 Washington, D.C. 20007 We build and manage digital business technology platforms. On Aug 14, 2018, 11:49 AM -0500, David Payne , wrote

Re: Fwd: Removing Extra Spaces and Row counts while using Capture Command

2018-08-23 Thread Rahul Singh
What’s your goal? Just output the results and save as JSON? There may be a better way to do what you want. https://github.com/tenmax/cqlkit/blob/master/README.md Rahul Singh Chief Executive Officer m 202.905.2818 Anant Corporation 1010 Wisconsin Ave NW, Suite 250 Washington, D.C. 20007 We

Re: How to rename the column name in Cassandra tables

2018-08-23 Thread Rahul Singh
static column, since you cannot use a static column in the table's primary key. Rahul Singh Chief Executive Officer m 202.905.2818 Anant Corporation 1010 Wisconsin Ave NW, Suite 250 Washington, D.C. 20007 We build and manage digital business technology platforms. On Aug 13, 2018, 7:42 AM

Re: Tombstone experience

2018-08-24 Thread Rahul Singh
set a short TTL that you would have deleted and that will eventually clear our data depending on the value you set. My suggestion for those cases where you must do business rules deletions, use a continuous spark job / Spark streaming on another DC to maintain data hygiene. Rahul Singh Chief

RE: [EXTERNAL] Re: Re: bigger data density with Cassandra 4.0?

2018-08-29 Thread Rahul Singh
YugaByte is also another new dancer in the Cassandra dance. The data store is based on RocksDB — and it’s written in C++. Although they ar wire compliant with c* I’m pretty are everything under the hood is NOT a port like Scylla was initially. Rahul Singh Chief Executive Officer m 202.905.2818

Re: A blog about Cassandra in the IoT arena

2018-08-29 Thread Rahul Singh
one level deeper. Maybe it’s another compaction strategy that evenly distributes data by either threshold of size or maintain a certain number of sstables. Don’t have any ideas yet on anything better than Merkle trees. Will get back to you with ideas or code. Good stuff. Rahul On Aug 24, 2018

Datastax encryption with kms

2018-09-04 Thread Rahul Reddy
Hello, Has anyone done the dse cassandra stable/ commitlog tde encryption saving the keys in kms or vault instead of kmip. If it's possible please do let me know

Re: Datastax encryption with kms

2018-09-04 Thread Rahul Singh
This is a Cassandra user group — consider joining the Datastax Academy Slack group and asking there. Rahul Singh Chief Executive Officer m 202.905.2818 Anant Corporation 1010 Wisconsin Ave NW, Suite 250 Washington, D.C. 20007 We build and manage digital business technology platforms. On Sep 4

Re: [EXTERNAL] Regarding migrating data from Oracle to Cassandra.migrate data from Oracle to Cassandra.

2018-09-05 Thread Rahul Singh
providing. Rahul Singh Chief Executive Officer m 202.905.2818 Anant Corporation 1010 Wisconsin Ave NW, Suite 250 Washington, D.C. 20007 We build and manage digital business technology platforms. On Sep 5, 2018, 10:47 AM -0500, Jeff Jirsa , wrote: > All of  Sean's points are good, a

Re: [EXTERNAL] Regarding migrating data from Oracle to Cassandra.migrate data from Oracle to Cassandra.

2018-09-05 Thread Rahul Singh
Look here for some “migration” or data modeling articles. https://anant.github.io/awesome-cassandra/ Rahul Singh Chief Executive Officer m 202.905.2818 Anant Corporation 1010 Wisconsin Ave NW, Suite 250 Washington, D.C. 20007 We build and manage digital business technology platforms. On Sep 5

Re: Using CDC Feature to Stream C* to Kafka (Design Proposal)

2018-09-10 Thread Rahul Singh
you’ll want to do what Jon suggested and source the event from Kafka for all subsequent processes rather than process in Cassandra and the create the event in Kafka. Rahul Singh Chief Executive Officer m 202.905.2818 Anant Corporation 1010 Wisconsin Ave NW, Suite 250 Washington, D.C. 20007 We

Re: Using CDC Feature to Stream C* to Kafka (Design Proposal)

2018-09-10 Thread Rahul Singh
on both clusters / DBS. All that means is that I need to sequence the change before it happens so I can predictably ensure it’s Scheduled for write / Mutation. So I’m Back to square one: having a definitive queue / ledger separate from the individual commit log of the cluster. Rahul Singh

Re: Regarding migrating data from Oracle to Cassandra.migrate data from Oracle to Cassandra.

2018-09-10 Thread Rahul Singh
Look into Kafka Connect. It does tracking internally in a topic. Works better going from relational to Cassandra. Still won’t fix your potential data model issue related to skew and wide partitions. Rahul Singh Chief Executive Officer m 202.905.2818 Anant Corporation 1010 Wisconsin Ave NW

Re: Using CDC Feature to Stream C* to Kafka (Design Proposal)

2018-09-11 Thread Rahul Singh
. Have used it several times. Cassandra is a great place to store data in transport. Rahul On Sep 10, 2018, 5:21 PM -0400, DuyHai Doan , wrote: > Also using Calvin means having to implement a distributed monotonic sequence > as a primitive, not trivial at all ... > > > On Mon, Se

Re: Scrub a single SSTable only?

2018-09-11 Thread Rahul Singh
What’s the RF for that data ? If you can manage downtime one node I’d recommend just bringing it down, and then repairing after you delete the bad file and bring it back up. Rahul Singh Chief Executive Officer m 202.905.2818 Anant Corporation 1010 Wisconsin Ave NW, Suite 250 Washington, D.C

Re: Cassandra system table diagram

2018-09-21 Thread Rahul Singh
I think his question was related specifically to the system tables. KDM is a good tool for designing the tables but not necessarily for viewing the system tables. Abdul, try out a tool called DB Schema Visualizer. It supports Cassandra Rahul Singh Chief Executive Officer m 202.905.2818 Anant

Re: Cassandra repair in different version

2018-09-21 Thread Rahul Singh
Is there a reason why these versions are so different ? I would recommend bringing 3.0.6 to 3.0.13 before doing cluster wise commands. Rahul Singh Chief Executive Officer m 202.905.2818 Anant Corporation 1010 Wisconsin Ave NW, Suite 250 Washington, D.C. 20007 We build and manage digital

Re: Optimizing for connections

2018-12-20 Thread Rahul Singh
See inline Rahul Singh Chief Executive Officer m 202.905.2818 Anant Corporation 1010 Wisconsin Ave NW, Suite 250 Washington, D.C. 20007 We build and manage digital business technology platforms. On Dec 9, 2018, 2:02 PM -0500, Devaki, Srinivas , wrote: > Hi Guys, > > Have a couple of

Re: Alter table

2018-12-20 Thread Rahul Singh
If you use collections such as a map you could get by with just upserts. A collection in a column gives you the ability to have “flexible” schema for your “documents” as in mongo while the regular fields can act as “records” as in a more Traditional table. Rahul Singh Chief Executive Officer m

Re: C* as fluent data storage, 10MB/sec/node?

2018-12-20 Thread Rahul Singh
Agree with JEFF in twcs. Also look At https://github.com/paradoxical-io/cassieq for reference. Good ideas for a queue on Cassandra. Rahul Singh Chief Executive Officer m 202.905.2818 Anant Corporation 1010 Wisconsin Ave NW, Suite 250 Washington, D.C. 20007 We build and manage digital business

Sub range repair

2019-01-01 Thread Rahul Reddy
Hello, Is it possible to find subrange needed for repair in Apache Cassandra like dse which uses dsetool list_subranges like below doc https://docs.datastax.com/en/archived/datastax_enterprise/4.8/datastax_enterprise/srch/srchRepair.html?hl=repair

Maximum memory usage

2019-02-06 Thread Rahul Reddy
Hello, I see maximum memory usage alerts in my system.log couple of times in a day as INFO. So far I haven't seen any issue with db. Why those messages are logged in system.log do we have any impact for reads/writes with those warnings? And what nerd to be looked INFO [RMI TCP Connection(170917)

  1   2   3   4   >