Re: Null values in sasi indexed column

2020-02-19 Thread Erick Ramirez
Rahul, in my opinion SASI is an experimental feature and isn't ready for primetime yet. It has some advantages over secondary indexes but if it were me, I'd stick with native secondary indexes. But test, test and test so you can make an informed decision on what works for your use case. Cheers! Er

Null values in sasi indexed column

2020-02-19 Thread Rahul Reddy
Hello, We have a requirement to query on a column which is not part of the primary key. We are planning to create a SASI index. Before creating the index, we want to confirm how the index would behave if 10% of the data is null, and how data would be distributed across the nodes (6-6 nodes with RF 3)? Please

RE: Mechanism to Bulk Export from Cassandra on daily Basis

2020-02-19 Thread JOHN, BIBIN
Thank you for the suggestion. Full refresh is the current design because with a delta we cannot identify what got deleted, so downstream consumers prefer full data every day. Thanks Bibin John From: Reid Pinchback Sent: Wednesday, February 19, 2020 3:14 PM To: user@cassandra.apache.org Subject: Re: Mechanism
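Editor's sketch (not part of the original message): the point about deltas can be illustrated by comparing the primary keys of two consecutive full exports, since a key present yesterday but absent today was deleted in between. This assumes the exports are CSV files with a header row; the function names `primary_keys` and `deleted_since` and the column name `id` are hypothetical, and holding every key in memory may not be practical for tables with 300M+ rows.

```python
import csv
import io


def primary_keys(csv_text, key_columns):
    """Return the set of primary-key tuples found in a CSV export (header row required)."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return {tuple(row[col] for col in key_columns) for row in reader}


def deleted_since(previous_keys, current_keys):
    """Keys that were in the previous export but are missing from the current one."""
    return previous_keys - current_keys


# Two tiny stand-in exports; real exports would be read from files.
yesterday = "id,name\n1,a\n2,b\n3,c\n"
today = "id,name\n1,a\n3,c\n4,d\n"

removed = deleted_since(primary_keys(yesterday, ["id"]), primary_keys(today, ["id"]))
print(sorted(removed))
```

Note that this still requires a full export on both days; it only shows how deletions could be derived from full snapshots, not how to avoid taking them.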

Re: Mechanism to Bulk Export from Cassandra on daily Basis

2020-02-19 Thread Reid Pinchback
To the question of ‘best approach’, so far the comments have been about alternatives in tools. Another axis you might want to consider is from the data model viewpoint. So, for example, let’s say you have 600M rows. You want to do a daily transfer of data for some reason. First question that

Re: Mechanism to Bulk Export from Cassandra on daily Basis

2020-02-19 Thread Aakash Pandhi
John, COPY is not recommended for more than 2 million rows, so COPY is ruled out in your case for those 30 tables you mentioned. Sincerely, Aakash Pandhi On Wednesday, February 19, 2020, 02:26:15 PM CST, Amanda Moran wrote: HI there- DataStax recently released their bulkloader into the

Re: Mechanism to Bulk Export from Cassandra on daily Basis

2020-02-19 Thread Amanda Moran
HI there- DataStax recently released their bulkloader into the OSS community. I would take a look and at least try it out: https://docs.datastax.com/en/dsbulk/doc/dsbulk/dsbulkAbout.html Good luck! Amanda On Wed, Feb 19, 2020 at 12:10 PM JOHN, BIBIN wrote: > Thanks for the response. We need

RE: Mechanism to Bulk Export from Cassandra on daily Basis

2020-02-19 Thread JOHN, BIBIN
Thanks for the response. We need to export into a flat file and send it to another analytical application. There are 137 tables and 30 of them have 300M+ records, so “COPY TO” is taking a lot of time. Thank you Bibin John From: Aakash Pandhi Sent: Wednesday, February 19, 2020 12:51 PM To: user@cas

Re: Mechanism to Bulk Export from Cassandra on daily Basis

2020-02-19 Thread Aakash Pandhi
John, Greetings, Is the requirement just to export data from the table and stage it somewhere, or to export it and load it into another cluster/table? sstableloader is a utility which can help you as it is designed for bulk loading. Sincerely, Aakash Pandhi On Wednesday, February 19, 2020, 10:13:

Mechanism to Bulk Export from Cassandra on daily Basis

2020-02-19 Thread JOHN, BIBIN
Team, We have a requirement to bulk export data from Cassandra on a daily basis. The table contains close to 600M records and the cluster has 12 nodes. What is the best approach to do this? Thanks Bibin John