Rahul, in my opinion SASI is an experimental feature and isn't ready for
primetime yet. It has some advantages over secondary indexes but if it were
me, I'd stick with native secondary indexes. But test, test and test so you
can make an informed decision on what works for your use case. Cheers!
Er
Hello,
We have a requirement to query based on a column which is not part of
priamry key. We are planning to create sasi index . Before creating index
want to confirm how would be index behaviour if it has 10% of data will be
null and data distribution across the nodes 6-6 nodes wth RF 3? Please
Thank you for suggestion. Full refresh is currently designed because with delta
we cannot identify what got deleted. So downstreams prefer full data everyday.
Thanks
Bibin John
From: Reid Pinchback
Sent: Wednesday, February 19, 2020 3:14 PM
To: user@cassandra.apache.org
Subject: Re: Mechanism
To the question of ‘best approach’, so far the comments have been about
alternatives in tools.
Another axis you might want to consider is from the data model viewpoint. So,
for example, let’s say you have 600M rows. You want to do a daily transfer of
data for some reason. First question that
John,
copy is not recommended for than 2 millions rows, so copy is ruled out in your
case for those 30 tables you mentioned.
Sincerely,
Aakash Pandhi
On Wednesday, February 19, 2020, 02:26:15 PM CST, Amanda Moran
wrote:
HI there-
DataStax recently released their bulkloader into the
HI there-
DataStax recently released their bulkloader into the OSS community.
I would take a look and at least try it out:
https://docs.datastax.com/en/dsbulk/doc/dsbulk/dsbulkAbout.html
Good luck!
Amanda
On Wed, Feb 19, 2020 at 12:10 PM JOHN, BIBIN wrote:
> Thanks for the response. We need
Thanks for the response. We need to export into a flat file and send to another
analytical application. There are 137 tables and 30 of them are have 300M+
records. So “COPY TO” taking lot of time.
Thank you
Bibin John
From: Aakash Pandhi
Sent: Wednesday, February 19, 2020 12:51 PM
To: user@cas
John,
Greetings,
Requirement is to just export data from table and stage it somewhere? OR export
it and load them in another cluster/table? sstableloader is a utility which can
help you as it is designed for bulk loading.
Sincerely,
Aakash Pandhi
On Wednesday, February 19, 2020, 10:13:
Team,
We have a requirement to bulk export data from Cassandra on daily basis? Table
contain close to 600M records and cluster is having 12 nodes. What is the best
approach to do this?
Thanks
Bibin John