Hi,
Which version of Cassandra should be considered the most stable in version 3?
I see two main branches: the branch with version 3.0.* and the tick-tock one,
3.*.*.
So basically my question is: which one is more stable, version 3.0.5 or version
3.3?
I know odd versions in tick-tock are bug-fix releases.
At this point I have no additional advice to offer. There seems to be
intense resistance to follow the modeling approach I have recommended, so
there is nothing more I can offer on that front. The bottom line is that if
the techniques referenced in the blog post are not sufficient, then nothing
sho
Check out the text indexing support in the new SASI feature in Cassandra
3.4. You could write a custom tokenizer to extract entities and then be
able to query for documents that contain those entities.
That said, using a SHA digest key for the primary key has merit for direct
access to the documen
unsubscribe
Disclaimer: This message and the information contained herein is proprietary
and confidential and subject to the Tech Mahindra policy statement, you may
revi
S3 maybe?
On Mon, Apr 11, 2016 at 7:05 PM Robert Wille wrote:
> I do realize its kind of a weird use case, but it is legitimate. I have a
> collection of documents that I need to index, and I want to perform entity
> extraction on them and give the extracted entities special treatment in my
> ful
I do realize it's kind of a weird use case, but it is legitimate. I have a
collection of documents that I need to index, and I want to perform entity
extraction on them and give the extracted entities special treatment in my
full-text index. Because entity extraction costs money, and each documen
Check your environment variables; it looks like JAVA_HOME is not properly set.
On Mon, Apr 11, 2016 at 9:07 AM, Lokesh Ceeba - Vendor <
lokesh.ce...@walmart.com> wrote:
> Hi Team,
>
> Help required
>
>
>
> cassandra:/app/cassandra $ nodetool status
>
>
>
> Cassandra 2.0 and later requir
Hi Robert,
Why do you need the actual text as a key? It sounds a bit unnatural, at
least to me. Keep in mind that you cannot do "like" queries on keys in
Cassandra. For performance and readability, I would prefer hashing your
text and using the hash as the key.
You should also take i
On Mon, Apr 11, 2016 at 4:19 PM, Jack Krupansky
wrote:
> Some of this may depend on exactly how you are using so-called COMPACT
> STORAGE. I mean, if your tables really are modeled as all but exactly one
> column in the primary key, then okay, COMPACT STORAGE may be a reasonable
> model, but that
Why does the text need to be the key?
On Mon, Apr 11, 2016 at 6:04 PM Robert Wille wrote:
> I have a need to be able to use the text of a document as the primary key
> in a table. These texts are usually less than 1K, but can sometimes be 10’s
> of K’s in size. Would it be better to use a digest
While large primary keys (within reason) should work, IMO anytime you're
doing equality testing you are really better off minimizing the size of the
key. Huge primary keys will also have very negative impacts on your key
cache. I would err on the side of the digest, but I've never had a need for
la
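A minimal sketch of the digest-as-key idea discussed in this thread, assuming SHA-256 as the digest and a hex string as the key format (both are illustrative choices; the thread only says "a digest" or "SHA digest"). The function name `document_key` is hypothetical.

```python
import hashlib


def document_key(text: str) -> str:
    """Return a fixed-size hex digest to use as the key instead of the raw text."""
    # Encode explicitly so the same text always yields the same bytes.
    return hashlib.sha256(text.encode("utf-8")).hexdigest()


# A short document and a document tens of KB in size (illustrative data).
short_doc = "A short document."
long_doc = "word " * 10_000

short_key = document_key(short_doc)
long_key = document_key(long_doc)

# Both keys have the same length regardless of document size.
print(len(short_key), len(long_key))
```

The key size stays constant however large the document is, which is the property that matters for equality lookups; the full text would still need to be stored in a regular column.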
I have a need to be able to use the text of a document as the primary key in a
table. These texts are usually less than 1K, but can sometimes be 10’s of K’s
in size. Would it be better to use a digest of the text as the key? I have a
background process that will occasionally need to do a full ta
unscribe
Some of this may depend on exactly how you are using so-called COMPACT
STORAGE. I mean, if your tables really are modeled as all but exactly one
column in the primary key, then okay, COMPACT STORAGE may be a reasonable
model, but that seems to be a very special, narrow use case, so for all
other ca
Hello,
In a multi-DC setup (where one DC serves real-time traffic and the other DC
serves up analytical loads), is it possible to set up and restrict secondary
indexes only to the analytics DC? The intent is to not create the overhead of
the secondary index on the DC where real-time traffic is s
Since when did this become a DataStax support email list? If folks have
questions about DataStax products, shouldn't they be contacting the company
directly?
On Sun, Apr 10, 2016 at 1:13 PM Jeff Jirsa
wrote:
> It is possible to use OpsCenter for open source / community versions up to
> 2.2.x.
Thanks Jim. I think you understand the pain of migrating TBs of data to new
tables. There is no command to change from compact to non-compact storage, and
the fastest solution, migrating data using Spark, is too slow for production
systems.
And the pain gets bigger when your performance dips after
Thanks Jack. Let me rephrase and try to get some help :)
CQL tables always have a schema, but Thrift allowed you to have a CF with a mix of
statically declared columns in the schema and dynamic columns, i.e. columns not part
of the schema and created as and when needed at runtime. When you drop Thrift code
a
Team,
Can you help? How can I bring up the node listed below that
is DOWN?
[cassandra@rmtm-cassandra-db-103087499-2-111493402 ~]$ nodetool status
Datacenter: dev-cdc1
Status=Up/Down
|/ State=Normal/Leaving/Joining/Moving
--  Address  Load  Token
Jack, the Datastax link he posted (
http://www.datastax.com/dev/blog/thrift-to-cql3) says that for column
families with mixed dynamic and static columns: "The only solution to be
able to access the column family fully is to remove the declared columns
from the thrift schema altogether..." I think t
You're not mistaken, just thought you were after partition keys and didn't
read the question that carefully. Afaik, you're SOOL if you need to
distinguish clustering keys as unique. Well, other than doing a full table
scan of course, which I'm assuming is not too plausible.
On Mon, 11 Apr 2016 at
Hi Team,
Help required
cassandra:/app/cassandra $ nodetool status
Cassandra 2.0 and later require Java 7u25 or later.
cassandra:/app/cassandra $ nodetool status
Cassandra 2.0 and later require Java 7u25 or later.
cassandra:/app/cassandra $ java -version
Error occurred during initia
Unless I'm mistaken, nodetool tablestats gives you the number of partitions
(partition keys), not the number of primary keys. IOW, the term "keys" is
ambiguous. That's why I phrased the original question as count of (CQL)
rows, to distinguish from the pre-CQL3 concept of a partition being treated
a
Wouldn't the "number of keys" part of *nodetool cfstats* run on every node,
summed and divided by replication factor give you a decent approximation?
Or are you really after a completely precise number?
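The arithmetic suggested above can be sketched as follows. This is only an illustration under stated assumptions: the per-node figures are made-up numbers standing in for the "number of keys" value read from each node, and the function name `approximate_key_count` is hypothetical. As a reply in this thread points out, that figure counts partitions, not CQL rows.

```python
def approximate_key_count(keys_per_node, replication_factor):
    """Sum the per-node key estimates and divide by the replication factor."""
    if replication_factor < 1:
        raise ValueError("replication_factor must be at least 1")
    # Each partition is stored on replication_factor nodes, so the raw sum
    # counts every partition roughly that many times.
    return sum(keys_per_node) // replication_factor


# Hypothetical per-node estimates from a three-node cluster with RF = 3.
estimate = approximate_key_count([300, 310, 290], replication_factor=3)
print(estimate)
```

The result is an approximation only, since the per-node values are themselves estimates.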
On Mon, 11 Apr 2016 at 16:18 Jack Krupansky
wrote:
> Agreed, that anything requiring a full t
Scott Thompson
This message and any attached documents are only for the use of the intended
recipient(s), are confidential and may contain privileged information. Any
unauthorized review, use, retransmission, or other disclosure is strictly
prohibited. If
Sorry, but your message is too confusing - you say "reading dynamic columns
in CQL" and "make the table schema less", but neither has any relevance to
CQL! 1. CQL tables always have schemas. 2. All columns in CQL are
statically declared (even maps/collections are statically declared
columns.) Grant
Agreed, that anything requiring a full table scan, short of batch
analytics, is an antipattern, although the goal is not to do a full scan per
se, but just get the row count. It still surprises people that Cassandra
cannot quickly get COUNT(*). The easy answer: Use DSE Search and do a Solr
query for
(And what's the cost of a DSE license for DSE with Cassandra 3.x/3.5? No
fair telling people they have to wait for DSE 5.0! Or 5.x, whenever
Cassandra 3.4/3.5 will be supported.)
-- Jack Krupansky
On Mon, Apr 11, 2016 at 11:04 AM, wrote:
> For 2.2 and earlier, there are no license fees. Some fe
Cassandra is not good for table scan type queries (which count(*) typically
is). While there are some attempts to do that (as noted below), this is a path
I avoid.
Sean Durity
From: Max C [mailto:mc_cassan...@core43.com]
Sent: Saturday, April 09, 2016 6:19 PM
To: user@cassandra.apache.org
Subj
For 2.2 and earlier, there are no license fees. Some features are not available
for Apache rings, though (e.g., repair service, capacity service, etc.).
For 3.0+, OpsCenter will not monitor open source rings (as I understand it).
They decided there was too much effort to develop (and test) OpsCen
Any comments or suggestions on this one?
Thanks
Anuj
Sent from Yahoo Mail on Android
On Sun, 10 Apr, 2016 at 11:39 PM, Anuj Wadehra wrote:
Hi
We are on 2.0.14 and Thrift. We are planning to migrate to CQL soon but are facing
some challenges.
We have a cf with a mix of statically defined colum
The Cassandra team is pleased to announce the release of Apache Cassandra
version 3.0.5.
Apache Cassandra is a fully distributed database. It is the right choice
when you need scalability and high availability without compromising
performance.
http://cassandra.apache.org/
Downloads of source an
Where do you get the ~1ms latency between AZs? Comparing a short term
average to a 99th percentile isn't very fair.
"Over the last month, the median is 2.09 ms, 90th percentile is 20ms,
99th percentile
is 47ms." - per
https://www.quora.com/What-are-typical-ping-times-between-different-EC2-availabi
Hi everyone,
Last week I ran some tests to estimate the latency overhead introduced in a
Cassandra cluster by a multi-availability-zone setup on AWS EC2.
I started a Cassandra cluster of 6 nodes deployed on 3 different AZs (2
nodes/AZ).
Then, I used cassandra-stress to create an INSERT (write)
unsubscribe
Thanks, Alain, for all your answers:
- In a few days I am going to set up a maintenance window so I can
run repairs again and see what happens. I will definitely run 'iostat
-mx 5 100' at that time and also use the command you pointed out to see why it is
consuming so much power.
-