Re: Run Repairs when a Node is Down

2016-01-20 Thread Anuj Wadehra
Thanks Paulo for sharing the JIRA !! I have added my comments there. "It is not advisable to remain with a down node for a long time without replacing it (with risk of not being able to achieve consistency if another node goes down)." I am referring to a generic scenario where a cluster may affor

Re: solr Textsearch in dse 4.8.3

2016-01-20 Thread Jack Krupansky
Solr string fields (solr.StrField) are not tokenized, so they do not have analyzers. That means they will be case-sensitive. Regex is introduced with slash characters, so you appear to be performing a wildcard query instead. Typically, you would make a copy of such a Cassandra text field (using )

Re: Using cassandra a BLOB store / web cache.

2016-01-20 Thread Jack Krupansky
I think the requirement was stated that old versions will be kept, which is consistent with Cassandra and the LSM data model - it would avoid the need for compactions of the actual chunked blob data. Throughput mostly comes down to adequately provisioning your cluster. -- Jack Krupansky On Wed,

Re: Run Repairs when a Node is Down

2016-01-20 Thread Anuj Wadehra
Thanks Paulo for sharing the JIRA !! I have added my comments there. "It is not advisable to remain with a down node for a long time without replacing it (with risk of not being able to achieve consistency if another node goes down)." I am referring to a generic scenario where a cluster may affor

Re: Using cassandra a BLOB store / web cache.

2016-01-20 Thread Mohit Anchlia
The answer to this questions is very much dependent on the throughput, desired latency and access patters (R/W or R/O)? In general what I have seen working for high throughput environment is to either use a distributed file system like Ceph/Gluster or object store like S3 and keep the pointer in th

Re: Using cassandra a BLOB store / web cache.

2016-01-20 Thread Kevin Burton
There's also the 'support' issue.. C* is hard enough as it is... maybe you can bring in another system like ES or HDFS but the more you bring in the more your complexity REALLY goes through the roof. Better to keep things simple. I really like the chunking idea for C*... seems like an easy way to