Re: Using cassandra a BLOB store / web cache.

2016-01-19 Thread Robert Coli
On Tue, Jan 19, 2016 at 2:07 PM, Richard L. Burton III wrote: > I would ask why do this over say HDFS, S3, etc. seems like this problem > has been solved with other solutions that are specifically designed for > blob storage? > HDFS's default block size is 64mb. If you are storing objects smalle

Re: Using cassandra a BLOB store / web cache.

2016-01-19 Thread Steve Robenalt
Just adding one more item to the discussion. I believe this was announced on the list some time ago. I haven't tried it out yet, just pointing it out since it's on the OP's topic: http://pithos.io/ It's a Cassandra-backed object store using an S3 API. On Tue, Jan 19, 2016 at 2:07 PM, Richard L.

Re: Using cassandra a BLOB store / web cache.

2016-01-19 Thread Richard L. Burton III
I would ask why do this over say HDFS, S3, etc. seems like this problem has been solved with other solutions that are specifically designed for blob storage? On Tue, Jan 19, 2016 at 4:23 PM, wrote: > I recently started noodling with this concept and built a working blob > storage service using n

Re: Using cassandra a BLOB store / web cache.

2016-01-19 Thread list
I recently started noodling with this concept and built a working blob storage service using node.js and C*. I setup a basic web server using the express web server where you could POST binary files to the server where they would get chunked and assigned to a user and bucket, in the spirit of S

Re: Possible to adjust tokens on a vnode cluster?

2016-01-19 Thread Eric Evans
On Tue, Jan 19, 2016 at 12:21 PM, John Sumsion wrote: > I have a 24 node cluster, with vnodes set to 256. > > > 'nodetool status ' looks like this for our keyspace: > > > UN 588.23 GB 256 11.0% > 0c8708a7-b962-4fc9-996c-617da642d9ee 1a > UN 601.33 GB 256 11.3% > 5ef60730-0b01

Re: Using cassandra a BLOB store / web cache.

2016-01-19 Thread Kevin Burton
Lots of interesting feedback.. I like the ideal of chunking the IO into pages.. it would require more thinking but I could even do cassandra async IO and async HTTP to serve the data and then use HTTP chunks for each range. On Tue, Jan 19, 2016 at 10:47 AM, Robert Coli wrote: > On Mon, Jan 18, 2

Re: Using cassandra a BLOB store / web cache.

2016-01-19 Thread Robert Coli
On Mon, Jan 18, 2016 at 6:52 PM, Kevin Burton wrote: > Internally we have the need for a blob store for web content. It's MOSTLY > key, ,value based but we'd like to have lookups by coarse grained tags. > I know you know how to operate and scale MySQL, so I suggest MogileFS for the actual blob

Possible to adjust tokens on a vnode cluster?

2016-01-19 Thread John Sumsion
I have a 24 node cluster, with vnodes set to 256. 'nodetool status ' looks like this for our keyspace: UN 588.23 GB 256 11.0% 0c8708a7-b962-4fc9-996c-617da642d9ee 1a UN 601.33 GB 256 11.3% 5ef60730-0b01-4a8b-a578-d828cdf78a1f 1b UN 613.02 GB

Re: Using cassandra a BLOB store / web cache.

2016-01-19 Thread Eric Evans
On Mon, Jan 18, 2016 at 8:52 PM, Kevin Burton wrote: > Internally we have the need for a blob store for web content. It's MOSTLY > key, ,value based but we'd like to have lookups by coarse grained tags. > > This needs to store normal web content like HTML , CSS, JPEG, SVG, etc. > > Highly doubt

Re: Connection error 61 for cassandra

2016-01-19 Thread Carlos Alonso
I ran through those issues a while ago. It was on Ubuntu rather than OSX but probably the same. I compiled my steps here: http://mrcalonso.com/fitting-ipython-notebooks-spark-and-cassandra-all-together/ Cheers! Carlos Alonso | Software Engineer | @calonso On 19 Ja

Re: broadcast_address in multi data center setups

2016-01-19 Thread Francisco Reyes
On 01/18/2016 09:44 AM, Paulo Motta wrote: broadcast_address is the address exposed for internal inter-node communication, while rpc_address is the address that will listen to clients. all nodes need to talk to each other via the broadcast_address, so if they are within the same network, you

Re: Unable to locate Solr Configuration file ( Generated using dsetool )

2016-01-19 Thread Harikrishnan A
Thanks Sebastian , Jack...This really helps Sent from Yahoo Mail on Android On Mon, Jan 18, 2016 at 11:03 AM, Sebastian Estevez wrote: You can post it to the server using either curl or dsetool: http://docs.datastax.com/en/datastax_enterprise/4.8/datastax_enterprise/srch/srchReldCore.html

[RELEASE] Apache Cassandra 3.2.1 released

2016-01-19 Thread Jake Luciani
The Cassandra team is pleased to announce the release of Apache Cassandra version 3.2.1. Apache Cassandra is a fully distributed database. It is the right choice when you need scalability and high availability without compromising performance. http://cassandra.apache.org/ Downloads of source an

Connection error 61 for cassandra

2016-01-19 Thread ankita therese
Hello, I set up a single node on localhost, and it was working fine I connected cassandra with a apache spark, and was able to access the keyspaces After this, i connected to pyspark using the datastax spark-cassandra connector Ever since then, when i try to access spark via cqlsh, all i get is

solr Textsearch in dse 4.8.3

2016-01-19 Thread anuja jain
Hi, I am using solr of dse 4.8.3 to do text search on cassandra data. On a String type column when I am use regex email:*gmail* it does not return me the data that is inserted after starting cassandra in solr mode. Infact on hitting query everytime it is returning different result. Schema.xml has f

Re: Using cassandra a BLOB store / web cache.

2016-01-19 Thread DuyHai Doan
There is also an excellent tutorial video done by Patrick McFadin and Aaron Morton on the subject of data model for storing images into Cassandra: http://youtu.be/gk-B75xgFUg I guess it can be adapted to store binary objects other than images On Tue, Jan 19, 2016 at 6:37 AM, Jack Krupansky wrote