Re: data size difference between supercolumn and regular column

2012-04-01 Thread Jeremiah Jordan
Is that 80% with compression? If not, the first thing to do is turn on compression. Cassandra doesn't behave well when it runs out of disk space. You really want to try and stay around 50%, 60-70% works, but only if it is spread across multiple column families, and even then you can run into

Re: data size difference between supercolumn and regular column

2012-04-01 Thread Yiming Sun
Thanks Aaron. Well I guess it is possible the data files from sueprcolumns could've been reduced in size after compaction. This bring yet another question. Say I am on a shoestring budget and can only put together a cluster with very limited storage space. The first iteration of pushing data in

Re: import

2012-04-01 Thread Maxim Potekhin
Since Python has a native csv module, it's trivial to achieve. I load lots of csv data into my database daily. Maxim On 3/27/2012 11:44 AM, R. Verlangen wrote: You can write your own script to parse the excel file (export as csv) and import it with batch inserts. Should be pretty easy if you

Re: Is the wiki outdated regarding Hive support?

2012-04-01 Thread Ben McCann
Oh, that's fantastic! Thanks so much for the quick response! On Sun, Apr 1, 2012 at 4:21 PM, Jake Luciani wrote: > Hi Ben. That is still the repo. The code that ships with latest DSE is the > hive-0.8.1-merge branch. > > We will try to get this into the Cassandra trunk asap. > > Jake > > > > O

Re: Is the wiki outdated regarding Hive support?

2012-04-01 Thread Jake Luciani
Hi Ben. That is still the repo. The code that ships with latest DSE is the hive-0.8.1-merge branch. We will try to get this into the Cassandra trunk asap. Jake On Apr 1, 2012, at 6:39 PM, Ben McCann wrote: > The wiki says "Hive support is currently a standalone project but will become >

Is the wiki outdated regarding Hive support?

2012-04-01 Thread Ben McCann
The wiki says "Hive support is currently a standalone project but will become part of the main Cassandra source tree in the future. See https://github.com/riptano/hivefor details." This seems outdated to me since Datastax isn't planning any future u

Re: cassandra gui

2012-04-01 Thread Brian O'Neill
If you give Virgil a try, let me know how it goes. The REST layer is pretty solid, but the gui is just a PoC which makes it easy to see what's in the CFs during development/testing. (It's only a couple hundred lines of ExtJS code built on the REST layer) We had plans to add CQL to the gui for CRUD