Re: Any experience of 20 node mini-itx cassandra cluster

2013-04-15 Thread Jabbar Azam
I know the SSD's are a bit small but they should be enough for our application. Out test data is 1.6 TB(including replication of rf=3). Can't we use LCS? This will give us more space at the expensive of more I/O but SSD's have loads of I/Os. Thanks Jabbar Azam On 14 April 2013 20:20, Jabbar

Re: running cassandra on 8 GB servers

2013-04-15 Thread Nikolay Mihaylov
Just a small update here currently running on one node with 7 GB heap and no JNA all defaults except the heap, and everything looks OK. On Sun, Apr 14, 2013 at 9:10 PM, aaron morton wrote: > Hmmm, what is the recommendation for a 10G network if 1G was 300G to > 500GŠI am guessing I can't do 10 ti

Re: CQL3 And ReversedTypes Question

2013-04-15 Thread Gareth Collins
Added: https://issues.apache.org/jira/browse/CASSANDRA-5472 thanks, Gareth On Sun, Apr 14, 2013 at 2:33 PM, aaron morton wrote: > Bad Request: Type error: > org.apache.cassandra.cql3.statements.Selection$SimpleSelector@1e7318cannot be > passed as argument 0 of function dateof of type timeuuid

Re: Problems with shuffle

2013-04-15 Thread Richard Low
On 14 April 2013 00:56, Rustam Aliyev wrote: > Just a followup on this issue. Due to the cost of shuffle, we decided > not to do it. Recently, we added new node and ended up in not well balanced > cluster: > > Datacenter: datacenter1 > === > Status=Up/Down > |/ State=Normal/L

StatusLogger format?

2013-04-15 Thread William Oberman
Mainly the: "ColumnFamilyMemtable ops,data" section. Is data in bytes/kb/mb/etc? Example line: StatusLogger.java (line 116) civicscience.sessions4963,1799916 Thanks!

Re: Extracting data from SSTable files with MapReduce

2013-04-15 Thread Jasper K.
Hi Aaron, I did try to upgrade to 1.2 but it did not work out. Maybe to many versions in between. Why would later formats make this easier you think? Jasper 2013/4/14 aaron morton > The SSTable files are in the -f- format from 0.8.10. > > If you can upgrade to the latest version it will mak

Re: StatusLogger format?

2013-04-15 Thread William Oberman
99% sure it's in bytes. On Mon, Apr 15, 2013 at 11:25 AM, William Oberman wrote: > Mainly the: > "ColumnFamilyMemtable ops,data" > section. > > Is data in bytes/kb/mb/etc? > > Example line: > StatusLogger.java (line 116) civicscience.sessions4963,1799916 > > Thanks! >

Re: Thrift message length exceeded

2013-04-15 Thread Lanny Ripple
A bump to say I found this http://stackoverflow.com/questions/15487540/pig-cassandra-message-length-exceeded so others are seeing similar behavior. From what I can see of org.apache.cassandra.hadoop nothing has changed since 1.1.5 when we didn't see such things but sure looks like there's a

Re: Cassandra 1.2.2 cluster + raspberry

2013-04-15 Thread murat migdisoglu
Hi Aaron, Thank you for your support. It was my mistake indeed. The second node was still configured to have the internode comm to be compressed. After I fixed it, I'm able to start my cluster. Cheers On Thu, Apr 11, 2013 at 12:40 PM, aaron morton wrote: > I've already tried to set internode

Re: Vnodes - HUNDRED of MapReduce jobs

2013-04-15 Thread Alicia Leong
Hi cem , In your previous reply, you mentioned that you have a simple solution. Can you share with us :) Thanks in advance. On Sat, Mar 30, 2013 at 2:33 AM, Edward Capriolo wrote: > It should be easy to control the number of map tasks. > http://wiki.apache.org/hadoop/HowManyMapsAndReduces. It

Re: Does Memtable resides in Heap?

2013-04-15 Thread Jay Svc
Thanks Vitor, So as per recommendation its only efficient when heap size is below 8GB. How about when we have more RAM, does that rest of the RAM can be left for OS to make use? How about the bloom filter and index samples, are they part of off-heap? Thank you for your response. Regards, Jay

Re: Does Memtable resides in Heap?

2013-04-15 Thread Michal Michalski
How about the bloom filter and index samples, are they part of off-heap? Starting from C* 1.2 bloom filters are stored off-heap. Index samples are stored on heap. M.