The reason for multiple keys (and, by extension, multiple columns) is to better 
distribute the write/read load across the cluster as keys will (hopefully) be 
distributed on different nodes. This helps to avoid hot spots.

Hope this helps,

-Jason Brown
Netflix
________________________________
From: Sávio Teles [savio.te...@lupa.inf.ufg.br]
Sent: Monday, January 21, 2013 9:51 AM
To: user@cassandra.apache.org
Subject: Re: How to store large columns?

Astyanax split large objects into multiple keys. Is it a good idea? It is 
better to split into multiple columns?

Thanks

2013/1/21 Sávio Teles 
<savio.te...@lupa.inf.ufg.br<mailto:savio.te...@lupa.inf.ufg.br>>

Thanks Keith Wright.


2013/1/21 Keith Wright <kwri...@nanigans.com<mailto:kwri...@nanigans.com>>
This may be helpful:  
https://github.com/Netflix/astyanax/wiki/Chunked-Object-Store

From: Vegard Berget <p...@fantasista.no<mailto:p...@fantasista.no>>
Reply-To: "user@cassandra.apache.org<mailto:user@cassandra.apache.org>" 
<user@cassandra.apache.org<mailto:user@cassandra.apache.org>>, Vegard Berget 
<p...@fantasista.no<mailto:p...@fantasista.no>>
Date: Monday, January 21, 2013 8:35 AM
To: "user@cassandra.apache.org<mailto:user@cassandra.apache.org>" 
<user@cassandra.apache.org<mailto:user@cassandra.apache.org>>
Subject: Re: How to store large columns?



Hi,

You could split it into multiple columns on the client side:
RowKeyData: Part1: [1mb], Part2: [1mb], Part3: [1mb]...PartN[1mb]

Now you can use multiple get() in parallell to get the files back and then join 
them back to one file.

I _think_ maybe the new CQL3-protocol does not have the same limitation, but I 
have never tried large columns there, so someone with more experience than me 
will have to confirm this.

.vegard,

----- Original Message -----
From:
user@cassandra.apache.org<mailto:user@cassandra.apache.org>

To:
<user@cassandra.apache.org<mailto:user@cassandra.apache.org>>
Cc:

Sent:
Mon, 21 Jan 2013 11:16:40 -0200
Subject:
How to store large columns?


We wish to store a column in a row with size larger than 
thrift_framed_transport_size_in_mb. But, Thrift has a maximum frame size 
configured by thrift_framed_transport_size_in_mb in cassandra.yaml.
so, How to store columns with size larger than 
thrift_framed_transport_size_in_mb? Increasing this value does not solve the 
problem, since we have columns with varying sizes.

--
Atenciosamente,
Sávio S. Teles de Oliveira
voice: +55 62 9136 6996<tel:%2B55%2062%209136%206996>
http://br.linkedin.com/in/savioteles
Mestrando em Ciências da Computação - UFG
Arquiteto de Software
Laboratory for Ubiquitous and Pervasive Applications (LUPA) - UFG



--
Atenciosamente,
Sávio S. Teles de Oliveira
voice: +55 62 9136 6996<tel:%2B55%2062%209136%206996>
http://br.linkedin.com/in/savioteles
Mestrando em Ciências da Computação - UFG
Arquiteto de Software
Laboratory for Ubiquitous and Pervasive Applications (LUPA) - UFG



--
Atenciosamente,
Sávio S. Teles de Oliveira
voice: +55 62 9136 6996
http://br.linkedin.com/in/savioteles
Mestrando em Ciências da Computação - UFG
Arquiteto de Software
Laboratory for Ubiquitous and Pervasive Applications (LUPA) - UFG

Reply via email to