Large files can be split into small blocks, and the size of block can be
tuned. It may increase the complexity of writing such a file system, but can
be for general purpose (not only for relative small files)

On Thu, Apr 15, 2010 at 10:08 AM, Tatu Saloranta <tsalora...@gmail.com>wrote:

> On Wed, Apr 14, 2010 at 6:42 PM, Zhuguo Shi <bluefl...@gmail.com> wrote:
> > Hi,
> > Cassandra has a good distributed model: decentralized, auto-partition,
> > auto-recovery. I am evaluating about writing a file system over Cassandra
> > (like CassFS: http://github.com/jdarcy/CassFS ), but I don't know if
> > Cassandra is good at such use case?
>
> It sort of depends on what you are looking for. From use case for
> which something like S3 is good, yes, except with one difference:
> Cassandra is more geared towards lots of small files, whereas S3 is
> more geared towards moderate number of files (possibly large).
>
> So I think it can definitely be a good use case, and I may use
> Cassandra for this myself in future. Having range queries allows
> implementing directory/path structures (list keys using path as
> prefix). And you can split storage such that metadata could live in
> OPP partition, raw data in RP.
>
> -+ Tatu +-
>

Reply via email to