Thanks for the suggestion.

I somewhat understand all that. The point where my head begins to explode
is when I want to figure out something like the following.

Continuing with your example: "Over the last X days, give me all
the logs for remote_addr:XXX".
I'm guessing I would need to create a separate index ColumnFamily?
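
To make the question concrete, here is roughly what I imagine such an index
would look like. This is only a sketch of my guess, using plain Python dicts
in place of real ColumnFamilies; it does not use any Cassandra client API.
The name log_by_remote_addr, the "addr:YYYYMMDD" row key format, and both
helper functions are made up for illustration.

```python
import json
import uuid
from collections import defaultdict
from datetime import date, timedelta

# Plain dicts standing in for two column families (hypothetical layout).
# log_records:        row key "YYYYMMDD"      -> {TimeUUID: JSON value}
# log_by_remote_addr: row key "addr:YYYYMMDD" -> {TimeUUID: ""}
log_records = defaultdict(dict)
log_by_remote_addr = defaultdict(dict)


def insert_log(day, record):
    """Write the record once, plus one index column pointing back at it."""
    col = uuid.uuid1()  # time-based UUID, standing in for a TimeUUID column name
    key = day.strftime("%Y%m%d")
    log_records[key][col] = json.dumps(record)
    log_by_remote_addr[f"{record['remote_addr']}:{key}"][col] = ""
    return col


def logs_for_addr(addr, today, days):
    """Collect records for addr over the last `days` days, newest day first."""
    out = []
    for offset in range(days):
        key = (today - timedelta(days=offset)).strftime("%Y%m%d")
        # One index row per (addr, day); each column name is a key into log_records.
        for col in log_by_remote_addr.get(f"{addr}:{key}", {}):
            out.append(json.loads(log_records[key][col]))
    return out
```

So every write would go to two places, and a query for X days would read X
index rows and then fetch the matching columns from LogRecords. Is that the
right idea?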

On Fri, Aug 6, 2010 at 4:32 PM, Thomas Heller <i...@zilence.net> wrote:

> Howdy,
>
> thought I'd jump in here. I did something similar, meaning I had lots of
> items coming in per day and wanted to somehow partition them to avoid
> running into the column limit (it was also logging related). The solution
> was pretty simple: log data is immutable, so no SuperColumn is needed.
>
> ColumnFamily Standard: LogRecords, CompareWith=TimeUUIDType
>
> Row Key "20100806":
>  Column Name: TimeUUID.new Value: JSON({'remote_addr':...,
> 'user_agent':...., 'url':....})
>  ..., more Columns
>
> In my case I chose to "partition" by day. If you are getting too many
> columns per day, just add the hour to the row key. If you want an extra
> separation level (foo, bar) as in your example, you could either go for a
> SuperColumn or just adjust your row key accordingly (e.g.
> "foo:20100806").
>
> HTH,
> /thomas
>
