I think there is not an extremely simple solution to your problem. You
will probably need to use multiple tables to get the view you need. One
keyed just by file UUID, which tracks some basic metadata about the file
including the last modified time. Another as a materialized view of the
most rece
t;
>
>
>
> -Original Message-
> From: y2k...@gmail.com on behalf of Jimmy Lin
> Sent: Thu 11-Jul-13 13:09
> To: user@cassandra.apache.org
> Subject: Re: data model question : finding out the n most recent changes
> items
>
> what I mean is, I really just w
-Original Message-
From: y2k...@gmail.com on behalf of Jimmy Lin
Sent: Thu 11-Jul-13 13:09
To: user@cassandra.apache.org
Subject: Re: data model question : finding out the n most recent changes items
what I mean is, I really just want the last modified date instead of series
of timestamp and still
what I mean is, I really just want the last modified date instead of series
of timestamp and still able to sort or order by it.
(maybe I should rephrase my question as how to sort or order by last
modified column in a row)
CREATE TABLE user_file (
user_id uuid,
modified_date timest
What you described this sounds like the most appropriate:
CREATE TABLE user_file (
user_id uuid,
modified_date timestamp,
file_id timeuuid,
PRIMARY KEY(user_id, modified_date)
);
If you normally need more information about the file then either store that as
addit
> Isn't kafka too young for production using purpose ?
The best way to advance the project is to use it and contribute your experience
and time.
btw, checking out kafka is a great idea. There are people around having Fun
Times with Kafka in production
Cheers
-
Aaron Morton
Fre
Isn't kafka too young for production using purpose ?
Clearly that would fit much better my needs but I can't afford early stage
project not ready for production. Is it ?
Le 30 avr. 2012 à 14:28, samal a écrit :
>
>
> On Mon, Apr 30, 2012 at 5:52 PM, Morgan Segalis wrote:
> Hi Samal,
>
> Th
On Mon, Apr 30, 2012 at 5:52 PM, Morgan Segalis wrote:
> Hi Samal,
>
> Thanks for the TTL feature, I wasn't aware of it's existence.
>
> Day's partitioning will be less wider than month partitionning (about 30
> times less give or take ;-) )
> Per day it should have something like 100 000 message
Hi Samal,
Thanks for the TTL feature, I wasn't aware of it's existence.
Day's partitioning will be less wider than month partitionning (about 30 times
less give or take ;-) )
Per day it should have something like 100 000 messages stored, most of it would
be retrieved so deleted before the TTL f
On Mon, Apr 30, 2012 at 4:25 PM, Morgan Segalis wrote:
> Hi Aaron,
>
> Thank you for your answer, I was beginning to think that my question would
> never be answered ;-)
>
> Actually, this is what I was going for, except one thing, instead of
> partitioning row per month, I though about partition
Hi Aaron,
Thank you for your answer, I was beginning to think that my question would
never be answered ;-)
Actually, this is what I was going for, except one thing, instead of
partitioning row per month, I though about partitioning per day, like that
everyday I launch the cleaning tool, and it
Message Queue is often not a great use case for Cassandra. For information on
how to handle high delete workloads see
http://www.datastax.com/dev/blog/leveled-compaction-in-apache-cassandra
It hard to create a model without some idea of the data load, but I would
suggest you start with:
CF: Us
Thanks!
Better than mine, as it considered later additions of services!
Will update my code,
Thanks
*Tamar Fraenkel *
Senior Software Engineer, TOK Media
[image: Inline image 1]
ta...@tok-media.com
Tel: +972 2 6409736
Mob: +972 54 8356490
Fax: +972 2 5612956
On Mon, Mar 12, 2012 at 11:
Alternate would be to add another row to your user CF specific for Facebook
ids. Column ID would be the Facebook identifier and value would be your
internal uuid.
Consider when you want to add another service like twitter. Will you then
add another CF per service or just another row specific now
In this case, where you know the query upfront, I add a custom secondary index
using another CF to support the query. It's a little easier here because the
data wont change.
UserLookupCF (using composite types for the key value)
row_key: e.g. "facebook:12345" or "twitter:12345"
col_name : e.g
Hi!
Thanks for the response.
>From what I read, secondary indices are good only for columns with few
possible values. Is this a good fit for my case? I have unique facebook id
for every user.
Thanks
*Tamar Fraenkel *
Senior Software Engineer, TOK Media
[image: Inline image 1]
ta...@tok-media.com
Either you do that or you could think about using a secondary index on the
fb user name in your primary cf.
See http://www.datastax.com/docs/1.0/ddl/indexes
Cheers
Am 11.03.2012 um 09:51 schrieb Tamar Fraenkel :
Hi!
I need some advise:
I have user CF, which has a UUID key which is my internal u
> 1. regarding time slicing, if at any point of time I am interested in what
> happened in the last T minutes, then I will need to query more than one row
> of the DimentionUpdates, right?
Yerp.
Sometimes that's is what's needed.
> 2. What did you mean by "You will also want to partition the
Hi!
Thank you very much for your response!
I have couple of questions regarding it, some are just to make sure I understood
you:
1. regarding time slicing, ifat any point of time I am interested in what
happened in the last T minutes, then I will need to query more than one row of
the Dimention
In general if you are collecting data over time you should consider
partitioning the row's to avoid creating very large rows. Also if you have a
common request you want to support consider modeling it directly rather than
using secondary indexes.
Assuming my understanding of the problem is in
On Sat, Jan 21, 2012 at 7:49 PM, Jean-Nicolas Boulay Desjardins <
jnbdzjn...@gmail.com> wrote:
> Milind Parikh, Rainbird is back by Twitter... My worry is that you
> might not be around in the future... Also, do you have evidence that
> your system is better? Because Rainbird is used by Twitter.
>
Hi
It may be my lack of knowledge but both has to do with counting, which is not
what I need.
What is wrong with the two models I suggested?
Tamar
Sent from my iPod
On Jan 22, 2012, at 2:49 AM, Jean-Nicolas Boulay Desjardins
wrote:
> Milind Parikh, Rainbird is back by Twitter... My worry is
Milind Parikh, Rainbird is back by Twitter... My worry is that you
might not be around in the future... Also, do you have evidence that
your system is better? Because Rainbird is used by Twitter.
On Sat, Jan 21, 2012 at 6:55 PM, Milind Parikh wrote:
>
> I used rainbird as inspiration for Countand
I used rainbird as inspiration for Countandra (& some of publicly available
data structures from rainbird preso). That said, there are significant
differences between the two architectures. Additiomally as Cassandra begins
to provide triggets, some very interesting things will become possible in
Co
But What about: Rainbird?
On Sat, Jan 21, 2012 at 10:52 AM, R. Verlangen wrote:
>
> A couple of days ago I came across Countandra ( http://countandra.org/ ). It
> seems that it might be a solution for you.
>
> Gr. Robin
>
>
> 2012/1/20 Tamar Fraenkel
>>
>> Hi!
>>
>> I am a newbie to Cassandra a
A couple of days ago I came across Countandra ( http://countandra.org/ ).
It seems that it might be a solution for you.
Gr. Robin
2012/1/20 Tamar Fraenkel
> **
>
> Hi!
>
> I am a newbie to Cassandra and seeking some advice regarding the data
> model I should use to best address my needs.
>
>
Hello Aaron,
Thanks for your reply. I will try it.
Greetings,
Pablo
2010/12/2 Aaron Morton
> I say yes to all your questions about what you can do with Solr.
>
> Some background the on the technology...
>
> Lucene is a Java library for doing full text search
> http://lucene.apache.org/java/doc
I say yes to all your questions about what you can do with Solr. Some background the on the technology...Lucene is a Java library for doing full text search http://lucene.apache.org/java/docs/index.htmlSolr turns lucene into a HTTP server and adds a bunch of other features such as making it easier
Hello Aaron and Jake,
Thank you for your replay. I've worked with cassandra for 6 month but I
never use Lucandra. I will try Lucandra, but I must ask (before start), Is
possible reach my searching/pagination/sorting requeriments with Lucandra?
Thank you in advance,
Pablo
2010/12/2 Jake Luciani
You can also run Solr with Cassandra as the backend:
https://github.com/tjake/Lucandra/tree/solandra
-Jake
On Thu, Dec 2, 2010 at 6:27 AM, aaron morton wrote:
> Have you considered using Solr / lucene for the search? It has a lot more
> search features, and it really good at faceted navigatio
Have you considered using Solr / lucene for the search? It has a lot more
search features, and it really good at faceted navigation through a product
catalogue. It sounds like it would be a better fit for this task.
You can build facets for your price ranges, do the product name thing and
filt
On Thu, Apr 15, 2010 at 6:01 PM, Sonny Heer wrote:
> Need a way to have two different types of indexes.
>
> Key: aTextKey
> ColumnName: aTextColumnName:55
> Value: ""
>
> Key: aTextKey
> ColumnName: 55:aTextColumnName
> Value: ""
>
> All the valuable information is stored in the column name itself
32 matches
Mail list logo