Hi All,

With Twitter, when I search for a phrase like "cassandra is the bestest", 4 tweets appear, including one I just posted. My understanding is that, internally, Twitter indexes every word in a tweet, irrespective of the presence of a # hashtag, and adds the tweet id to a row for that word. What puzzles me, and I am hoping some smart people on here can shed some light on it, is this: how would this work with Cassandra?

row [ cassandra ]: key -> tweetid / timestamp
row [ bestest ]: key -> tweetid / timestamp

I had thought that I could simply pull the list of all column names from each row (each row representing one word) and flag the tweet ids that occur in every row. However, these rows would get quite long over time. Am I missing an easier way to get a list of all the tweet ids that exist in multiple rows?

--
Sasha Dolgy
sasha.do...@gmail.com
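
P.S. To make the question concrete, here is a minimal Python sketch of the approach described above. It is only an illustration under stated assumptions: the dict named rows stands in for the per-word Cassandra rows and is not a Cassandra client, and the names index_tweet and search_all_words are made up for this example.

```python
from collections import defaultdict

# In-memory stand-in for the layout above: one "row" per word, whose
# "columns" map tweet id -> timestamp. This is NOT a Cassandra client.
rows = defaultdict(dict)


def index_tweet(tweet_id, timestamp, text):
    """Add the tweet id to the row of every word in the tweet."""
    for word in text.lower().split():
        rows[word][tweet_id] = timestamp


def search_all_words(query):
    """Return the tweet ids that appear in the row of every query word."""
    words = query.lower().split()
    if not words:
        return set()
    # Start from the shortest row so the candidate set stays small.
    words.sort(key=lambda w: len(rows.get(w, {})))
    matches = set(rows.get(words[0], {}))
    for word in words[1:]:
        # Keep only the ids that are also column names in this word's row.
        matches.intersection_update(rows.get(word, {}))
        if not matches:
            break  # no tweet contains every word seen so far
    return matches


index_tweet(1, 1000, "cassandra is the bestest")
index_tweet(2, 1001, "cassandra is fast")
index_tweet(3, 1002, "the bestest cassandra is here")

print(search_all_words("cassandra is the bestest"))  # tweets 1 and 3
```

The sketch still reads every column name of each word's row, which is exactly the cost I am worried about once the rows get long. Starting with the shortest row only reduces the size of the candidate set, not the amount of data read.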