Re: Seeking advice on Schema and Caching

2011-11-16 Thread Aditya
ery only. > > In the 2nd case, yes, 2 or more queries are required to get a specific user's details, as the > username maps to the user_id key (unique, like a UUID) and the user_id key stores the > actual details. > > > On Wed, Nov 16, 2011 at 11:10 AM, Aditya Narayan wrote: > >> Regarding the fi

Re: Seeking advice on Schema and Caching

2011-11-16 Thread Aditya
set to a good limit, right ? >> One catch: this is only good for small rows. As each row contains > all entries with the same first 3 characters, one row could > become very large while others remain very thin. > e.g.: > many people can have the name aditya > adi{ > {ty

Re: Second Cassandra users survey

2011-11-28 Thread Aditya
Ability to mix counter columns & normal columns in the same column family. On Thu, Nov 17, 2011 at 6:46 PM, Boris Yen wrote: > I was wondering if it is possible to provide a function like "delete from > cf where column='value' " > > I think this should be useful for people who use secondary index a

Strategies to maintain counters sorted row ?

2011-11-29 Thread Aditya
I know it is not possible to sort columns in a row by counter values so what are the other strategies to maintain a sorted list (of counters) in cassandra? Could you propose some schema that might be helpful to achieve this ? Or do I need to retrieve thousands of columns each time and do the sort
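The fallback the question mentions (fetch the counter columns and sort them on the client) can be sketched as below. This is only an illustration under stated assumptions: `counters` is a hypothetical dict of column name to counter value that has already been read from the row by whatever client is in use, and `top_n` is a made-up helper name, not part of any Cassandra API.

```python
import heapq


def top_n(counters, n):
    """Return the n (name, count) pairs with the highest counts, highest first.

    `counters` is assumed to be a plain dict already fetched from the row;
    nothing here talks to Cassandra.
    """
    # nlargest avoids a full sort when n is much smaller than len(counters).
    return heapq.nlargest(n, counters.items(), key=lambda kv: kv[1])
```

For a row with thousands of counters and a small page size, this keeps the client-side work close to a single pass over the fetched columns.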

Re: Strategies to maintain counters sorted row ?

2011-12-01 Thread Aditya
, Aditya wrote: > I know it is not possible to sort columns in a row by counter values so > what are the other strategies to maintain a sorted list (of counters) in > cassandra? Could you propose some schema that might be helpful to achieve > this ? > > Or do I need to ret

Retrieve all composite columns from a row, whose composite name's first component matches from a list of Integers

2011-12-26 Thread Aditya
I need to store data on all activities by a user's followees in a single row. I am trying to do that using composite column names in a single user-specific row named 'rowX'. On any activity by a user's followee on an item, a column is stored in 'rowX'. The column has a composite type column na

Re: Retrieve all composite columns from a row, whose composite name's first component matches from a list of Integers

2011-12-28 Thread Aditya
> > On Tuesday, December 27, 2011, Aditya wrote: > > I need to store data of all activities by user's followies in single > row. I am trying to do that making use of composite column names in a > single user specific row named 'rowX'. > > On any activity b

Re: Retrieve all composite columns from a row, whose composite name's first component matches from a list of Integers

2011-12-29 Thread Aditya
> column need to be read? Why? > > > > Thanks > > Le 28 déc. 2011 19:26, "Edward Capriolo" a > écrit : > > > >> Super columns have the same fundamental problem and perform worse in > >> general. So switching from composites to supe

Re: Retrieve all composite columns from a row, whose composite name's first component matches from a list of Integers

2011-12-29 Thread Aditya
Another point worth noting is that there will be at most 8-10 subcolumns per supercolumn. I need to write one subcolumn at a time (but always read the entire supercolumn). On Fri, Dec 30, 2011 at 12:20 AM, Aditya wrote: > @Edward: Perhaps you missed to notice that I need to alw

Re: Retrieve all composite columns from a row, whose composite name's first component matches from a list of Integers

2011-12-29 Thread Aditya
or a SliceRange. A ColumnPath takes a > Single SuperColumn. > > I do not see how this is done with Columns or SuperColumns. Maybe you > can provide a code snippet and/or some sample data? > > On 12/29/11, Aditya wrote: > > @Edward: Perhaps you missed to notice

Inconsistent behavior during read

2015-06-25 Thread Aditya Shetty
where object_id = '674813358269' and object_type = 0 and object_owner_id = 0;* Note that I am querying using the Java driver. -- Aditya Shetty

Re: Inconsistent behavior during read

2015-06-25 Thread Aditya Shetty
d reading of the data? > > Cheers, > Perica > > > On Thu, Jun 25, 2015 at 12:12 PM, Aditya Shetty < > aditya.she...@snapdeal.com> wrote: > >> Hi >> >> I have a 3 node cassandra cluster with a replication factor of 2. I have >> a basic column family

Re: Inconsistent behavior during read

2015-06-25 Thread Aditya Shetty
lly. Reads of partition ranges (or a range slice, in old Thrift > terms) don't do read repair. > > > -- > Tyler Hobbs > DataStax <http://datastax.com/> > -- Aditya Shetty

Design for 'Most viewed Discussions' in a forum

2011-05-18 Thread Aditya Narayan
* For a discussions forum, I need to show a page of most viewed discussions. For implementing this, I maintain a count of views of a discussion & when this views count of a discussion passes a certain threshold limit, the discussion Id is added to a row of most viewed discussions. Thi

Re: Design for 'Most viewed Discussions' in a forum

2011-05-18 Thread Aditya Narayan
help minimize several versions of the same column in the row parts in different SST tables. On Wed, May 18, 2011 at 11:04 PM, Aditya Narayan wrote: > * > For a discussions forum, I need to show a page of most viewed discussions. > > For implementing this, I maintain a count

Re: Design for 'Most viewed Discussions' in a forum

2011-05-18 Thread Aditya Narayan
1 > > Then you just query in the following way : > > MGET <http://redis.io/commands/mget> topics:*:timestampN > > * is the wildcard, you order by viewcount and you have what you are asking > for ! > This is a simplified version of what you should do but personnally I r

Storing counters in the standard column families along with non-counter columns ?

2011-07-10 Thread Aditya Narayan
Is it now possible to store counters in the standard column families along with non counter type columns ? How to achieve this ?

Re: Storing counters in the standard column families along with non-counter columns ?

2011-07-10 Thread Aditya Narayan
ved ... > > https://issues.apache.org/jira/browse/CASSANDRA-2614 > > -sd > > On Sun, Jul 10, 2011 at 5:04 PM, Aditya Narayan wrote: > > Is it now possible to store counters in the standard column families > along > > with non counter type columns ? How to achieve this ? >

Re: Storing counters in the standard column families along with non-counter columns ?

2011-07-10 Thread Aditya Narayan
ff, where as normal CF simply just add or > replace. > > > On Sun, Jul 10, 2011 at 10:39 PM, Aditya Narayan wrote: > >> Thanks for info. >> >> Is there any target version in near future for which this has been >> promised ? >> >> >> On Sun, Jul

Re: Storing counters in the standard column families along with non-counter columns ?

2011-07-11 Thread Aditya Narayan
Oops that's really very much disheartening and it could seriously impact our plans for going live in near future. Without this facility I guess counters currently have very little usefulness. On Mon, Jul 11, 2011 at 8:16 PM, Chris Burroughs wrote: > On 07/10/2011 01:09 PM, Aditya Naray

Re: Storing counters in the standard column families along with non-counter columns ?

2011-07-14 Thread Aditya Narayan
Thanks Aaron & Chris, I appreciate your help. With a dedicated CF for counters, in addition to the issue pointed out by Chris, the major drawback I see is that I can't read the counters together with the regular columns of a row *in a single query*, which my application needs to do widely. My use case is like storin

Programmatically allow only one out of two types of rows in a CF to enter the CACHE

2011-10-28 Thread Aditya Narayan
I need to keep the data of some entities in a single CF but split into two rows for each entity. One row contains overview information for the entity & another row contains detailed information about the entity. I want to keep both rows in a single CF so they may be retrieved in a single query whe

Re: Programmatically allow only one out of two types of rows in a CF to enter the CACHE

2011-10-29 Thread Aditya Narayan
..so that I can retrieve them through a single query. For reading cols from two CFs you need two queries, right ? On Sat, Oct 29, 2011 at 9:53 PM, Mohit Anchlia wrote: > Why not use 2 CFs? > > On Fri, Oct 28, 2011 at 9:42 PM, Aditya Narayan wrote: > > I need to keep t

Re: Programmatically allow only one out of two types of rows in a CF to enter the CACHE

2011-10-29 Thread Aditya Narayan
do you mean exactly by "indexing some of the higher levels of data" ? Thank you guys! > Anthony > > > On 28/10/2011, at 21:42 PM, Aditya Narayan wrote: > > > I need to keep the data of some entities in a single CF but split in two > rows for each entity. On

Re: Programmatically allow only one out of two types of rows in a CF to enter the CACHE

2011-10-29 Thread Aditya Narayan
Thanks Zach, Nice Idea ! and what about looking at, may be, some custom caching solutions, leaving aside cassandra caching .. ? On Sun, Oct 30, 2011 at 2:00 AM, Zach Richardson < j.zach.richard...@gmail.com> wrote: > Aditya, > > Depending on how often you have to write to th

Re: Cassandra Cluster Admin - phpMyAdmin for Cassandra

2011-11-01 Thread Aditya Narayan
Yes that would be pretty nice feature to see! On Mon, Oct 31, 2011 at 10:45 PM, Ertio Lew wrote: > Thanks so much SebWajam for this great piece of work! > > Is there a way to set a data type for displaying the column names/ values > of a CF ? It seems that your project always uses String Seri

Concatenating ids with extension to keep multiple rows related to an entity in a single CF

2011-11-03 Thread Aditya Narayan
I am concatenating two Integer ids through bitwise operations(as described below) to create a single primary key of type long. I wanted to know if this is a good practice. This would help me in keeping multiple rows of an entity in a single column family by appending different extensions to the en
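The bit layout the post refers to ("as described below") is cut off in this archive, so the following is only a minimal sketch of one common way to do it, assuming the entity id goes in the high 32 bits and the extension in the low 32 bits. The names `pack_key` and `unpack_key` are hypothetical.

```python
from typing import Tuple

MASK32 = 0xFFFFFFFF  # lowest 32 bits set


def pack_key(entity_id: int, extension: int) -> int:
    """Combine two unsigned 32-bit ids into one 64-bit key.

    Assumed layout: entity id in the high 32 bits, extension in the low 32.
    """
    if not (0 <= entity_id <= MASK32 and 0 <= extension <= MASK32):
        raise ValueError("both ids must fit in 32 unsigned bits")
    return (entity_id << 32) | extension


def unpack_key(key: int) -> Tuple[int, int]:
    """Split a packed key back into (entity_id, extension)."""
    return key >> 32, key & MASK32
```

One design point to keep in mind if this is ported to Java: `long` is signed, so an entity id of 2^31 or more would produce a negative key, and the low half needs masking (`ext & 0xFFFFFFFFL`) before the OR to avoid sign extension.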

Re: Concatenating ids with extension to keep multiple rows related to an entity in a single CF

2011-11-03 Thread Aditya Narayan
10:11 AM, Tyler Hobbs wrote: > On Thu, Nov 3, 2011 at 3:48 PM, Aditya Narayan wrote: > >> I am concatenating two Integer ids through bitwise operations(as >> described below) to create a single primary key of type long. I wanted to >> know if this is a good practice. This w

Store profile pics of users in Cassandra or file system ?

2011-11-11 Thread Aditya Narayan
Would it be recommended to store the profile pics of users on an application in Cassandra ? Or file system would be a better way to go. I came across an interesting paper which advocates storing in DB for blobs sized up to 1 MB. I was planning to store the image bytes in the same row that contained

Re: Store profile pics of users in Cassandra or file system ?

2011-11-11 Thread Aditya Narayan
just forgot to add the paper link if this is useful at all : To BLOB or Not To BLOB: Large Object Storage in a Database or a Filesystem<http://research.microsoft.com/apps/pubs/default.aspx?id=64525> On Sat, Nov 12, 2011 at 12:34 AM, Aditya Narayan wrote: > Would it be recommended to

Seeking advice on Schema and Caching

2011-11-15 Thread Aditya Narayan
Hi I need to add 'search users' functionality to my application. (The trigger for fetching searched items(like google instant search) is made when 3 letters have been typed in). For this, I make a CF with String type keys. Each such key is made of first 3 letters of a user's name. Thus all names

Re: Seeking advice on Schema and Caching

2011-11-15 Thread Aditya Narayan
Any insights on this ? On Tue, Nov 15, 2011 at 9:40 PM, Quintero wrote: > > > Aditya Narayan wrote: > > >Hi > > > >I need to add 'search users' functionality to my application. (The trigger > >for fetching searched items(like google instant search

Re: Seeking advice on Schema and Caching

2011-11-15 Thread Aditya Narayan
with multiple users of same name. Thus super columns seem to fit appropriately but I really want to avoid them since they are seriously discouraged by everyone. On Wed, Nov 16, 2011 at 3:19 AM, Ben Gambley wrote: > Hi Aditya > > Not sure the best way to do in Cassandra but have you c

Re: Seeking advice on Schema and Caching

2011-11-15 Thread Aditya Narayan
I would have multiple userIds(of users named "Marcos") to be >>> stored inside columnname "cos" under key "Mar". Thus, >>> > >>> > 1. Supercolumn seems to be a better fit for my use case(so that ids of >>> users with same name

Re: Seeking advice on Schema and Caching

2011-11-15 Thread Aditya Narayan
Regarding the first option that you suggested through composite columns, can I store the username & id both in the column name and keep the column valueless? Will I be able to retrieve both the username and id from the composite col name ? Thanks a lot On Wed, Nov 16, 2011 at 10:56 AM, Ad
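The idea being asked about (row key = first 3 letters, composite column name = username plus id, empty value) can be modelled in memory as follows. This is a sketch of the data layout only, not client code: the dict stands in for the column family, and `add_user` and `search` are hypothetical helper names.

```python
from bisect import insort
from collections import defaultdict

PREFIX_LEN = 3  # the thread triggers the search after 3 typed letters


def add_user(index, username, user_id):
    """Store (username, user_id) as a valueless composite 'column name'.

    `index` maps a row key (name prefix) to a sorted list of composites,
    mimicking how a comparator keeps columns ordered within a row.
    """
    key = username[:PREFIX_LEN].lower()
    insort(index[key], (username.lower(), user_id))


def search(index, typed):
    """Return every (username, user_id) whose name starts with `typed`."""
    typed = typed.lower()
    row = index.get(typed[:PREFIX_LEN], [])
    return [(name, uid) for name, uid in row if name.startswith(typed)]
```

Because both components live in the column name, two users named "Marcos" do not overwrite each other, and both the name and the id come back from the name alone. The row-skew concern raised elsewhere in the thread still applies: a popular prefix produces one very wide row.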

Newbie Question: Cassandra consuming 100% CPU on ubuntu server

2012-02-18 Thread Aditya Gupta
I just installed Cassandra on my ubuntu server by adding the following to the sources list: deb http://www.apache.org/dist/cassandra/debian 10x main deb-src http://www.apache.org/dist/cassandra/debian 10x main Soon after install I started getting OOM errors & then the server became unresponsive.

Re: Newbie Question: Cassandra consuming 100% CPU on ubuntu server

2012-02-18 Thread Aditya Gupta
decrease heap size. > Update MAX_HEAP_SIZE and HEAP_NEWSIZE and restart. > > From iPhone > > > On 2012/02/18, at 20:40, Aditya Gupta wrote: > > I just installed Cassandra on my ubuntu server by adding the following to > the sources list: > > deb http://www.apac

Re: Newbie Question: Cassandra consuming 100% CPU on ubuntu server

2012-02-18 Thread Aditya Gupta
For clarification, I'm running Cassandra on a VMware ubuntu server instance. On Sat, Feb 18, 2012 at 6:44 PM, Aditya Gupta wrote: > Thanks! But what about the 100% cpu consumption that is causing the server > to hang? > > > On Sat, Feb 18, 2012 at 6:19 PM, Watanabe Maki w

Re: Newbie Question: Cassandra consuming 100% CPU on ubuntu server

2012-02-18 Thread Aditya Gupta
No data at all. just a fresh installation On Sat, Feb 18, 2012 at 6:57 PM, R. Verlangen wrote: > You might want to check your Cassandra logs, they contain important > information that might lead you to the actual cause of the problems. > > 2012/2/18 Aditya Gupta > >> Thank

Re: Newbie Question: Cassandra consuming 100% CPU on ubuntu server

2012-02-18 Thread Aditya Gupta
Am I installing it the right way ? While installing I didn't verify the signatures using public key. On Sat, Feb 18, 2012 at 8:21 PM, Aditya Gupta wrote: > No data at all. just a fresh installation > > > On Sat, Feb 18, 2012 at 6:57 PM, R. Verlangen wrote: > >> Yo

Re: Newbie Question: Cassandra consuming 100% CPU on ubuntu server

2012-02-18 Thread Aditya Gupta
ckups on EC2 > yesterday and found: http://wiki.apache.org/cassandra/FAQ#ubuntu_hangs > > > On 18/02/2012 14:58, Aditya Gupta wrote: > > Am I installing it the right way ? While installing I didn't verify the > signatures using public key. > > On Sat, Feb 18, 2012 at 8

Re: Newbie Question: Cassandra consuming 100% CPU on ubuntu server

2012-02-19 Thread Aditya Gupta
Is there anything to do with running cassandra on a VMware ubuntu instance !? I am trying Cassandra on VMware ubuntu server instance. On Sun, Feb 19, 2012 at 2:45 AM, Aditya Gupta wrote: > In my case, after installing cassandra, as soon as the server is started > it hangs on it own (t

Re: Newbie Question: Cassandra consuming 100% CPU on ubuntu server

2012-02-20 Thread Aditya Gupta
@Tamar: Thanks a ton! works perfect now! On Mon, Feb 20, 2012 at 12:02 PM, Tamar Fraenkel wrote: > On 19/02/2012 21:55, Aditya Gupta wrote: > > Is there anything to do with running cassandra on a VMware ubuntu instance > !? I am trying Cassandra on VMware ubuntu server instance. >

What linux distro for the Cassandra nodes ?

2012-02-21 Thread Aditya Gupta
I am about to choose a linux distro to be installed on Cassandra nodes. Which are the most popular & recommended ones by Cassandra community? (Not interested in paying licensing fees)

Re: What linux distro for the Cassandra nodes ?

2012-02-21 Thread Aditya Gupta
ormance. > > However I read something about bugs with a certain Ubuntu version in > > combination with a specific Cassandra version. > > > > 2012/2/21 Rishabh Agrawal > >> > >> Hello Aditya, > >> > >> > >> > >> I am

Only the last added node is visible in the cluster

2012-02-25 Thread Aditya Gupta
While creating a multinode cluster, my nodes are unable to identify all the nodes in the cluster. Only the 'last added' node is visible when I do: ./nodetool -h localhost ring I am trying to create a 4-node cluster. On starting the seed node, the above command shows just itself(ok.. good), the

Re: Only the last added node is visible in the cluster

2012-02-25 Thread Aditya Gupta
25, 2012 at 9:48 AM, Aditya Gupta wrote: > > While creating a multinode cluster, my nodes are unable to identify all > the > > nodes in the cluster. > > Only the *last added' node is visible when I do: > > ./nodetool -h localhost ring > > > > > > I

Re: Only the last added node is visible in the cluster

2012-02-25 Thread Aditya Gupta
system keyspace. In any > case, some nodetool ring output at each node addition will clarify > this. > > -Brandon > > On Sat, Feb 25, 2012 at 1:20 PM, Aditya Gupta wrote: > > Nope, I just re-verified :) > > I have split up the range into 4 parts for 4 nodes. I have spec

Re: Only the last added node is visible in the cluster

2012-02-25 Thread Aditya Gupta
issue ? On Sun, Feb 26, 2012 at 2:49 AM, Aditya Gupta wrote: > yes, exactly I did cloned a single VMware machine to make other > instances.. so how do I correct this now ? > > > On Sun, Feb 26, 2012 at 2:18 AM, Brandon Williams wrote: > >> Then my next guess is you clone

Re: Only the last added node is visible in the cluster

2012-02-25 Thread Aditya Gupta
Thanks removing the system keyspace worked! Thanks! On Sun, Feb 26, 2012 at 3:39 AM, Brandon Williams wrote: > On Sat, Feb 25, 2012 at 3:39 PM, Aditya Gupta wrote: > > The output of nodetool ring after each addition of nodes, make just the > last > > added node visible in th

How to reduce the memory consumed by cassandra (so as to prevent crashes & OOMs) ?

2012-02-27 Thread Aditya Gupta
I'm running a 4-node Cassandra cluster of VMware Ubuntu instances, each with 768MB of memory (on a single machine for development purposes). I need to reduce the heap size appropriately, as my nodes have been crashing at times with OOMs. How do I configure this? I think I would need to make some tweaks wi

nodetool decommission unable to do its job successfully

2012-02-29 Thread Aditya Gupta
I ran nodetool decommission on a node from a 4 nodes cluster as follows: nodetool -h 192.168.26.130 decommission But it never completes & doesn't show any messages, on seeing the logs I noticed: java.util.concurrent.RejectedExecutionException: ThreadPoolExecutor has shut down at org.apache.cass

Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread Aditya Narayan
or just a standard column family containing "all the subcolumns data serialized in single column(s) " ? Thanks Aditya Narayan

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread Aditya Narayan
of tags associated with particular reminder. All tags set at once during first write. The no of tags(subcolumns) will be around 8 maximum. Any comments, suggestions and feedback on the schema design are requested.. Thanks Aditya Narayan On Wed, Feb 2, 2011 at 7:49 PM, Aditya Narayan wrote: >

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread Aditya Narayan
ws of the reminder details would be picked up.." Is supercolumn a preferable choice for this ? Can there be a better schema than this ? -Aditya Narayan On Wed, Feb 2, 2011 at 8:54 PM, William R Speirs wrote: > To reiterate, so I know we're both on the same page, your schema would be

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread Aditya Narayan
ers) for a standard SQL/relational model, then it's > probably too much for a single row. > > I'm not familiar with the TTL functionality of Cassandra... sorry cannot > help/comment there, still learning :-) > > Yea, my $0.02 is that this is an effective way to lev

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread Aditya Narayan
standard type column family. Thanks -Aditya Narayan On Wed, Feb 2, 2011 at 10:11 PM, William R Speirs wrote: > I did not understand before... sorry. > > Again, depending upon how many reminders you have for a single user, this > could be a long/wide row. Again, it really comes down

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-02 Thread Aditya Narayan
Can I have some more feedback about my schema, perhaps somewhat more critical/harsh ? Thanks again, Aditya Narayan On Wed, Feb 2, 2011 at 10:27 PM, Aditya Narayan wrote: > @Bill > Thank you Bill! > > @Cassandra users > Can others also leave their suggestions and comments

Re: Schema Design Question : Supercolumn family or just a Standard column family with columns containing serialized aggregate data?

2011-02-03 Thread Aditya Narayan
Thanks Tyler! On Thu, Feb 3, 2011 at 12:06 PM, Tyler Hobbs wrote: > On Wed, Feb 2, 2011 at 3:27 PM, Aditya Narayan wrote: >> >> Can I have some more feedback about my schema perhaps somewhat more >> criticisive/harsh ? > > It sounds reasonable to me. > > Since

Sorting in time order without using TimeUUID type column names

2011-02-03 Thread Aditya Narayan
timeline in the order of their due time.) Basically I am trying to avoid 16 bytes long timeUUID first because they are too long and the above defined key pattern is guaranteeing me a unique key/Id for the reminder row always. Thanks Aditya Narayan

Re: Sorting in time order without using TimeUUID type column names

2011-02-03 Thread Aditya Narayan
perhaps not aware of ? On Thu, Feb 3, 2011 at 5:43 PM, Sylvain Lebresne wrote: > On Thu, Feb 3, 2011 at 11:27 AM, Aditya Narayan wrote: >> >> Hey all, >> >> I want to store some columns that are reminders to the users on my >> application, in time sorted order

Column Sorting of integer names

2011-02-04 Thread Aditya Narayan
Is there any way to sort the columns named as integers in the descending order ? Regards -Aditya

Re: Using Cassandra to store files

2011-02-04 Thread Aditya Narayan
I am also looking into possible solutions for storing PDFs & Word documents. But why not store them in the filesystem instead of a database, unless your files are small, in which case a database would be the recommended choice? -Aditya On Fri, Feb 4, 2011 at 5:30 PM, Daniel Doubl

Re: Using Cassandra to store files

2011-02-04 Thread Aditya Narayan
yes, definitely a database for mapping of course! On Fri, Feb 4, 2011 at 11:17 PM, buddhasystem wrote: > > Even when storage is in NFS, Cassandra can still be quite useful as a file > catalog. Your physical storage can change, move etc. Therefore, it's a good > idea to provide mapping of logical n

Re: Sorting in time order without using TimeUUID type column names

2011-02-04 Thread Aditya Narayan
omparator type (ReversedIntegerType) to sort those longs in descending order. Regards Aditya On Sat, Feb 5, 2011 at 6:24 AM, aaron morton wrote: > IMHO If you know the time of the event use store the time as a long, rather > than a UUID. It will make it easier to get back to a > time and
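The snippet above talks about a reversed comparator for longs. A different technique that gives the same newest-first ordering without a custom comparator is to store the complement of the timestamp, so that plain ascending byte order is descending time order. The sketch below shows that alternative, not the comparator approach from the thread; `descending_name` and `timestamp_from_name` are hypothetical helper names, and timestamps are assumed to be non-negative.

```python
import struct

LONG_MAX = 2**63 - 1  # largest signed 64-bit value


def descending_name(timestamp_ms: int) -> bytes:
    """Encode a timestamp so that ascending byte order means newest first."""
    if not 0 <= timestamp_ms <= LONG_MAX:
        raise ValueError("timestamp must be a non-negative 64-bit value")
    # Big-endian so that bytewise comparison matches numeric comparison.
    return struct.pack(">q", LONG_MAX - timestamp_ms)


def timestamp_from_name(name: bytes) -> int:
    """Recover the original timestamp from an encoded column name."""
    return LONG_MAX - struct.unpack(">q", name)[0]
```

The encoded name is 8 bytes, half the size of a TimeUUID, but unlike a TimeUUID it is not unique by itself: two events with the same timestamp collide unless something else (such as an id suffix) is added to the name.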

Does variation in no of columns in rows over the column family has any performance impact ?

2011-02-06 Thread Aditya Narayan
Does a huge variation in the number of columns per row across a column family have *any* impact on performance ? Can I have just 100 columns in some rows and hundreds of thousands of columns in another set of rows, without any downsides ?

Re: Does variation in no of columns in rows over the column family has any performance impact ?

2011-02-07 Thread Aditya Narayan
Thanks for the detailed explanation Peter! Definitely cleared my doubts ! On Mon, Feb 7, 2011 at 1:52 PM, Peter Schuller wrote: >> Does huge variation in no. of columns in rows, over the column family >> has *any* impact on the performance ? >> >> Can I have like just 100 columns in some rows a

Calculating the size of rows in KBs

2011-02-10 Thread Aditya Narayan
How can I get or calculate the size of rows/columns ? What are the memory overheads for each column/row ?

Re: Calculating the size of rows in KBs

2011-02-10 Thread Aditya Narayan
the asked column names exist in indexes for that row/key & they dont need to deserialize the blocks in SST tables searching for column values. Am I thinking right way?? -Aditya On Fri, Feb 11, 2011 at 1:54 AM, Aaron Morton wrote: > If you want to get the byte size of a particular row you w

Re: Merging the rows of two column families(with similar attributes) into one ??

2011-02-12 Thread Aditya Narayan
What if the caching requirements, sorting needs of two kind of data are very much similar, is it preferable to go with a single CF in those cases ? Regards Aditya >>> On Sat, Feb 5, 2011 at 10:43 AM, Tyler Hobbs  wrote: >>>>>> >>>>>> I read somew

Re: Merging the rows of two column families(with similar attributes) into one ??

2011-02-12 Thread Aditya Narayan
Any comments/view points on this? --On Sat, Feb 12, 2011 at 5:05 PM, Aditya Narayan wrote: What if the caching requirements, sorting needs of two kind of data are very much similar, is it preferable to go with a single CF in those cases ? Regards Aditya > > >>> On Sat, Fe

Re: Confused about get_slice SliceRange behavior with bloom filter

2011-02-13 Thread Aditya Narayan
ld use the column names list for this SliceQuery operation). Thanks Aditya On Sun, Feb 13, 2011 at 8:41 PM, Jonathan Ellis wrote: > On Sun, Feb 13, 2011 at 12:37 AM, E S wrote: > > I've gotten myself really confused by > > http://wiki.apache.org/cassandra/ArchitectureInt

Re: Confused about get_slice SliceRange behavior with bloom filter

2011-02-14 Thread Aditya Narayan
to be sequential on disk). So if the columns you ask for are > really randomly distributed, then yes, the bigger the row is, the bigger > the chance is to have to hit many blocks and the bigger the chance is for > these blocks to be far apart on disk. > > -- > Sylvain > > On

Re: Confused about get_slice SliceRange behavior with bloom filter

2011-02-14 Thread Aditya Narayan
Thanks for the clarifications.. On Mon, Feb 14, 2011 at 6:13 PM, Sylvain Lebresne wrote: > On Mon, Feb 14, 2011 at 11:27 AM, Aditya Narayan wrote: > >> Thanks Sylvain, >> >> I guess I might have misunderstood the meaning of column_index_size_in_kb, >> My previou

Splitting a single row into multiple

2011-02-23 Thread Aditya Narayan
Does it make any difference if I split a row, that needs to be accessed together, into two or three rows and then read those multiple rows ?? (Assume the keys of all the three rows are known to me programmatically since I split columns by certain categories). Would the performance be any better if a

Re: Splitting a single row into multiple

2011-02-23 Thread Aditya Narayan
so a > single row read gets what you need. > > Aaron > > On 24/02/2011, at 5:59 AM, Aditya Narayan wrote: > >> Does it make any difference if I split a row, that needs to be >> accessed together, into two or three rows and then read those multiple >> rows

What would be a good strategy for Storing the large text contents like blog posts in Cassandra.

2011-03-06 Thread Aditya Narayan
What would be a good strategy to store large text content/(blog posts of around 1500-3000 characters) in cassandra? I need to store these blog posts along with their metadata like bloggerId, blogTags. I am looking forward to store this data in a single row giving each attribute a single column. So

Re: What would be a good strategy for Storing the large text contents like blog posts in Cassandra.

2011-03-06 Thread Aditya Narayan
> try one and be prepared to change. > > Note that counters are only in the 0.8 trunk and are still under development, > they are not going to be released for a couple of months. > > Your per column data size is nothing to be concerned about. > > Hope that helps. > Aaron >

Splitting the data of a single blog into 2 CFs (to implement effective caching) according to views.

2011-03-07 Thread Aditya Narayan
My application displays a list of several blogs' overview data (like blogTitle/ nameOfBlogger/ shortDescription for each blog) on the 1st page (much like Digg's newsfeed), and when the user selects a particular blog to see, the application takes them to that specific blog's full pag

Re: Splitting the data of a single blog into 2 CFs (to implement effective caching) according to views.

2011-03-08 Thread Aditya Narayan
CF2 as well (use a batch_mutation > through whatever client you have). So when serving the second page you only > need to read one row from CF2. > > > Aaron > > On 8/03/2011, at 8:13 PM, Norman Maurer wrote: > > Yeah this make sense as far as I can tell. > > > Bye,

Does the memtable replace the old version of column with the new overwriting version or is it just a simple append ?

2011-03-08 Thread Aditya Narayan
, since Cassandra will have to read so many versions of the same column. If this is just replacement with old column then I guess read will be much better since it needs to see just single existing version of column. Thanks Aditya Narayan

Re: Does the memtable replace the old version of column with the new overwriting version or is it just a simple append ?

2011-03-08 Thread Aditya Narayan
d the reconciliation of that happens > during read (read repair). This is why reads are slower than writes because > conflict resolution happens during read. > > Hope this answers the question! > > Thanks, > -Naren > > On Tue, Mar 8, 2011 at 10:44 PM, Aditya Narayan wrote: &

RE: WordCount example problem

2010-11-10 Thread Aditya Muralidharan
Also, your Mapper class needs to look like this: MyMapper extends Mapper,Text,SumWritable> ... with all the necessary fixes to the map method. AD -Original Message- From: Jonathan Ellis [mailto:jbel...@gmail.com] Sent: Wednesday, November 10, 2010 8:40 AM To: user Subject: Re: WordCount

RE: MapReduce/Hadoop in cassandra 0.7 beta3

2010-11-10 Thread Aditya Muralidharan
My bad. Moved to Cassandra user list. -Original Message- From: Aditya Muralidharan [mailto:aditya.muralidha...@nisc.coop] Sent: Wednesday, November 10, 2010 10:48 AM To: u...@pig.apache.org Subject: RE: MapReduce/Hadoop in cassandra 0.7 beta3 Hi, I'm still getting the error assoc

Cassandra 0.7 beta3 BinaryMemtable and Supercolumns

2010-11-11 Thread Aditya Muralidharan
Is it possible for BinaryMemtable RowMutations to a ColumnFamily with supercolumns?

RE: Cassandra 0.7 beta3 BinaryMemtable and Supercolumns

2010-11-11 Thread Aditya Muralidharan
From: Aditya Muralidharan [mailto:aditya.muralidha...@nisc.coop] Sent: Thursday, November 11, 2010 3:27 PM To: user@cassandra.apache.org Subject: Cassandra 0.7 beta3 BinaryMemtable and Supercolumns Is it possible for BinaryMemtable RowMutations to a ColumnFamily with supercolumns?

RE: Cassandra 0.7 beta3 BinaryMemtable and Supercolumns

2010-11-12 Thread Aditya Muralidharan
lumns Before you dig into that, are you sure normal Thrift inserts are not fast enough? On Thu, Nov 11, 2010 at 4:41 PM, Aditya Muralidharan wrote: > Pretty sure I could ask that better: > > > > Is it possible for me to perform RowMutations on BinaryMemtable for a > ColumnFamily of

RE: Booting Cassandra v0.7.0 on Windows: rename failed

2010-11-29 Thread Aditya Muralidharan
I've run into this as well. Having confirmed that there are no handles on the file (it's only ever created and used by Cassandra), and having stepped through the code, I've concluded that something in the io (not sure if it's the jvm or the os) stack is lazy about releasing the file handle for '