ery only.
>
> In 2nd case, yes 2 or more query required to get specific user details. As
> username is map to user_id's key(unique like UUID) and user_id key store
> actual details.
>
>
> On Wed, Nov 16, 2011 at 11:10 AM, Aditya Narayan wrote:
>
>> Regarding the fi
set to a good limit, right ?
>> one catch this is only good for small size row, as your one row contain
> all entry with first 3 similar char, this can happen that one row could
> become very large while other remain very thin.
> eg:
> many ppl can have aditya name
> adi{
> {ty
Ability to mix counter columns & normal columns in same column family.
On Thu, Nov 17, 2011 at 6:46 PM, Boris Yen wrote:
> I was wondering if it is possible to provide a funtion like "delete from
> cf where column='value' "
>
> I think this shold be useful for people who use secondary index a
I know it is not possible to sort columns in a row by counter values so
what are the other strategies to maintain a sorted list (of counters) in
cassandra? Could you propose some schema that might be helpful to achieve
this ?
Or do I need to retrieve thousands of columns each time and do the sort
, Aditya wrote:
> I know it is not possible to sort columns in a row by counter values so
> what are the other strategies to maintain a sorted list (of counters) in
> cassandra? Could you propose some schema that might be helpful to achieve
> this ?
>
> Or do I need to ret
I need to store data of all activities by user's followies in single row. I
am trying to do that making use of composite column names in a single user
specific row named 'rowX'.
On any activity by a user's followie on an item, a column is stored in
'rowX'. The column has a composite type column na
>
> On Tuesday, December 27, 2011, Aditya wrote:
> > I need to store data of all activities by user's followies in single
> row. I am trying to do that making use of composite column names in a
> single user specific row named 'rowX'.
> > On any activity b
> column need to be read? Why?
> >
> > Thanks
> > Le 28 déc. 2011 19:26, "Edward Capriolo" a
> écrit :
> >
> >> Super columns have the same fundamental problem and perform worse in
> >> general. So switching from composites to supe
Also point worth noticing is that there might be at max 8-10 subcolumns
per supercolumn.
I need to write a subcolumn at a time( but always read entire supercolumn
at any time).
On Fri, Dec 30, 2011 at 12:20 AM, Aditya wrote:
> @Edward: Perhaps you missed to notice that I need to alw
or a SliceRange. A ColumnPath takes a
> Single SuperColumn.
>
> I do not see how this is done with Columns or SuperColumns. Maybe you
> can provide a code snippet and/or some sample data?
>
> On 12/29/11, Aditya wrote:
> > @Edward: Perhaps you missed to notice
whereobject_id =
'674813358269'and object_type = 0and object_owner_id = 0;*
Note that I am querying using the java driver.
--
Aditya Shetty
*Lead Engineer*
*M*: +91 7022423545, *T*: 080 46603000 *EXT*: 4417
2nd FLOOR, WEST WING,
SALARPURIA SUPREME, MARATHAHALLI, BENGALURU
Downlo
d reading of the data?
>
> Cheers,
> Perica
>
>
> On Thu, Jun 25, 2015 at 12:12 PM, Aditya Shetty <
> aditya.she...@snapdeal.com> wrote:
>
>> Hi
>>
>> I have a 3 node cassandra cluster with a replication factor of 2. I have
>> a basic column family
lly. Reads of partition ranges (or a range slice, in old Thrift
> terms) don't do read repair.
>
>
> --
> Tyler Hobbs
> DataStax <http://datastax.com/>
>
--
Aditya Shetty
*Lead Engineer*
*M*: +91 7022423545, *T*: 080 46603000 *EXT*: 4417
2nd FLOOR, WEST WING,
S
*
For a discussions forum, I need to show a page of most viewed discussions.
For implementing this, I maintain a count of views of a discussion & when
this views count of a discussion passes a certain threshold limit, the
discussion Id is added to a row of most viewed discussions.
Thi
help minimize several versions of the same column in the row
parts in different SST tables.
On Wed, May 18, 2011 at 11:04 PM, Aditya Narayan wrote:
> *
> For a discussions forum, I need to show a page of most viewed discussions.
>
> For implementing this, I maintain a count
1
>
> Then you just query in the following way :
>
> MGET <http://redis.io/commands/mget> topics:*:timestampN
>
> * is the wildcard, you order by viewcount and you have what you are asking
> for !
> This is a simplified version of what you should do but personnally I r
Is it now possible to store counters in the standard column families along
with non counter type columns ? How to achieve this ?
ved ...
>
> https://issues.apache.org/jira/browse/CASSANDRA-2614
>
> -sd
>
> On Sun, Jul 10, 2011 at 5:04 PM, Aditya Narayan wrote:
> > Is it now possible to store counters in the standard column families
> along
> > with non counter type columns ? How to achieve this ?
>
ff, where as normal CF simply just add or
> replace.
>
>
> On Sun, Jul 10, 2011 at 10:39 PM, Aditya Narayan wrote:
>
>> Thanks for info.
>>
>> Is there any target version in near future for which this has been
>> promised ?
>>
>>
>> On Sun, Jul
Oops that's really very much disheartening and it could seriously impact our
plans for going live in near future. Without this facility I guess counters
currently have very little usefulness.
On Mon, Jul 11, 2011 at 8:16 PM, Chris Burroughs
wrote:
> On 07/10/2011 01:09 PM, Aditya Naray
Thanks Aaron & Chris, I appreciate your help.
With dedicated CF for counters, in addition to the issue pointed by Chris,
the major drawback I see is that I cant read *in a single query* the
counters with the regular columns row which is widely required by my
application.
My use case is like storin
I need to keep the data of some entities in a single CF but split in two
rows for each entity. One row contains an overview information for the
entity & another row contains detailed information about entity. I am
wanting to keep both rows in single CF so they may be retrieved in a single
query whe
..so that I can retrieve them through a single query.
For reading cols from two CFs you need two queries, right ?
On Sat, Oct 29, 2011 at 9:53 PM, Mohit Anchlia wrote:
> Why not use 2 CFs?
>
> On Fri, Oct 28, 2011 at 9:42 PM, Aditya Narayan wrote:
> > I need to keep t
do you mean exactly by "indexing some of the higher levels of data" ?
Thanks you guys!
> Anthony
>
>
> On 28/10/2011, at 21:42 PM, Aditya Narayan wrote:
>
> > I need to keep the data of some entities in a single CF but split in two
> rows for each entity. On
Thanks Zach, Nice Idea !
and what about looking at, may be, some custom caching solutions, leaving
aside cassandra caching .. ?
On Sun, Oct 30, 2011 at 2:00 AM, Zach Richardson <
j.zach.richard...@gmail.com> wrote:
> Aditya,
>
> Depending on how often you have to write to th
Yes that would be pretty nice feature to see!
On Mon, Oct 31, 2011 at 10:45 PM, Ertio Lew wrote:
> Thanks so much SebWajam for this great piece of work!
>
> Is there a way to set a data type for displaying the column names/ values
> of a CF ? It seems that your project always uses String Seri
I am concatenating two Integer ids through bitwise operations(as described
below) to create a single primary key of type long. I wanted to know if
this is a good practice. This would help me in keeping multiple rows of an
entity in a single column family by appending different extensions to the
en
10:11 AM, Tyler Hobbs wrote:
> On Thu, Nov 3, 2011 at 3:48 PM, Aditya Narayan wrote:
>
>> I am concatenating two Integer ids through bitwise operations(as
>> described below) to create a single primary key of type long. I wanted to
>> know if this is a good practice. This w
Would it be recommended to store the profile pics of users on an
application in Cassandra ? Or file system would be a better way to go. I
came across an interesting paper which advocates storing in DB for blobs
sized up to 1 MB. I was planning to store the image bytes in the same row
that contained
just forgot to add the paper link if this is useful at all : To BLOB or Not
To BLOB: Large Object Storage in a Database or a
Filesystem<http://research.microsoft.com/apps/pubs/default.aspx?id=64525>
On Sat, Nov 12, 2011 at 12:34 AM, Aditya Narayan wrote:
> Would it be recommended to
Hi
I need to add 'search users' functionality to my application. (The trigger
for fetching searched items(like google instant search) is made when 3
letters have been typed in).
For this, I make a CF with String type keys. Each such key is made of first
3 letters of a user's name.
Thus all names
Any insights on this ?
On Tue, Nov 15, 2011 at 9:40 PM, Quintero wrote:
>
>
> Aditya Narayan wrote:
>
> >Hi
> >
> >I need to add 'search users' functionality to my application. (The trigger
> >for fetching searched items(like google instant search
with multiple users of same name. Thus super
columns seem to fit appropriately but I really want to avoid them since
they are seriously discouraged by everyone.
On Wed, Nov 16, 2011 at 3:19 AM, Ben Gambley wrote:
> Hi Aditya
>
> Not sure the best way to do in Cassandra but have you c
I would have multple userIds(of users named "Marcos") to be
>>> stored inside columnname "cos" under key "Mar". Thus,
>>> >
>>> > 1. Supercolumn seems to be a better fit for my use case(so that ids of
>>> users with same name
Regarding the first option that you suggested through composite columns,
can I store the username & id both in the column name and keep the column
valueless?
Will I be able to retrieve both the username and id from the composite col
name ?
Thanks a lot
On Wed, Nov 16, 2011 at 10:56 AM, Ad
I just installed Cassandra on my ubuntu server by adding the following to
the sources list:
deb http://www.apache.org/dist/cassandra/debian 10x main
deb-src http://www.apache.org/dist/cassandra/debian 10x main
Soon after install I started getting OOM errors & then the server became
unresponsive.
decrease heap size.
> Update MAX_HEAP_SIZE and NEW_HEAP_SIZE and restart.
>
> From iPhone
>
>
> On 2012/02/18, at 20:40, Aditya Gupta wrote:
>
> I just installed Cassandra on my ubuntu server by adding the following to
> the sources list:
>
> deb http://www.apac
For clarification, I'm running Cassandra on a VMware ubuntu server instance.
On Sat, Feb 18, 2012 at 6:44 PM, Aditya Gupta wrote:
> Thanks! But what about the 100% cpu consumption that is causing the server
> to hang?
>
>
> On Sat, Feb 18, 2012 at 6:19 PM, Watanabe Maki w
No data at all. just a fresh installation
On Sat, Feb 18, 2012 at 6:57 PM, R. Verlangen wrote:
> You might want to check your Cassandra logs, they contain important
> information that might lead you to the actual cause of the problems.
>
> 2012/2/18 Aditya Gupta
>
>> Thank
Am I installing it the right way ? While installing I didn't verify the
signatures using public key.
On Sat, Feb 18, 2012 at 8:21 PM, Aditya Gupta wrote:
> No data at all. just a fresh installation
>
>
> On Sat, Feb 18, 2012 at 6:57 PM, R. Verlangen wrote:
>
>> Yo
ckups on EC2
> yesterday and found: http://wiki.apache.org/cassandra/FAQ#ubuntu_hangs
>
>
> On 18/02/2012 14:58, Aditya Gupta wrote:
>
> Am I installing it the right way ? While installing I didn't verify the
> signatures using public key.
>
> On Sat, Feb 18, 2012 at 8
Is there anything to do with running cassandra on a VMware ubuntu instance
!? I am trying Cassandra on VMware ubuntu server instance.
On Sun, Feb 19, 2012 at 2:45 AM, Aditya Gupta wrote:
> In my case, after installing cassandra, as soon as the server is started
> it hangs on it own (t
@Tamar: Thanks a ton! works perfect now!
On Mon, Feb 20, 2012 at 12:02 PM, Tamar Fraenkel wrote:
> On 19/02/2012 21:55, Aditya Gupta wrote:
>
> Is there anything to do with running cassandra on a VMware ubuntu instance
> !? I am trying Cassandra on VMware ubuntu server instance.
&g
I am about to choose a linux distro to be installed on Cassandra nodes.
Which are the most popular & recommended ones by Cassandra community? (Not
interested in paying licensing fees)
ormance.
> > However I read something about bugs with a certain Ubuntu version in
> > combination with a specific Cassandra version.
> >
> > 2012/2/21 Rishabh Agrawal
> >>
> >> Hello Aditya,
> >>
> >>
> >>
> >> I am
While creating a multinode cluster, my nodes are unable to identify all the
nodes in the cluster.
Only the *last added' node is visible when I do:
./nodetool -h localhost ring
I am trying to create a 4 nodes cluster. On starting the seed node, the
above command shows just itself(ok.. good), the
25, 2012 at 9:48 AM, Aditya Gupta wrote:
> > While creating a multinode cluster, my nodes are unable to identify all
> the
> > nodes in the cluster.
> > Only the *last added' node is visible when I do:
> > ./nodetool -h localhost ring
> >
> >
> > I
system keyspace. In any
> case, some nodetool ring output at each node addition will clarify
> this.
>
> -Brandon
>
> On Sat, Feb 25, 2012 at 1:20 PM, Aditya Gupta wrote:
> > Nope, I just re-verified :)
> > I have split up the range into 4 parts for 4 nodes. I have spec
issue ?
On Sun, Feb 26, 2012 at 2:49 AM, Aditya Gupta wrote:
> yes, exactly I did cloned a single VMware machine to make other
> instances.. so how do I correct this now ?
>
>
> On Sun, Feb 26, 2012 at 2:18 AM, Brandon Williams wrote:
>
>> Then my next guess is you clone
Thanks removing the system keyspace worked! Thanks!
On Sun, Feb 26, 2012 at 3:39 AM, Brandon Williams wrote:
> On Sat, Feb 25, 2012 at 3:39 PM, Aditya Gupta wrote:
> > The output of nodetool ring after each addition of nodes, make just the
> last
> > added node visible in th
I'm running a 4 nodes cassandra cluster of VMware ubuntu instances each
768MB memory (on a single machine for development purposes). I need to
reduce heap size appropriately as my nodes have been crashing at times
with OOMs. How do I configure for this ? I think I would need to make some
tweaks wi
I ran nodetool decommission on a node from a 4 nodes cluster as follows:
nodetool -h 192.168.26.130 decommission
But it never completes & doesn't show any messages, on seeing the logs
I noticed:
java.util.concurrent.RejectedExecutionException: ThreadPoolExecutor has
shut down
at
org.apache.cass
or just a standard
column family containing "all the subcolumns data serialized in single
column(s) " ?
Thanks
Aditya Narayan
of tags associated
with particular reminder. All tags set at once during first write. The
no of tags(subcolumns) will be around 8 maximum.
Any comments, suggestions and feedback on the schema design are requested..
Thanks
Aditya Narayan
On Wed, Feb 2, 2011 at 7:49 PM, Aditya Narayan wrote:
>
ws of the reminder details would be picked up.."
Is supercolumn a preferable choice for this ? Can there be a better
schema than this ?
-Aditya Narayan
On Wed, Feb 2, 2011 at 8:54 PM, William R Speirs wrote:
> To reiterate, so I know we're both on the same page, your schema would be
ers) for a standard SQL/relational model, then it's
> probably too much for a single row.
>
> I'm not familiar with the TTL functionality of Cassandra... sorry cannot
> help/comment there, still learning :-)
>
> Yea, my $0.02 is that this is an effective way to lev
standard type column family.
Thanks
-Aditya Narayan
On Wed, Feb 2, 2011 at 10:11 PM, William R Speirs wrote:
> I did not understand before... sorry.
>
> Again, depending upon how many reminders you have for a single user, this
> could be a long/wide row. Again, it really comes down
Can I have some more feedback about my schema perhaps somewhat more
criticisive/harsh ?
Thanks again,
Aditya Narayan
On Wed, Feb 2, 2011 at 10:27 PM, Aditya Narayan wrote:
> @Bill
> Thank you BIll!
>
> @Cassandra users
> Can others also leave their suggestions and comments
Thanks Tyler!
On Thu, Feb 3, 2011 at 12:06 PM, Tyler Hobbs wrote:
> On Wed, Feb 2, 2011 at 3:27 PM, Aditya Narayan wrote:
>>
>> Can I have some more feedback about my schema perhaps somewhat more
>> criticisive/harsh ?
>
> It sounds reasonable to me.
>
> Since
timeline in the order of their due time.)
Basically I am trying to avoid 16 bytes long timeUUID first because
they are too long and the above defined key pattern is guaranteeing me
a unique key/Id for the reminder row always.
Thanks
Aditya Narayan
perhaps not aware of ?
On Thu, Feb 3, 2011 at 5:43 PM, Sylvain Lebresne wrote:
> On Thu, Feb 3, 2011 at 11:27 AM, Aditya Narayan wrote:
>>
>> Hey all,
>>
>> I want to store some columns that are reminders to the users on my
>> application, in time sorted order
perhaps not aware of ?
On Thu, Feb 3, 2011 at 5:43 PM, Sylvain Lebresne wrote:
> On Thu, Feb 3, 2011 at 11:27 AM, Aditya Narayan wrote:
>>
>> Hey all,
>>
>> I want to store some columns that are reminders to the users on my
>> application, in time sorted order
Is there any way to sort the columns named as integers in the descending order ?
Regards
-Aditya
I am also looking to possible solutions to store pdfs & word documents.
But why wont you store in them in the filesystem instead of a database
unless your files are too small in which case it would be recommended
to use a database.
-Aditya
On Fri, Feb 4, 2011 at 5:30 PM, Daniel Doubl
yes, definitely a database for mapping ofcourse!
On Fri, Feb 4, 2011 at 11:17 PM, buddhasystem wrote:
>
> Even when storage is in NFS, Cassandra can still be quite useful as a file
> catalog. Your physical storage can change, move etc. Therefore, it's a good
> idea to provide mapping of logical n
omparator type (ReversedIntegerType) to sort those longs in
descending order.
Regards
Aditya
On Sat, Feb 5, 2011 at 6:24 AM, aaron morton wrote:
> IMHO If you know the time of the event use store the time as a long, rather
> than a UUID. It will make it easier to get back to a
> time and
Does huge variation in no. of columns in rows, over the column family
has *any* impact on the performance ?
Can I have like just 100 columns in some rows and like hundred
thousands of columns in another set of rows, without any downsides ?
Thanks for the detailed explanation Peter! Definitely cleared my doubts !
On Mon, Feb 7, 2011 at 1:52 PM, Peter Schuller
wrote:
>> Does huge variation in no. of columns in rows, over the column family
>> has *any* impact on the performance ?
>>
>> Can I have like just 100 columns in some rows a
How can I get or calculate the size of rows/ columns ? what are the
any overheads on memory for each column/row ?
the asked
column names exist in indexes for that row/key & they dont need to
deserialize the blocks in SST tables searching for column values. Am I
thinking right way??
-Aditya
On Fri, Feb 11, 2011 at 1:54 AM, Aaron Morton wrote:
> If you want to get the byte size of a particular row you w
What if the caching requirements, sorting needs of two kind of data
are very much similar, is it preferable to go with a single CF in
those cases ?
Regards
Aditya
>>> On Sat, Feb 5, 2011 at 10:43 AM, Tyler Hobbs wrote:
>>>>>>
>>>>>> I read somew
Any comments/view points on this?
--On Sat, Feb 12, 2011 at 5:05 PM, Aditya Narayan wrote:
What if the caching requirements, sorting needs of two kind of data
are very much similar, is it preferable to go with a single CF in
those cases ?
Regards
Aditya
>
> >>> On Sat, Fe
ld use the column names
list for this SliceQuery operation).
Thanks
Aditya
On Sun, Feb 13, 2011 at 8:41 PM, Jonathan Ellis wrote:
> On Sun, Feb 13, 2011 at 12:37 AM, E S wrote:
> > I've gotten myself really confused by
> > http://wiki.apache.org/cassandra/ArchitectureInt
to be sequential on disk). So if the columns you ask for are
> really randomly distributed, then yes, the biggest the row is, the biggest
> the chance is to have to hit many blocks and the biggest the chance is for
> these block to be far apart on disk.
>
> --
> Sylvain
>
> On
Thanks for the clarifications..
On Mon, Feb 14, 2011 at 6:13 PM, Sylvain Lebresne wrote:
> On Mon, Feb 14, 2011 at 11:27 AM, Aditya Narayan wrote:
>
>> Thanks Sylvain,
>>
>> I guess I might have misunderstood the meaning of column_index_size_in_kb,
>> My previou
Does it make any difference if I split a row, that needs to be
accessed together, into two or three rows and then read those multiple
rows ??
(Assume the keys of all the three rows are known to me programatically
since I split columns by certain categories).
Would the performance be any better if a
so a
> single row read gets what you need.
>
> Aaron
>
> On 24/02/2011, at 5:59 AM, Aditya Narayan wrote:
>
>> Does it make any difference if I split a row, that needs to be
>> accessed together, into two or three rows and then read those multiple
>> rows
What would be a good strategy to store large text content/(blog posts
of around 1500-3000 characters) in cassandra? I need to store these
blog posts along with their metadata like bloggerId, blogTags. I am
looking forward to store this data in a single row giving each
attribute a single column. So
t; try one and be prepared to change.
>
> Note that counters are only in the 0.8 trunk and are still under development,
> they are not going to be released for a couple of months.
>
> Your per column data size is nothing to be concerned abut.
>
> Hope that helps.
> Aaron
>
My application displays list of several blogs' overview data (like
blogTitle/ nameOfBlogger/ shortDescrption for each blog) on 1st page (in
very much similar manner like Digg's newsfeed) and when the user selects a
particular blog to see., the application takes him to that specific blog's
full pag
CF2 as well (use a batch_mutation
> through whatever client you have). So when serving the second page you only
> need to read one row from CF2.
>
>
> Aaron
>
> On 8/03/2011, at 8:13 PM, Norman Maurer wrote:
>
> Yeah this make sense as far as I can tell.
>
>
> Bye,
, since Cassandra will have to read so many
versions of the same column. If this is just replacement with old
column then I guess read will be much better since it needs to see
just single existing version of column.
Thanks
Aditya Narayan
d the reconciliation of that happens
> during read (read repair). This is why reads are slower than writes because
> conflict resolution happens during read.
>
> Hope this answers the question!
>
> Thanks,
> -Naren
>
> On Tue, Mar 8, 2011 at 10:44 PM, Aditya Narayan wrote:
&
Also, your Mapper class needs to look like this:
MyMapper extends Mapper,Text,SumWritable> ... with all the necessary fixes to the map method.
AD
-Original Message-
From: Jonathan Ellis [mailto:jbel...@gmail.com]
Sent: Wednesday, November 10, 2010 8:40 AM
To: user
Subject: Re: WordCount
My bad. Moved to Cassandra user list.
-Original Message-
From: Aditya Muralidharan [mailto:aditya.muralidha...@nisc.coop]
Sent: Wednesday, November 10, 2010 10:48 AM
To: u...@pig.apache.org
Subject: RE: MapReduce/Hadoop in cassandra 0.7 beta3
Hi,
I'm still getting the error assoc
Is it possible for BinaryMemtable RowMutations to a ColumnFamily with
supercolumns?
From: Aditya Muralidharan [mailto:aditya.muralidha...@nisc.coop]
Sent: Thursday, November 11, 2010 3:27 PM
To: user@cassandra.apache.org
Subject: Cassandra 0.7 beta3 BinaryMemtable and Supercolumns
Is it possible for BinaryMemtable RowMutations to a ColumnFamily with
supercolumns?
lumns
Before you dig into that, are you sure normal Thrift inserts are not
fast enough?
On Thu, Nov 11, 2010 at 4:41 PM, Aditya Muralidharan
wrote:
> Pretty sure I could ask that better:
>
>
>
> Is it possible for me to perform RowMutations on BinaryMemtable for a
> ColumnFamily of
I've run into this as well. Having confirmed that there are no handles on the
file (it's only ever created and used by Cassandra), and having stepped through
the code, I've concluded that something in the io (not sure if it's the jvm or
the os) stack is lazy about releasing the file handle for '
89 matches
Mail list logo