Re: Thoughts on adding complex queries to Cassandra

2010-05-27 Thread Vick Khera
On Thu, May 27, 2010 at 9:50 AM, Jonathan Ellis wrote: > There definitely seems to be demand for something like this.  Maybe for 0.8? > The Riak data store has something like this: you can submit queries (and map reduce jobs) written in javascript that run on the data nodes using data local to th

Re: get() with TTL update?

2010-05-25 Thread Vick Khera
On Mon, May 24, 2010 at 4:53 PM, Jonathan Ellis wrote: > (a) cassandra does not use update-in-place storage so doing the update > as part of the get call isn't much of an efficiency gain If you could issue an "update" type of command, any other data needed for the new copy of the object could be

Re: Hadoop over Cassandra

2010-05-17 Thread Vick Khera
On Mon, May 17, 2010 at 3:46 PM, Jonathan Ellis wrote: > Moving to the user@ list. > > http://wiki.apache.org/cassandra/HadoopSupport should be useful. That document doesn't really answer the "is data locality preserved" when running the map phase, but my hunch is "no". > > On Mon, May 17, 2010

Re: Cassandra training on May 21 in Palo Alto

2010-05-11 Thread Vick Khera
On Fri, May 7, 2010 at 6:56 AM, Matt Revelle wrote: > Reston, VA is a good spot in the DC metro area for tech events. +1

Re: performance tuning - where does the slowness come from?

2010-05-06 Thread Vick Khera
On Thu, May 6, 2010 at 2:05 PM, Weijun Li wrote: > Anyway, for mmap, in order for you to access the data in the buffer or > virtual address, OS has to read/page in the data to a block of physical > memory and assign your virtual address to that physical memory block. So if > you use random partiti

Re: performance tuning - where does the slowness come from?

2010-05-06 Thread Vick Khera
On Thu, May 6, 2010 at 1:06 PM, Weijun Li wrote: > In this case using mmap will cause Cassandra to use sometimes > 100G virtual > memory which is much more than the physical ram, since we are using random > partitioner the OS will be busy doing swap. mmap uses the virtual address space to referen

Re: How to initialize the Cassandra

2010-05-06 Thread Vick Khera
On Thu, May 6, 2010 at 10:39 AM, Dop Sun wrote: > Basically, my question is whether there is a tool, like json2sstable, which > can import some data in a format which can be easily manual created. I don't > know the experience of other people, but what I'm currently doing is writing > a small appl

Re: performance tuning - where does the slowness come from?

2010-05-06 Thread Vick Khera
On Wed, May 5, 2010 at 8:08 PM, Kyusik Chung wrote: > if the data from the sstables hasnt already been loaded into memory by mmap, > load it into memory; if you're out of memory on the box, swap some of the > old mmapped data out of memory mmap() does not copy your data into memory; it maps your

Re: Cassandra training on May 21 in Palo Alto

2010-05-04 Thread Vick Khera
On Tue, May 4, 2010 at 8:50 PM, Jonathan Ellis wrote: > Yes, although when and where are TBD. > Having it the day before/after Velocity conference at the end of June would be ideal (hint, hint). I'm sure a lot of people with interest in Cassandra will be in the area.

Re: performance tuning - where does the slowness come from?

2010-05-04 Thread Vick Khera
On Tue, May 4, 2010 at 2:57 PM, Ran Tavory wrote: > I'm also interested in a way to cap mmap so I can take advantage of it but > not swap the host to death... > Isn't the point of mmap() to just directly access a file as if it were memory? I can see how it would fool the reporting tools into thi

Re: Re: Modelling assets and user permissions

2010-04-20 Thread Vick Khera
On Tue, Apr 20, 2010 at 1:37 PM, tsuraan wrote: > The assets are binary files on a document tracking system.  Our > current platform is postgres-backed; the entire system we've written > is fairly easily distributed across multiple computers, but postgres > isn't.  There are reliable databases tha

Re: question about deleting from cassandra

2010-03-18 Thread Vick Khera
On Thu, Mar 18, 2010 at 9:15 AM, Bill Au wrote: > In theory there is a breaking point somewhere, right? I don't think google has hit it yet, so I'd have to say nobody has reached "the breaking point" yet What do the big places do when people quit the service? Ie, if I close my facebook or t

Re: Cassandra Demo/Tutorial Applications

2010-03-14 Thread Vick Khera
On Sat, Mar 13, 2010 at 1:46 AM, Jonathan Ellis wrote: > I'd probably go with "virtualbox images" as being simpler for people > who don't have an AWS key already.  (VB can read vmware player images, > i think.  But there is no free vmware for OS X, so you'd want to check > that before going w/ vmw