GC Exceptions and cluster nodes are dying

2010-12-01 Thread asakka
Hello, I'm making some tests on a data model with 3 CF and 1 SCF, I want to start by inserting 1 million rows (my target is to have 1billion rows) . I have three nodes cluster (I'm using the same machines with 3GB of RAM each , intel core2 duo 1,6GHZ), RF = 2, CL = 1, HEAPSIZE of the seed = 3

Is there any way to store muti-version data based on the timestamp?

2010-12-01 Thread zhen ye
Hi, dear Guys I did some test to see if cassandra can store muti-version of the same data, but from the below test code seems it only can store one version's data, which is different from HBase. Can somebody help to confirm this? It will be very appreciative if some one are kindly enough to give

Re: Is there any way to store muti-version data based on the timestamp?

2010-12-01 Thread Daniel Lundin
> I did some test to see if cassandra can store muti-version of the same > data, but from the below test code seems it only can store one > version's data, which is different from HBase. > Can somebody help to confirm this? Correct. Unlike BigTable and HBase, Cassandra columns don't have a version

Re: When to call the major compaction ?

2010-12-01 Thread Ying Tang
1. So posterior to 0.6.6/0.7 , minor compaction and major compaction both can clean out rows 'tagged' tombstones , this kind of clean out doesn't mead remove it from the disk permanently. The real remove is done by the jvm GC ? 2. The intence of compaction is merging multi sstables into one ,

Re: Can not connect to cassandra 0.7 using CLI

2010-12-01 Thread Ying Tang
try bin/cassandra-cli --host On Wed, Dec 1, 2010 at 7:29 PM, Joshua Partogi wrote: > Hi there, > > I just downloaded cassandra 0.7rc1. I started it using bin/cassandra > without making any configuration changes. > > I then tried to connect using the CLI with command like this: > > f...@ubuntu:~

Re: When to call the major compaction ?

2010-12-01 Thread Ying Tang
I'm confused , plz ingore the mail above. Here is my confusion , posterior to 0.6.6/0.7 , minor compaction and major compaction both can clean out rows 'tagged' tombstones , and generate a new , without tombstones , sstable . And the tombstones remains in memory ,waiting to be removed by

Re: Can not connect to cassandra 0.7 using CLI

2010-12-01 Thread Brayton Thompson
All of the times I have had similar issues the problem has always been misconfigured iptables. You said it was running fine on 0.6.8 though? On the same box or a different box? On Dec 1, 2010, at 6:29 AM, Joshua Partogi wrote: > Hi there, > > I just downloaded cassandra 0.7rc1. I started it usi

Sorted Integer -> UUID

2010-12-01 Thread Benjamin Waldher
I have a fairly simple problem that might require a complicated solution. I need to store Integer -> UUID in a column family, and be able to query (and then paginate) the rows ordered by the integer in descending order. This is simple enough if no two rows have the same integer, as the integer

Re: When to call the major compaction ?

2010-12-01 Thread Nick Bailey
The part about gc refers to old sstable files on disk. After a compaction, the old files on disk will be deleted when garbage collection happens. On Wed, Dec 1, 2010 at 7:31 AM, Ying Tang wrote: > I'm confused , plz ingore the mail above. > Here is my confusion , >posterior to 0.6.6/0.7 , m

Re: When to call the major compaction ?

2010-12-01 Thread Chen Xinli
2010/12/1 Ying Tang > I'm confused , plz ingore the mail above. > Here is my confusion , >posterior to 0.6.6/0.7 , minor compaction and major compaction both > can clean out rows 'tagged' tombstones , and generate a new , without > tombstones , sstable . > This is right. > And the t

Re: TheGC inspector's frequency

2010-12-01 Thread Jonathan Ellis
The key statement: if (gcw.getDuration() > MIN_DURATION_TPSTATS) { logStats(); } On Wed, Dec 1, 2010 at 2:44 AM, Ying Tang wrote: > The GCInspector's start() method , > In this method , > StorageService.scheduledTasks.scheduleWithFixedDelay(t,

Re: Sorted Integer -> UUID

2010-12-01 Thread Daniel Lundin
Unless I misunderstand the Q, composing the column names with the row keys and merging the resulting would yield something useful. keyA => (1, uuid), (2, uuid), (3, uid) keyB => (1, uuid), (2, uuid), (3, uid) Should be transformed into: (1, keyA, uuid), (1, keyB, uuid), (2, keyA, uuid), (2,

Re: [RELEASE] 0.7.0 rc1

2010-12-01 Thread Olivier Rosello
> FYI, 0.7.0~rc1 debs are available in a new PPA for experimental > releases: > > http://launchpad.net/~cassandra-ubuntu/+archive/experimental > It seems there is a dependancy on libjets3t-java Is it really needed ? This dependancy cannot be resolved on Ubuntu Lucid :-(

Re: C++ client for Cassandra

2010-12-01 Thread Jonathan Ellis
There is https://github.com/posulliv/libcassandra, but I think it's 0.6 only atm. On Wed, Dec 1, 2010 at 12:13 AM, Narendra Sharma wrote: > Are there any C++ clients out there similar to Hector (in terms of features) > for Cassandra? I am looking for C++ Client for Cassandra 0.7. > > Thanks, > Na

Re: C++ client for Cassandra

2010-12-01 Thread David Replogle
And I've contacted Padraig and he has no intention of upgrading to 0.7. I'm working heavily in C++ and Cassandra, though, so hopefully I can contribute in some way eventually. I may be able to help a little bit with C++ and Cassandra if you're totally stuck, but I'm basically just using thrift,

Re: Is there any way to store muti-version data based on the timestamp?

2010-12-01 Thread Ed Anuff
If you go this route, be sure to take a look at the custom column comparator I wrote to make this sort of thing easier: https://github.com/edanuff/CassandraCompositeType On Wed, Dec 1, 2010 at 4:56 AM, Daniel Lundin wrote: > You could also use a standard column family, composing the version > i

How to shutdown Cassandra?

2010-12-01 Thread rambabu pakala
Hi,   Can someone please let me know how to shutdown Cassandra on Windows Environment?   stop-server is actually kiing the Cassandra Server and I was unable to create/get a pid file for the Cassandra process. Are there any setup steps that are needed that I am missing?   Thanks, -Ram.

Re: Is there any way to store muti-version data based on the timestamp?

2010-12-01 Thread Robert Coli
On 12/1/10 4:56 AM, Daniel Lundin wrote: Correct. Unlike BigTable and HBase, Cassandra columns don't have a version dimension. Timestamp is used for (crude) conflict resolution, and older versions are always overwritten. I would be careful with the word "overwritten" here as it obfuscated the im

How to shutdown Cassandra on Ubuntu?

2010-12-01 Thread Melton Low
I am just starting to play with Cassandra. My environment is Ubuntu Lucid 10.04 and latest Cassandra stable 0.6.8 binaries. I am unclear on how to shut down the Cassandra server. Documentation is quite clear on starting up the server. I didn't find anything useful on the user mailing list archiv

Re: How to shutdown Cassandra on Ubuntu?

2010-12-01 Thread Rafał Krupiński
On Wed, Dec 1, 2010 at 18:39, Melton Low wrote: > I am just starting to play with Cassandra.  My environment is Ubuntu Lucid > 10.04 and latest Cassandra stable 0.6.8 binaries. > I am unclear on how to shut down the Cassandra server. Documentation is > quite clear on starting up the server.  I did

Re: C++ client for Cassandra

2010-12-01 Thread Chris Trimble
Are there any that compile on Windows without the need for linking in cygwin? C On Tue, Nov 30, 2010 at 10:16 PM, sharanabasava raddi wrote: > Thrift is there.. > > > On Wed, Dec 1, 2010 at 11:43 AM, Narendra Sharma < > narendra.sha...@gmail.com> wrote: > >> Are there any C++

Re: C++ client for Cassandra

2010-12-01 Thread Adi
You can look at this patch. It has a patched version for thrift revision 818530. You will need to apply the patch to the thrift version which your cassandra release is using. It is thrift revision 917130 for the later releases of the 0.6 branch not sure about 0.7. https://issues.apache.org/jira/br

Data Model Question

2010-12-01 Thread Pablo D. Salgado
Hello, I need to store "products" data (product.name, product.price, product.state and product.owner) in Cassandra 0.7 rc1. The problem is that I need to get "products" where product.price > XX AND product.price < XX AND product.name = XXX AND product.state = XXX. Also I need return the products

Re: How to shutdown Cassandra?

2010-12-01 Thread Aaron Morton
There is no "shutdown" command in cassandra, it's designed to be stopped by killing it. The pid file is created by the *nix scripts and is not supported by the cassandra.bat file for windows. There have been a couple of discussions on running cassandra as a service under windows...http://www.mail-a

Range Queries in RP on SCF in 0.7 with UUID SCs

2010-12-01 Thread Frank LoVecchio
Is it possible to perform paginated queries using Random Partitioner in 0.7 with Super Column Families whose Super Columns are UUID's? I don't believe it is, based on this article: http://ria101.wordpress.com/2010/02/22/cassandra-randompartitioner-vs-orderpreservingpartitioner, and my attempts wit

Re: GC Exceptions and cluster nodes are dying

2010-12-01 Thread Aaron Morton
Running nodes with different JVM heap sizes would not be recommended practice, for many reasons. Nor would I recommend running them with all the memory the machine has, it will just lead to the OS swapping the JVM out to disk and considerable slow things down.I would suggest a heap size of 1.5 or 2

Re: Can not connect to cassandra 0.7 using CLI

2010-12-01 Thread Joshua Partogi
Hi Brayton. Thanks for the reply. It was running find on 0.6.8 on the same box. Kind regards, Joshua On Thu, Dec 2, 2010 at 1:05 AM, Brayton Thompson wrote: > All of the times I have had similar issues the problem has always been > misconfigured iptables. You said it was running fine on 0.6.8 t

Re: Sorted Integer -> UUID

2010-12-01 Thread Aaron Morton
Could you use a Super CF?Super Col name is the Integer, and the Col Names are the UUID. Not sure what your col values are or your key. There are some limitations to Super CF but I do not think they would apply in this case http://wiki.apache.org/cassandra/CassandraLimitationsYou can the slice the s

Re: Can not connect to cassandra 0.7 using CLI

2010-12-01 Thread Aaron Morton
Take a look at your cassandra.yaml file at the rpc_address this is the address it's listening to connections on. The comments there should help, if you set it to 0.0.0.0 it will bind to all interfaces.  Probably not what you want in production but handy for dev.Hope that helps. AaronOn 02 Dec, 2010

Re: Can not connect to cassandra 0.7 using CLI

2010-12-01 Thread Joshua Partogi
It is set to localhost I didn't change it and it is the same as configured in 0.6.8. Why doesn't it work out of the box? Thanks heaps. On Thu, Dec 2, 2010 at 7:49 AM, Aaron Morton wrote: > Take a look at your cassandra.yaml file at the rpc_address this is the > address it's listening to connecti

Re: Range Queries in RP on SCF in 0.7 with UUID SCs

2010-12-01 Thread Aaron Morton
The Partitioner applies to the row keys, not the columns. Their order is determined by the compare_with and compare_subcolumns_with CF settings So where you say "get the last 25 inserts for a key" I'm translating that into "get the most recent 25 super columns for a row, where the super column name

Re: Range Queries in RP on SCF in 0.7 with UUID SCs

2010-12-01 Thread Frank LoVecchio
Hey Aaron, Yes, in regards to SCF definition, you are correct: name: Sensor column_type: Super compare_with: TimeUUIDType gc_grace_seconds: 864000 keys_cached: 1.0 read_repair_chance: 1.0 rows_cached: 0.0 I'm not quite sure I follow you, though, as

[no subject]

2010-12-01 Thread Moldován Eduárd
unsubscribe

unsubscribe

2010-12-01 Thread Moldován Eduárd
unsubscribe

Re: Solr DataImportHandler (DIH) and Cassandra

2010-12-01 Thread Aaron Morton
Try the solr source code. AaronOn 30 Nov, 2010,at 01:37 PM, Mark wrote: The DataSource subclass route is what I will probably be interested in. Are there are working examples of this already out there? On 11/29/10 12:32 PM, Aaron Morton wrote: AFAIK ther

thrift error

2010-12-01 Thread Michael Fortin
Hello, I'm trying to insert a super column but I can't get passed this error. the error: InvalidRequestException(why:column name must not be empty) at org.apache.cassandra.thrift.Cassandra$insert_result.read(Cassandra.java:14408) at org.apache.cassandra.thrift.Cassandra$Client

Re: Range Queries in RP on SCF in 0.7 with UUID SCs

2010-12-01 Thread Aaron Morton
When you say "I want to get rows starting from a Super Column..." it's a bit confusing. Do you want to get super columns from a single row, or multiple rows? I'm assuming you are talking about getting columns from a single row / key as that's what your code does.For the pelops code, it looks OK but

Re: How to shutdown Cassandra?

2010-12-01 Thread rambabu pakala
Hi Aaron,   Thanks, that helped and it works. --- On Wed, 12/1/10, Aaron Morton wrote: From: Aaron Morton Subject: Re: How to shutdown Cassandra? To: user@cassandra.apache.org Date: Wednesday, December 1, 2010, 12:18 PM There is no "shutdown" command in cassandra, it's designed to be stoppe

Re: Range Queries in RP on SCF in 0.7 with UUID SCs

2010-12-01 Thread Frank LoVecchio
Actually, it was a class issue at this line: System.*out*.println("NAME: " + UUID.*nameUUIDFromBytes*(col.getName())); The native Pelops class timeUuidHelper is what should be used. On Wed, Dec 1, 2010 at 4:16 PM, Aaron Morton wrote: > When you say "I want to get rows starting from a Super Colu

Re: thrift error

2010-12-01 Thread Tyler Hobbs
Is there a particular reason why you're not using a high level client? http://wiki.apache.org/cassandra/ClientOptions Raw thrift is painful in many ways. - Tyler On Wed, Dec 1, 2010 at 5:06 PM, Michael Fortin wrote: > Hello, > > I'm trying to insert a super column but I can't get passed this

OutOfMemory exceptions w/ Cassandra 0.6.8

2010-12-01 Thread Aram Ayazyan
Hi, We have a small cluster of 3 Cassandra servers running w/ full replication. Every once in a while we get an OutOfMemory exception and have to restart servers. Sometimes just restarting doesn’t do it and we have to clean the commitlog or data directory. We are running Cassandra 0.6.8. There is

Re: Range Queries in RP on SCF in 0.7 with UUID SCs

2010-12-01 Thread Dan Washusen
Using the methods on the Bytes class would be preferable. The byte[] related methods on UuidHelper should have been deprecated with the Bytes class was introduced... e.g. new Bytes(col.getName()).toUuid() Cheers, Dan On Thu, Dec 2, 2010 at 10:26 AM, Frank LoVecchio wrote: > Actually, it was a

Re: thrift error

2010-12-01 Thread Aaron Morton
Try turning up the logging on the server side to DEBUG and see what it says. Chances are you are not sending what you think you are. Or if you feel like it put a breakpoint in o.a.c.thrift.Cassandra$Client.send_insert to see how when the client is doing. I agree with Tyler, higher level clients are

Re: OutOfMemory exceptions w/ Cassandra 0.6.8

2010-12-01 Thread Aaron Morton
Do you have a log message for the OOM? And some GC messages around it? Have you tried watching the server with jconsole?Is the OOM happening on system start or after it's been running ? Or both?Do you have any row/key caches? Cannot remember but is 0.6* has this but have you enabled the save cache

Re: OutOfMemory exceptions w/ Cassandra 0.6.8

2010-12-01 Thread Aram Ayazyan
Hi Aaron, OOM is happening both after the system has been running for a while as well as when I restart it afterwards. The only way to make it run after it has crashed, is to remove everything from data and commitlog directories. Unfortunately I don't have the original log from when cassandra cras

Re: OutOfMemory exceptions w/ Cassandra 0.6.8

2010-12-01 Thread Aram Ayazyan
Regarding caches, I haven't explicitly enabled them and the "saved_caches" directory is empty. -Aram On Wed, Dec 1, 2010 at 5:05 PM, Aram Ayazyan wrote: > Hi Aaron, > > OOM is happening both after the system has been running for a while as > well as when I restart it afterwards. The only way to

Re: OutOfMemory exceptions w/ Cassandra 0.6.8

2010-12-01 Thread Jonathan Ellis
Stack trace looks like an OS-level thread limit causing problems, not actually memory. On Wed, Dec 1, 2010 at 7:05 PM, Aram Ayazyan wrote: > Hi Aaron, > > OOM is happening both after the system has been running for a while as > well as when I restart it afterwards. The only way to make it run > a

cassandra version update and my cluster

2010-12-01 Thread Nick Santini
Hi, this is an hypothetical situation highly likely to happen: I have a cassandra 0.7 cluster, filled with production data, and I want to upgrade cassandra to the 0.8 version (and so on as new versions get released) what happen with my data and keyspace / column families definitions? whats the pr

Re: When to call the major compaction ?

2010-12-01 Thread Ying Tang
@Chen Xinli "and mark old sstables as deleted which will be deleted while jvm gc." SSTable is on the harddisk , how could jvm gc delete it ? JVM GC is in charge the using of the space in the memory. @Nick The GC in cassandra doesn't refer to jvm gc ? This kind of gc is cassandda's gc , intend to r

how can i ran the word count example on windows?

2010-12-01 Thread Bingbing Liu
i don't know how to set the command line is there a word_count.bat ? like the word_count in the bin on linux? 2010-12-02 Bingbing Liu

Re: When to call the major compaction ?

2010-12-01 Thread Chen Xinli
You are right, jvm gc is for memory. In cassandra, there is a small trick called *PhantomReference*, which will be called when jvm gc. And deletion is actually done in PhantomReference. 2010/12/2 Ying Tang > @Chen Xinli > "and mark old sstables as deleted which will be deleted while jvm gc." > S

Re: how can i ran the word count example on windows?

2010-12-01 Thread Jeremy Hanna
There isn't currently, but perhaps you could contribute one :). If you take a look at the sh script in the bin directory of the word count example, it shouldn't be terribly difficult to mimic the behavior. It's mostly just setting up the classpath and executing the Java class with some argumen

Re: OutOfMemory exceptions w/ Cassandra 0.6.8

2010-12-01 Thread Aram Ayazyan
Thanks a lot Jonathan! That seems to be it, since the exact same configuration w/ the same data starts up and works fine on a different server. -Aram On Wed, Dec 1, 2010 at 5:24 PM, Jonathan Ellis wrote: > Stack trace looks like an OS-level thread limit causing problems, not > actually memory. >

TheGC inspector's frequency

2010-12-01 Thread Ying Tang
The GCInspector's start() method , In this method , StorageService.scheduledTasks.scheduleWithFixedDelay(t, INTERVAL_IN_MS, INTERVAL_IN_MS, TimeUnit.MILLISECONDS); t is Runnable t and it's run method is logIntervalGCStats. According to this code segment , the logIntervalGCStats should be run every

Re: When to call the major compaction ?

2010-12-01 Thread Chen Xinli
2010/12/1 Ying Tang > Every time cassandra creates a new sstable , it will call the > CompactionManager.submitMinorIfNeeded ? And if the number of memtables is > beyond MinimumCompactionThreshold , the minor compaction will be called. > And there is also a method named CompactionManager.submit

Re: When to call the major compaction ?

2010-12-01 Thread Ying Tang
And i have another question , what's the difference between minor compaction and major compaction? On 12/1/10, Chen Xinli wrote: > 2010/12/1 Ying Tang > >> Every time cassandra creates a new sstable , it will call the >> CompactionManager.submitMinorIfNeeded ? And if the number of memtables is

Can not connect to cassandra 0.7 using CLI

2010-12-01 Thread Joshua Partogi
Hi there, I just downloaded cassandra 0.7rc1. I started it using bin/cassandra without making any configuration changes. I then tried to connect using the CLI with command like this: f...@ubuntu:~/Applications/apache-cassandra-0.7.0-rc1$ bin/cassandra-cli Welcome to cassandra CLI. Type 'help' o

Re: When to call the major compaction ?

2010-12-01 Thread Sylvain Lebresne
On Wed, Dec 1, 2010 at 12:11 PM, Ying Tang wrote: > And i have another question , what's the difference between minor > compaction and major compaction? A major compaction is a compaction that compact *all* the SSTables of a given column family (compaction compacts one CF at a time). Before http