Hello,
I'm making some tests on a data model with 3 CF and 1 SCF, I want to start
by inserting 1 million rows (my target is to have 1billion rows) .
I have three nodes cluster (I'm using the same machines with 3GB of RAM
each , intel core2 duo 1,6GHZ), RF = 2, CL = 1, HEAPSIZE of the seed = 3
Hi, dear Guys
I did some test to see if cassandra can store muti-version of the same
data, but from the below test code seems it only can store one
version's data, which is different from HBase.
Can somebody help to confirm this?
It will be very appreciative if some one are kindly enough to give
> I did some test to see if cassandra can store muti-version of the same
> data, but from the below test code seems it only can store one
> version's data, which is different from HBase.
> Can somebody help to confirm this?
Correct. Unlike BigTable and HBase, Cassandra columns don't have a
version
1. So posterior to 0.6.6/0.7 , minor compaction and major compaction both
can clean out rows 'tagged' tombstones , this kind of clean out doesn't
mead remove it from the disk permanently.
The real remove is done by the jvm GC ?
2. The intence of compaction is merging multi sstables into one ,
try
bin/cassandra-cli --host
On Wed, Dec 1, 2010 at 7:29 PM, Joshua Partogi wrote:
> Hi there,
>
> I just downloaded cassandra 0.7rc1. I started it using bin/cassandra
> without making any configuration changes.
>
> I then tried to connect using the CLI with command like this:
>
> f...@ubuntu:~
I'm confused , plz ingore the mail above.
Here is my confusion ,
posterior to 0.6.6/0.7 , minor compaction and major compaction both can
clean out rows 'tagged' tombstones , and generate a new , without
tombstones , sstable .
And the tombstones remains in memory ,waiting to be removed by
All of the times I have had similar issues the problem has always been
misconfigured iptables. You said it was running fine on 0.6.8 though?
On the same box or a different box?
On Dec 1, 2010, at 6:29 AM, Joshua Partogi wrote:
> Hi there,
>
> I just downloaded cassandra 0.7rc1. I started it usi
I have a fairly simple problem that might require a complicated solution.
I need to store Integer -> UUID in a column family, and be able to query
(and then paginate) the rows ordered by the integer in descending order.
This is simple enough if no two rows have the same integer, as the
integer
The part about gc refers to old sstable files on disk. After a compaction,
the old files on disk will be deleted when garbage collection happens.
On Wed, Dec 1, 2010 at 7:31 AM, Ying Tang wrote:
> I'm confused , plz ingore the mail above.
> Here is my confusion ,
>posterior to 0.6.6/0.7 , m
2010/12/1 Ying Tang
> I'm confused , plz ingore the mail above.
> Here is my confusion ,
>posterior to 0.6.6/0.7 , minor compaction and major compaction both
> can clean out rows 'tagged' tombstones , and generate a new , without
> tombstones , sstable .
>
This is right.
> And the t
The key statement:
if (gcw.getDuration() > MIN_DURATION_TPSTATS)
{
logStats();
}
On Wed, Dec 1, 2010 at 2:44 AM, Ying Tang wrote:
> The GCInspector's start() method ,
> In this method ,
> StorageService.scheduledTasks.scheduleWithFixedDelay(t,
Unless I misunderstand the Q, composing the column names with the row
keys and merging the resulting would yield something useful.
keyA => (1, uuid), (2, uuid), (3, uid)
keyB => (1, uuid), (2, uuid), (3, uid)
Should be transformed into:
(1, keyA, uuid),
(1, keyB, uuid),
(2, keyA, uuid),
(2,
> FYI, 0.7.0~rc1 debs are available in a new PPA for experimental
> releases:
>
> http://launchpad.net/~cassandra-ubuntu/+archive/experimental
>
It seems there is a dependancy on libjets3t-java
Is it really needed ? This dependancy cannot be resolved on Ubuntu Lucid :-(
There is https://github.com/posulliv/libcassandra, but I think it's
0.6 only atm.
On Wed, Dec 1, 2010 at 12:13 AM, Narendra Sharma
wrote:
> Are there any C++ clients out there similar to Hector (in terms of features)
> for Cassandra? I am looking for C++ Client for Cassandra 0.7.
>
> Thanks,
> Na
And I've contacted Padraig and he has no intention of upgrading to 0.7. I'm
working heavily in C++ and Cassandra, though, so hopefully I can contribute in
some way eventually. I may be able to help a little bit with C++ and Cassandra
if you're totally stuck, but I'm basically just using thrift,
If you go this route, be sure to take a look at the custom column comparator
I wrote to make this sort of thing easier:
https://github.com/edanuff/CassandraCompositeType
On Wed, Dec 1, 2010 at 4:56 AM, Daniel Lundin wrote:
> You could also use a standard column family, composing the version
> i
Hi,
Can someone please let me know how to shutdown Cassandra on Windows Environment?
stop-server is actually kiing the Cassandra Server and I was unable to
create/get a pid file for the Cassandra process. Are there any setup steps that
are needed that I am missing?
Thanks,
-Ram.
On 12/1/10 4:56 AM, Daniel Lundin wrote:
Correct. Unlike BigTable and HBase, Cassandra columns don't have a
version dimension.
Timestamp is used for (crude) conflict resolution, and older versions
are always overwritten.
I would be careful with the word "overwritten" here as it obfuscated the
im
I am just starting to play with Cassandra. My environment is Ubuntu Lucid
10.04 and latest Cassandra stable 0.6.8 binaries.
I am unclear on how to shut down the Cassandra server. Documentation is
quite clear on starting up the server. I didn't find anything useful on the
user mailing list archiv
On Wed, Dec 1, 2010 at 18:39, Melton Low wrote:
> I am just starting to play with Cassandra. My environment is Ubuntu Lucid
> 10.04 and latest Cassandra stable 0.6.8 binaries.
> I am unclear on how to shut down the Cassandra server. Documentation is
> quite clear on starting up the server. I did
Are there any that compile on Windows without the need for linking in
cygwin?
C
On Tue, Nov 30, 2010 at 10:16 PM, sharanabasava raddi
wrote:
> Thrift is there..
>
>
> On Wed, Dec 1, 2010 at 11:43 AM, Narendra Sharma <
> narendra.sha...@gmail.com> wrote:
>
>> Are there any C++
You can look at this patch. It has a patched version for thrift revision
818530. You will need to apply the patch to the thrift version which your
cassandra release is using. It is thrift revision 917130 for the later
releases of the 0.6 branch not sure about 0.7.
https://issues.apache.org/jira/br
Hello,
I need to store "products" data (product.name, product.price, product.state
and product.owner) in Cassandra 0.7 rc1.
The problem is that I need to get "products" where product.price > XX AND
product.price < XX AND product.name = XXX AND product.state = XXX. Also I
need return the products
There is no "shutdown" command in cassandra, it's designed to be stopped by killing it. The pid file is created by the *nix scripts and is not supported by the cassandra.bat file for windows. There have been a couple of discussions on running cassandra as a service under windows...http://www.mail-a
Is it possible to perform paginated queries using Random Partitioner in 0.7
with Super Column Families whose Super Columns are UUID's? I don't believe
it is, based on this article:
http://ria101.wordpress.com/2010/02/22/cassandra-randompartitioner-vs-orderpreservingpartitioner,
and my attempts wit
Running nodes with different JVM heap sizes would not be recommended practice, for many reasons. Nor would I recommend running them with all the memory the machine has, it will just lead to the OS swapping the JVM out to disk and considerable slow things down.I would suggest a heap size of 1.5 or 2
Hi Brayton.
Thanks for the reply. It was running find on 0.6.8 on the same box.
Kind regards,
Joshua
On Thu, Dec 2, 2010 at 1:05 AM, Brayton Thompson wrote:
> All of the times I have had similar issues the problem has always been
> misconfigured iptables. You said it was running fine on 0.6.8 t
Could you use a Super CF?Super Col name is the Integer, and the Col Names are the UUID. Not sure what your col values are or your key. There are some limitations to Super CF but I do not think they would apply in this case http://wiki.apache.org/cassandra/CassandraLimitationsYou can the slice the s
Take a look at your cassandra.yaml file at the rpc_address this is the address it's listening to connections on. The comments there should help, if you set it to 0.0.0.0 it will bind to all interfaces. Probably not what you want in production but handy for dev.Hope that helps. AaronOn 02 Dec, 2010
It is set to localhost I didn't change it and it is the same as configured
in 0.6.8. Why doesn't it work out of the box?
Thanks heaps.
On Thu, Dec 2, 2010 at 7:49 AM, Aaron Morton wrote:
> Take a look at your cassandra.yaml file at the rpc_address this is the
> address it's listening to connecti
The Partitioner applies to the row keys, not the columns. Their order is determined by the compare_with and compare_subcolumns_with CF settings So where you say "get the last 25 inserts for a key" I'm translating that into "get the most recent 25 super columns for a row, where the super column name
Hey Aaron,
Yes, in regards to SCF definition, you are correct:
name: Sensor
column_type: Super
compare_with: TimeUUIDType
gc_grace_seconds: 864000
keys_cached: 1.0
read_repair_chance: 1.0
rows_cached: 0.0
I'm not quite sure I follow you, though, as
unsubscribe
unsubscribe
Try the solr source code. AaronOn 30 Nov, 2010,at 01:37 PM, Mark wrote:
The DataSource subclass route is what I will probably be interested
in. Are there are working examples of this already out there?
On 11/29/10 12:32 PM, Aaron Morton wrote:
AFAIK ther
Hello,
I'm trying to insert a super column but I can't get passed this error.
the error:
InvalidRequestException(why:column name must not be empty)
at
org.apache.cassandra.thrift.Cassandra$insert_result.read(Cassandra.java:14408)
at
org.apache.cassandra.thrift.Cassandra$Client
When you say "I want to get rows starting from a Super Column..." it's a bit confusing. Do you want to get super columns from a single row, or multiple rows? I'm assuming you are talking about getting columns from a single row / key as that's what your code does.For the pelops code, it looks OK but
Hi Aaron,
Thanks, that helped and it works.
--- On Wed, 12/1/10, Aaron Morton wrote:
From: Aaron Morton
Subject: Re: How to shutdown Cassandra?
To: user@cassandra.apache.org
Date: Wednesday, December 1, 2010, 12:18 PM
There is no "shutdown" command in cassandra, it's designed to be stoppe
Actually, it was a class issue at this line:
System.*out*.println("NAME: " + UUID.*nameUUIDFromBytes*(col.getName()));
The native Pelops class timeUuidHelper is what should be used.
On Wed, Dec 1, 2010 at 4:16 PM, Aaron Morton wrote:
> When you say "I want to get rows starting from a Super Colu
Is there a particular reason why you're not using a high level client?
http://wiki.apache.org/cassandra/ClientOptions
Raw thrift is painful in many ways.
- Tyler
On Wed, Dec 1, 2010 at 5:06 PM, Michael Fortin wrote:
> Hello,
>
> I'm trying to insert a super column but I can't get passed this
Hi,
We have a small cluster of 3 Cassandra servers running w/ full
replication. Every once in a while we get an OutOfMemory exception and
have to restart servers. Sometimes just restarting doesn’t do it and
we have to clean the commitlog or data directory.
We are running Cassandra 0.6.8. There is
Using the methods on the Bytes class would be preferable. The byte[]
related methods on UuidHelper should have been deprecated with the Bytes
class was introduced...
e.g. new Bytes(col.getName()).toUuid()
Cheers,
Dan
On Thu, Dec 2, 2010 at 10:26 AM, Frank LoVecchio wrote:
> Actually, it was a
Try turning up the logging on the server side to DEBUG and see what it says. Chances are you are not sending what you think you are. Or if you feel like it put a breakpoint in o.a.c.thrift.Cassandra$Client.send_insert to see how when the client is doing. I agree with Tyler, higher level clients are
Do you have a log message for the OOM? And some GC messages around it? Have you tried watching the server with jconsole?Is the OOM happening on system start or after it's been running ? Or both?Do you have any row/key caches? Cannot remember but is 0.6* has this but have you enabled the save cache
Hi Aaron,
OOM is happening both after the system has been running for a while as
well as when I restart it afterwards. The only way to make it run
after it has crashed, is to remove everything from data and commitlog
directories. Unfortunately I don't have the original log from when
cassandra cras
Regarding caches, I haven't explicitly enabled them and the
"saved_caches" directory is empty.
-Aram
On Wed, Dec 1, 2010 at 5:05 PM, Aram Ayazyan wrote:
> Hi Aaron,
>
> OOM is happening both after the system has been running for a while as
> well as when I restart it afterwards. The only way to
Stack trace looks like an OS-level thread limit causing problems, not
actually memory.
On Wed, Dec 1, 2010 at 7:05 PM, Aram Ayazyan wrote:
> Hi Aaron,
>
> OOM is happening both after the system has been running for a while as
> well as when I restart it afterwards. The only way to make it run
> a
Hi,
this is an hypothetical situation highly likely to happen:
I have a cassandra 0.7 cluster, filled with production data, and I want to
upgrade cassandra to the 0.8 version (and so on as new versions get
released)
what happen with my data and keyspace / column families definitions? whats
the pr
@Chen Xinli
"and mark old sstables as deleted which will be deleted while jvm gc."
SSTable is on the harddisk , how could jvm gc delete it ? JVM GC is in
charge the using of the space in the memory.
@Nick
The GC in cassandra doesn't refer to jvm gc ? This kind of gc is cassandda's
gc , intend to r
i don't know how to set the command line
is there a word_count.bat ? like the word_count in the bin on linux?
2010-12-02
Bingbing Liu
You are right, jvm gc is for memory.
In cassandra, there is a small trick called *PhantomReference*, which will
be called when jvm gc. And deletion is actually done in PhantomReference.
2010/12/2 Ying Tang
> @Chen Xinli
> "and mark old sstables as deleted which will be deleted while jvm gc."
> S
There isn't currently, but perhaps you could contribute one :). If you take a
look at the sh script in the bin directory of the word count example, it
shouldn't be terribly difficult to mimic the behavior. It's mostly just
setting up the classpath and executing the Java class with some argumen
Thanks a lot Jonathan! That seems to be it, since the exact same
configuration w/ the same data starts up and works fine on a different
server.
-Aram
On Wed, Dec 1, 2010 at 5:24 PM, Jonathan Ellis wrote:
> Stack trace looks like an OS-level thread limit causing problems, not
> actually memory.
>
The GCInspector's start() method ,
In this method ,
StorageService.scheduledTasks.scheduleWithFixedDelay(t,
INTERVAL_IN_MS, INTERVAL_IN_MS, TimeUnit.MILLISECONDS);
t is Runnable t and it's run method is logIntervalGCStats.
According to this code segment , the logIntervalGCStats should be run
every
2010/12/1 Ying Tang
> Every time cassandra creates a new sstable , it will call the
> CompactionManager.submitMinorIfNeeded ? And if the number of memtables is
> beyond MinimumCompactionThreshold , the minor compaction will be called.
> And there is also a method named CompactionManager.submit
And i have another question , what's the difference between minor
compaction and major compaction?
On 12/1/10, Chen Xinli wrote:
> 2010/12/1 Ying Tang
>
>> Every time cassandra creates a new sstable , it will call the
>> CompactionManager.submitMinorIfNeeded ? And if the number of memtables is
Hi there,
I just downloaded cassandra 0.7rc1. I started it using bin/cassandra without
making any configuration changes.
I then tried to connect using the CLI with command like this:
f...@ubuntu:~/Applications/apache-cassandra-0.7.0-rc1$ bin/cassandra-cli
Welcome to cassandra CLI.
Type 'help' o
On Wed, Dec 1, 2010 at 12:11 PM, Ying Tang wrote:
> And i have another question , what's the difference between minor
> compaction and major compaction?
A major compaction is a compaction that compact *all* the SSTables of a given
column family (compaction compacts one CF at a time).
Before http
58 matches
Mail list logo