>>>>> "Eric" == Eric Czech <e...@nextbigsound.com> writes:

    Eric> Yea that's not a mapping I'd like to maintain either -- as an
    Eric> experiment, I copied production sstables to the analysis
    Eric> cluster and ran brisk/cassandra without specifying an initial
    Eric> token (after deleting the LocationInfo* files and renaming the
    Eric> cluster). 

Based on my understanding this will allow everything to start up, yes.

    Eric>  As far as I can tell, everything is running normally but I'm
    Eric> not sure how the cluster chose tokens for the nodes given that
    Eric> I didn't specify them after just dropping the raw sstables
    Eric> in.  I can still read data as usual from the column families
    Eric> that were copied but I'm not sure how not specifying the
    Eric> tokens affects everything.

Did you check the ring to see what tokens you got for the analysis
cluster? I would be surprised if you got the same ring configuration as
production.

    Eric>  Is some of my data just unreachable now because the tokens
    Eric> weren't manually defined?

I suspect your data is messed up. But the best way to determine it would
be to examine the ring (use nodetool) - if it is the same as your
production cluster you are good to go.

Also, did you set your (non seed) nodes in the analysis cluster to auto
bootstrap or not? That impacts what happens.


    Eric>  This doesn't appear to be the case but is this something you
    Eric> have tried too or do you understand the storage / topology
    Eric> logic well enough to know that this isn't a viable strategy?

No and No. I have been reading the code. Line 497 of
org.apache.cassandra.service.StorageService.java on trunk is a good
place to start since what happens depends somewhat on your specific
cassandra.yaml settings (specifically auto bootstrap).

I would be betting you are getting random tokens (look for "Generated
random token..." in your log). Don't trust me, read the code. I have all
of two weeks of experience with this stuff (and it's not quite my day
job to be doing it either :-)

Bottom line: I think you need to fix the seeds for your use case.

Cheers!
Shyamal

Reply via email to