Re: Corrupt SSTable

2020-08-13 Thread Nitan Kainth
If you are not deleting or updating data then it should be safe to use 2nd approach. Regards, Nitan Cell: 510 449 9629 > On Aug 13, 2020, at 11:48 AM, Pushpendra Rajpoot > wrote: > >  > Hi, > > I have a cluster of 2 DC, each DC has 5 nodes in production. This cluster is > based on active-

Re: Corrupt SSTable Cassandra 3.11.2

2020-02-25 Thread manish khandelwal
Thanks all for your support. I executed the discussed process (barring repair, as table was read for reporting only) and it worked fine in production. Regards Manish >

Re: Corrupt SSTable Cassandra 3.11.2

2020-02-14 Thread Jeff Jirsa
The risk is you violate consistency while you run repair Assume you have three replicas for that range, a b c At some point b misses a write, but it’s committed on a and c for quorum Now c has a corrupt sstable You empty c and bring it back with no data and start repair Then the app reads at q

Re: Corrupt SSTable Cassandra 3.11.2

2020-02-13 Thread manish khandelwal
Thanks Jeff for your response. Do you see any risk in following approach 1. Stop the node. 2. Remove all sstable files from */var/lib/cassandra/data/keyspace/tablename-23dfadf32adf33d33s333s33s3s33 * directory. 3. Start the node. 4. Run full repair on this particular table I wanted to go thi

Re: Corrupt SSTable Cassandra 3.11.2

2020-02-13 Thread Jeff Jirsa
Agree this is both strictly possible and more common with LCS. The only thing that's strictly correct to do is treat every corrupt sstable exception as a failed host, and replace it just like you would a failed host. On Thu, Feb 13, 2020 at 10:55 PM manish khandelwal < manishkhandelwa...@gmail.co

Re: Corrupt SSTable Cassandra 3.11.2

2020-02-13 Thread manish khandelwal
Thanks Erick I would like to explain how data resurrection can take place with single SSTable deletion. Consider this case of table with Levelled Compaction Strategy 1. Data A written a long time back. 2. Data A is deleted and tombstone is created. 3. After GC grace tombstone is purgeable. 4. No

Re: Corrupt SSTable Cassandra 3.11.2

2020-02-13 Thread Erick Ramirez
The log shows that the the problem occurs when decompressing the SSTable but there's not much actionable info from it. I would like to know what will be "ordinary hammer" in this case. Do you > want to suggest that deleting only corrupt sstable file ( in this case > mc-1234-big-*.db) would be su

Re: Corrupt SSTable Cassandra 3.11.2

2020-02-13 Thread manish khandelwal
Hi Erick Thanks for your quick response. I have attached the full stacktrace which show exception during validation phase of table repair. I would like to know what will be "ordinary hammer" in this case. Do you want to suggest that deleting only corrupt sstable file ( in this case *mc-1234-big

Re: Corrupt SSTable Cassandra 3.11.2

2020-02-13 Thread Erick Ramirez
It will achieve the outcome you are after but I doubt anyone would recommend that approach. It's like using a sledgehammer when an ordinary hammer would suffice. And if you were hitting some bug then you'd run into the same problem anyway. Can you post the full stack trace? It might provide us som

Re: Corrupt SSTable Cassandra 3.11.2

2020-02-13 Thread manish khandelwal
Hi Eric Thanks for reply. Reason for corruption is unknown to me. I just found the corrupt table when scheduled repair failed with logs showing *ERROR [ValidationExecutor:16] 2020-01-21 19:13:18,123 CassandraDaemon.java:228 - Exception in thread Thread[ValidationExecutor:16,1,main]org.apach

Re: Corrupt SSTable Cassandra 3.11.2

2020-02-13 Thread Erick Ramirez
You need to stop C* in order to run the offline sstable scrub utility. That's why it's referred to as "offline". :) Do you have any idea on what caused the corruption? It's highly unusual that you're thinking of removing all the files for just one table. Typically if the corruption was a result of

Re: Corrupt SSTABLE over and over

2016-08-17 Thread Kai Wang
This might not be good news to you. But my experience is that C* 2.X/Windows is not ready for production yet. I've seen various file system related errors. And in one of the JIRAs I was told major work (or rework) is done in 3.X to improve C* stability on Windows. On Tue, Aug 16, 2016 at 3:44 AM,

Re: Corrupt SSTABLE over and over

2016-08-15 Thread Bryan Cheng
Hi Alaa, Sounds like you have problems that go beyond Cassandra- likely filesystem corruption or bad disks. I don't know enough about Windows to give you any specific advice but I'd try a run of chkdsk to start. --Bryan On Fri, Aug 12, 2016 at 5:19 PM, Alaa Zubaidi (PDF) wrote: > Hi Bryan, > >

Re: Corrupt SSTABLE over and over

2016-08-12 Thread Alaa Zubaidi (PDF)
Hi Bryan, Changing disk_failure_policy to best_effort, and running nodetool scrub, did not work, it generated another error: java.nio.file.AccessDeniedException Also tried to remove all files (data, commitlog, savedcaches) and restart the node fresh, and still I am getting corruption. and Still

Re: Corrupt SSTABLE over and over

2016-08-12 Thread Bryan Cheng
Should also add that if the scope of corruption is _very_ large, and you have a good, aggressive repair policy (read: you are confident in the consistency of the data elsewhere in the cluster), you may just want to decommission and rebuild that node. On Fri, Aug 12, 2016 at 11:55 AM, Bryan Cheng

Re: Corrupt SSTABLE over and over

2016-08-12 Thread Bryan Cheng
Looks like you're doing the offline scrub- have you tried online? Here's my typical process for corrupt SSTables. With disk_failure_policy set to stop, examine the failing sstables. If they are very small (in the range of kbs), it is unlikely that there is any salvageable data there. Just delete

Re: Corrupt SSTABLE over and over

2016-08-12 Thread Alaa Zubaidi (PDF)
Hi Jason, Thanks for your input... Thats what I am afraid of? Did you find any HW error in the VMware and HW logs? any indication that the HW is the reason? I need to make sure that this is the reason before asking the customer to spend more money? Thanks, Alaa On Thu, Aug 11, 2016 at 11:02 PM,

Re: Corrupt SSTABLE over and over

2016-08-12 Thread Alaa Zubaidi (PDF)
One more thing I noticed.. The corrupted SSTable is mentioned twice in the log file [CompactionExecutor:10253] 2016-08-11 08:59:01,952 - Compacting (.) [...la-1104-big-Data.db, ] [CompactionExecutor:10253] 2016-08-11 09:32:04,814 - Compacting (.) [...la-1104-big-Data.db] Is

Re: Corrupt SSTABLE over and over

2016-08-12 Thread Jason Wee
cassandra run on virtual server (vmware)? > I tried sstablescrub but it crashed with hs-err-pid-... maybe try with larger heap allocated to sstablescrub this sstable corrupt i ran into it as well (on cassandra 1.2), first i try nodetool scrub, still persist, then offline sstablescrub still persis

Re: corrupt sstable

2013-05-24 Thread Robert Coli
On Fri, May 24, 2013 at 7:07 AM, Hiller, Dean wrote: > We have a corrupt sstable databus5-nreldata-ib-36763-Data.db. How do we > safely blow this away? (and then we would run repair to make sure all data > is still there)… > > Can we just move the file out from under cassandra? (or would cass

Re: corrupt sstable

2013-05-24 Thread Edward Capriolo
Delete the data, and associated index/filter/etc files. (all the files with the same number). If you are reading at one, bring the node up and disable thrift then run repair. If you are reading at quorum or higher, not a big need to stop thrift. Just run repair. On Fri, May 24, 2013 at 10:07 AM,