Re: Repair session failed

aaron morton Mon, 06 May 2013 01:31:37 -0700

Can your raise a ticket at https://issues.apache.org/jira/browse/CASSANDRA  and 
update the thread with the link?


Please include:
* nodetool status
* nodetool ring (so we have all the token assignments)
* The IP you started repair on 
* As much log as you can share, if you can run DEBUG for the 
org.apache.cassandra.service.AntiEntropyService it would be handy. 
* the command you used to start nodetool

A range selected for the repair is not fully contained by any of ranges the 
node replicates. 

Cheers

-----------------
Aaron Morton
Freelance Cassandra Consultant
New Zealand

@aaronmorton
http://www.thelastpickle.com

On 3/05/2013, at 9:02 PM, Christopher Wirt <chris.w...@struq.com> wrote:

> Hi Aaron,
>  
> We’re running 1.2.4, so with vNodes
>  
> We ran scrub but saw the issue again when repairing
>  
> nodetool status –
>  
> Datacenter: DC01
> =================
> Status=Up/Down
> |/ State=Normal/Leaving/Joining/Moving
> --  Address       Load       Tokens  Owns   Host ID                           
>     Rack
> UN  10.70.48.23   35.16 GB   256     13.1%  
> 4a7bc489-25af-4c20-80f8-499ffcb18e2d  RAC1
> UN  10.70.6.79    30.04 GB   256     12.6%  
> 98a1167f-cf75-4201-a454-695e0f7d2d72  RAC1
> UN  10.70.6.78    41.94 GB   256     11.9%  
> 62a418b5-3c38-4f66-874d-8138d6d565e5  RAC1
> UN  10.70.47.66   54.79 GB   256     13.8%  
> ab564d16-4081-4866-b8ba-26461d9a93d7  RAC1
> UN  10.70.6.91    46.96 GB   256     12.6%  
> 2e1e7179-82e6-4ae6-b986-383acc9fc8a2  RAC1
> UN  10.70.47.126  38.04 GB   256     11.8%  
> d4bed3b1-ffaf-4c68-b560-d270355c8c4b  RAC1
> Datacenter: DC02
> =================
> Status=Up/Down
> |/ State=Normal/Leaving/Joining/Moving
> --  Address       Load       Tokens  Owns   Host ID                           
>     Rack
> UN  10.56.0.144   31.71 GB   256     12.0%  
> 1860011e-fa7c-4ce1-ad6b-c8a38a5ddd02  RAC1
> UN  10.56.0.140   86.28 GB   256     12.3%  
> f3fa985d-5056-4ddc-b146-d02432c3a86e  RAC1
>  
>  
> Thanks,
>  
> Chris
>  
>  
> From: aaron morton [mailto:aa...@thelastpickle.com] 
> Sent: 02 May 2013 19:31
> To: user@cassandra.apache.org
> Subject: Re: Repair session failed
>  
> Hold off on running scrub (but yes it's an online operation). This is an 
> issue with the token ranges. 
>  
> What version are you using ? 
> Are you using vNodes ?
> Can you share the output of nodetool ring (if no vnodes) or nodetool status 
> (if using vnodes) ?
>  
> Cheers
>  
> -----------------
> Aaron Morton
> Freelance Cassandra Consultant
> New Zealand
>  
> @aaronmorton
> http://www.thelastpickle.com
>  
> On 2/05/2013, at 3:08 AM, Haithem Jarraya <haithem.jarr...@struq.com> wrote:
> 
> 
> Can I run scrub while the node is in the ring and receiving writes?
> Or I should disable thrift before?
>  
> 
> On 1 May 2013 15:52, <moshe.kr...@barclays.com> wrote:
> Sounds like a job for “nodetool scrub”, which rewrites the SStable rows in 
> the correct order. After the scrub, nodetool repair should succeed.
>  
> From: Haithem Jarraya [mailto:haithem.jarr...@struq.com] 
> Sent: Wednesday, May 01, 2013 5:46 PM
> To: user@cassandra.apache.org
> Subject: Repair session failed
>  
> Hi, 
>  
> I am seeing this error message during repair,
>  
>  INFO [AntiEntropyStage:1] 2013-05-01 14:30:54,300 AntiEntropyService.java 
> (line 764) [repair #ed104480-b26a-11e2-af9b-05179fa66b76] mycolumnfamily is 
> fully synced (1 remaining column family to sync for this session)
> ERROR [Thread-12725] 2013-05-01 14:30:54,304 StorageService.java (line 2420) 
> Repair session failed:
> java.lang.IllegalArgumentException: Requested range intersects a local range 
> but is not fully contained in one; this would lead to imprecise repair
>         at 
> org.apache.cassandra.service.AntiEntropyService.getNeighbors(AntiEntropyService.java:175)
>         at 
> org.apache.cassandra.service.AntiEntropyService$RepairSession.<init>(AntiEntropyService.java:621)
>         at 
> org.apache.cassandra.service.AntiEntropyService$RepairSession.<init>(AntiEntropyService.java:610)
>         at 
> org.apache.cassandra.service.AntiEntropyService.submitRepairSession(AntiEntropyService.java:127)
>         at 
> org.apache.cassandra.service.StorageService.forceTableRepair(StorageService.java:2480)
>         at 
> org.apache.cassandra.service.StorageService$4.runMayThrow(StorageService.java:2416)
>         at 
> org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28)
>         at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439)
>         at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
>         at java.util.concurrent.FutureTask.run(FutureTask.java:138)
>         at java.lang.Thread.run(Thread.java:662)
>  
>  
> what does it mean imprecise repair?
> Is it maybe because I went over the gcgrade period?
> What you do if you go over that period?
> Any hint will be valuable, 
> Also I noticed when I run a repair on different node, I see a message like 
> this
>  
> [2013-05-01 14:30:54,305] Starting repair command #5, repairing 1120 ranges 
> for keyspace struqrealtime
>  
> I have couple of questions, why I have repair command #5?
> And why the ranges values changes from one node to another?
>  
>  
> Many Thanks,
>  
> H
> _______________________________________________
> 
> This message is for information purposes only, it is not a recommendation, 
> advice, offer or solicitation to buy or sell a product or service nor an 
> official confirmation of any transaction. It is directed at persons who are 
> professionals and is not intended for retail customer use. Intended for 
> recipient only. This message is subject to the terms at: 
> www.barclays.com/emaildisclaimer.
> 
> For important disclosures, please see: 
> www.barclays.com/salesandtradingdisclaimer regarding market commentary from 
> Barclays Sales and/or Trading, who are active market participants; and in 
> respect of Barclays Research, including disclosures relating to specific 
> issuers, please see http://publicresearch.barclays.com.
> 
> _______________________________________________
> 
>

Re: Repair session failed

Reply via email to