Saw your more recent email as well.

We’re currently working on attempting patch up the issues with 
_yz_events:handle_info and what that crash was causing. Are you seeing "Core 
with name ‘<index_name>' already exists” in your solr logs? If so, that’s means 
it’s a similar issue, and, again, we’re working on testing patches internally.


Thanks.

Zeeshan Lakhani
programmer | 
software engineer at @basho | 
org. member/founder of @papers_we_love | paperswelove.org
twitter => @zeeshanlakhani

> On Mar 7, 2015, at 11:02 AM, Jason Ryan <jason.r...@trustev.com> wrote:
> 
> We have seen crash _yz_events yes, none today for example, but there was 
> quite a bit yesterday.
> 
> It's a 5 node cluster - 14GB of RAM in each, solr jvm is set at 8GB on each.
> 
> I've not seen any corrupted data, but we could be looking in the wrong place? 
> Our buckets are set to allow_mult false and last_write_wins true, so we don't 
> expect any siblings.
> 
> We had a health done by yourselves also, the main change, which we have not 
> yet applied is switching AAE from active to passive and setting erlang buffer 
> to 128MB
> 
> 
>       
> Jason Ryan
> VP Engineering
> 
> Trustev
> Real Time, Online Identity Verification
>  
> email: jason.r...@trustev.com <mailto:jason.r...@trustev.com> 
> skype: jason_j_ryan
> web: www.trustev.com <http://www.trustev.com/>
> 
> Trustev Ltd, 2100 Cork Airport Business Park, Cork, Ireland. 
> 
> On 7 March 2015 at 15:54, Zeeshan Lakhani <zlakh...@basho.com 
> <mailto:zlakh...@basho.com>> wrote:
> Hello Jason,
> 
> I initially was thinking that you’re issues were similar to what we’ve found 
> here, https://github.com/basho/yokozuna/issues/442#issuecomment-77233636 
> <https://github.com/basho/yokozuna/issues/442#issuecomment-77233636>, but 
> seeing {error, retry_later} and the 500s seem to place the problems more on 
> the Solr end. Just to make sure, are you seeing `yz_events` crashes in your 
> logs?
> 
> Can you tell me how much RAM each node has and have you adjusted/increased 
> the search.sol.jmv_options max heap size at all (more info on issues/factors 
> w/ Solr performance here -> 
> http://docs.basho.com/riak/latest/ops/advanced/configs/search/#Solr-for-Operators
>  
> <http://docs.basho.com/riak/latest/ops/advanced/configs/search/#Solr-for-Operators>)?
>  
> 
> What kind of issues are you getting in your solr logs? Anything related to 
> bad/corrupted data (which will index into _yz_err fields when dealing with 
> failed extractions) or possible sibling explosion (duplicates of the same 
> object with different _yz_vtag fields)? 
> 
> Thanks.
> 
> 
> Zeeshan Lakhani
> programmer | 
> software engineer at @basho | 
> org. member/founder of @papers_we_love | paperswelove.org 
> <http://paperswelove.org/>
> twitter => @zeeshanlakhani
> 
>> On Mar 7, 2015, at 5:43 AM, Jason Ryan <jason.r...@trustev.com 
>> <mailto:jason.r...@trustev.com>> wrote:
>> 
>> Hi all,
>> 
>> We're having real trouble with Riak Search.
>> We are seeing an awful lot of errors, which leads to alot of logging for 
>> solr and the disk IO reaching 95% + which is causing lots of issues.
>> 
>> - We consistently see errors around Riak trying to create indexes which 
>> already exist - only a restart of each Riak node stops this for a period of 
>> time - indexes are only ever created manually, not by software.
>> 
>> - We see lots of errors around failing to index objects - details of what 
>> appears in the log are:
>> 
>> 2015-03-07 10:30:26.871 [error] <0.2538.0>@yz_kv:index:215 failed to index 
>> object 
>> {{<<"Production">>,<<"Grains.Domain.Case">>},<<"455203890918dfc6fd3c7da49dd6adb00300000043a46a51">>}
>>  with error {"Failed to index docs",{error,retry_later}} because 
>> [{yz_solr,index,3,[{file,"src/yz_solr.erl"},{line,192}]},{yz_kv,index,7,[{file,"src/yz_kv.erl"},{line,267}]},{yz_kv,index,3,[{file,"src/yz_kv.erl"},{line,202}]},{riak_kv_vnode,actual_put,6,[{file,"src/riak_kv_vnode.erl"},{line,1418}]},{riak_kv_vnode,perform_put,3,[{file,"src/riak_kv_vnode.erl"},{line,1406}]},{riak_kv_vnode,do_put,7,[{file,"src/riak_kv_vnode.erl"},{line,1201}]},{riak_kv_vnode,handle_command,3,[{file,"src/riak_kv_vnode.erl"},{line,486}]},{riak_core_vnode,vnode_command,3,[{file,"src/riak_core_vnode.erl"},{line,345}]}]
>> 
>> 
>> - We are also starting to see 500's being returned for search queries - the 
>> response looks like this:
>> 
>> <html><head><title>500 Internal Server 
>> Error</title></head><body><h1>Internal Server Error</h1>The server 
>> encountered an error while processing this request:<br><pre>{error,
>> 
>>     {throw,
>> 
>>         {"Failed to search",
>> 
>>          "http://localhost:8093/internal_solr/sessions/select 
>> <http://localhost:8093/internal_solr/sessions/select>",
>> 
>>          {error,retry_later}},
>> 
>>         [{yz_solr,search,3,[{file,"src/yz_solr.erl"},{line,278}]},
>> 
>>          {yz_wm_search,search,2,[{file,"src/yz_wm_search.erl"},{line,129}]},
>> 
>>          {webmachine_resource,resource_call,3,
>> 
>>              [{file,"src/webmachine_resource.erl"},{line,186}]},
>> 
>>          {webmachine_resource,do,3,
>> 
>>              [{file,"src/webmachine_resource.erl"},{line,142}]},
>> 
>>          {webmachine_decision_core,resource_call,1,
>> 
>>              [{file,"src/webmachine_decision_core.erl"},{line,48}]},
>> 
>>          {webmachine_decision_core,decision,1,
>> 
>>              [{file,"src/webmachine_decision_core.erl"},{line,558}]},
>> 
>>          {webmachine_decision_core,handle_request,2,
>> 
>>              [{file,"src/webmachine_decision_core.erl"},{line,33}]},
>> 
>>          {webmachine_mochiweb,loop,2,
>> 
>> * Connection #0 to host 10.0.5.6 left intact
>> 
>>              
>> [{file,"src/webmachine_mochiweb.erl"},{line,74}]}]}}</pre><P><HR><ADDRESS>mochiweb+webmachine
>>  web server</ADDRESS></body></html>
>> 
>> 
>> 
>> Anyone that could point us in the right direction of where to look and 
>> debug, as this is becoming a huge issue for us.
>> 
>> 
>> 
>> Thanks,
>> 
>> Jason
>> 
>> 
>> 
>> 
>> 
>> 
>> This message is for the named person's use only. If you received this 
>> message in error, please immediately delete it and all copies and notify the 
>> sender. You must not, directly or indirectly, use, disclose, distribute, 
>> print, or copy any part of this message if you are not the intended 
>> recipient. Any views expressed in this message are those of the individual 
>> sender and not Trustev Ltd. Trustev is registered in Ireland No. 516425 and 
>> trades from 2100 Cork Airport Business Park, Cork, Ireland.
>> 
>> _______________________________________________
>> riak-users mailing list
>> riak-users@lists.basho.com <mailto:riak-users@lists.basho.com>
>> http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com 
>> <http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com>
> 
> 
> 
> This message is for the named person's use only. If you received this message 
> in error, please immediately delete it and all copies and notify the sender. 
> You must not, directly or indirectly, use, disclose, distribute, print, or 
> copy any part of this message if you are not the intended recipient. Any 
> views expressed in this message are those of the individual sender and not 
> Trustev Ltd. Trustev is registered in Ireland No. 516425 and trades from 2100 
> Cork Airport Business Park, Cork, Ireland.
> 

_______________________________________________
riak-users mailing list
riak-users@lists.basho.com
http://lists.basho.com/mailman/listinfo/riak-users_lists.basho.com

Reply via email to