MoreLikeThis query parser can not fetch document with id using implicit router

2021-09-20 Thread r ohara
We are using the SOLR 8.9 in cloud mode and having issues getting results
with the MLT query parser. We get this error  "Error completing MLT
request. Could not fetch document with id "

We're using an implicit router field for sharding, so it looks like this
same issue I found from 2017 here:
http://mail-archives.apache.org/mod_mbox/lucene-solr-user/201704.mbox/%3c1492518046754-4330485.p...@n3.nabble.com%3E
 "We've bumped into this issue too, but it was through the MoreLikeThis
query parser. Internally it uses the get handler to obtain the seed
document. One of our SOLR collections uses a shard router that is not the
document id. The get handler will fail if the value of the document id is
not the same as the routing key. And then this causes the CloudMLTQParser
to issue the message "Error completing MLT request. Could not fetch
document with id [/x/]"

Can we not use the MLT query parser if we're using implicit router
sharding? Any suggestions would be appreciated.

Thanks in advance


Re: Delete by Id in solr cloud

2022-06-29 Thread r ohara
Hi Satya,
I think it's a bug with using compositeId. We had the same issue, and had
to use deleteByQuery instead, but like you said, it's much slower. We're
using solr 8.11

On Tue, Jun 28, 2022 at 4:59 AM Satya Nand 
wrote:

> Thanks, Peter,
> I am checking that, also UpdateRequest class seems to have methods that
> take routes as input. I will see if it helps.
>
> On Tue, Jun 28, 2022 at 3:19 PM Peter Lancaster <
> peter.lancas...@findmypast.com> wrote:
>
> > Hi Satya,
> >
> > I think you would need to use a HttpSolrClient that uses the url of the
> > shard where the record exists.
> >
> > Regards,
> > Peter.
> >
> > -Original Message-
> > From: Satya Nand 
> > Sent: 28 June 2022 10:43
> > To: users@solr.apache.org
> > Subject: Re: Delete by Id in solr cloud
> >
> > EXTERNAL SENDER: Do not click any links or open any attachments unless
> you
> > trust the sender and know the content is safe.
> >
> >
> > Hi Radu,
> >
> > I am using solrj for executing the query. I couldn't find any function
> > with accepts additional parameters like routing, shards, solr Params etc.
> >
> > I also tried delete by query instead of deleteById, But it is very slow.
> >
> >
> >
> https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fsolr.apache.org%2Fdocs%2F8_1_0%2Fsolr-solrj%2Forg%2Fapache%2Fsolr%2Fclient%2Fsolrj%2Fimpl%2FCloudSolrClient.html&data=05%7C01%7Cpeter.lancaster%40findmypast.com%7C52e71d1ca9294234c62808da58eaa4a0%7C75e41e0807c2445db397039b2b54c244%7C0%7C0%7C637920062049080011%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=4d7N5LCpx8TXnEv7GW%2BN2TmoE8YvHa0tgr4c%2FamgOBw%3D&reserved=0
> > deleteById
> > <
> >
> https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fsolr.apache.org%2Fdocs%2F7_3_1%2Fsolr-solrj%2Forg%2Fapache%2Fsolr%2Fclient%2Fsolrj%2FSolrClient.html%23deleteById-java.lang.String-java.util.List-int-&data=05%7C01%7Cpeter.lancaster%40findmypast.com%7C52e71d1ca9294234c62808da58eaa4a0%7C75e41e0807c2445db397039b2b54c244%7C0%7C0%7C637920062049080011%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=MdBKsoMlbTqUjx5xzUny1Hrop0La2cwkg6cVZgZ76Es%3D&reserved=0
> > >
> > (String
> > <
> >
> https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fdocs.oracle.com%2Fjavase%2F8%2Fdocs%2Fapi%2Fjava%2Flang%2FString.html%3Fis-external%3Dtrue&data=05%7C01%7Cpeter.lancaster%40findmypast.com%7C52e71d1ca9294234c62808da58eaa4a0%7C75e41e0807c2445db397039b2b54c244%7C0%7C0%7C637920062049080011%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=3gPAaYNOQvAkYD8coSuGjm28gau5i3lEJabT4Kqu%2BCk%3D&reserved=0
> > >
> >  collection, List
> > <
> >
> https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fdocs.oracle.com%2Fjavase%2F8%2Fdocs%2Fapi%2Fjava%2Futil%2FList.html%3Fis-external%3Dtrue&data=05%7C01%7Cpeter.lancaster%40findmypast.com%7C52e71d1ca9294234c62808da58eaa4a0%7C75e41e0807c2445db397039b2b54c244%7C0%7C0%7C637920062049080011%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=bUQ0Fe0pPkP2kFeRy%2BLg%2FuTIBSEM1HVQdk4EEAdQYCQ%3D&reserved=0
> > >
> >  > <
> >
> https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fdocs.oracle.com%2Fjavase%2F8%2Fdocs%2Fapi%2Fjava%2Flang%2FString.html%3Fis-external%3Dtrue&data=05%7C01%7Cpeter.lancaster%40findmypast.com%7C52e71d1ca9294234c62808da58eaa4a0%7C75e41e0807c2445db397039b2b54c244%7C0%7C0%7C637920062049080011%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=3gPAaYNOQvAkYD8coSuGjm28gau5i3lEJabT4Kqu%2BCk%3D&reserved=0
> > >>
> > ids,
> > int commitWithinMs)
> >
> >
> >
> >
> > On Tue, Jun 28, 2022 at 12:58 PM Radu Gheorghe <
> radu.gheor...@sematext.com
> > >
> > wrote:
> >
> > > Hi Satya,
> > >
> > > I didn't try it, but does it work if you add "shards=shard1,shard2..."
> > > to the request?
> > >
> > > Worst case scenario, if you have the address of each shard (you can
> > > get it from Zookeeper), you can run the delete command N times, one
> > > hitting each shard address.
> > >
> > > Best regards,
> > > Radu
> > > --
> > > Elasticsearch/OpenSearch & Solr Consulting, Production Support &
> > > Training Sematext Cloud - Full Stack Observability
> > > https://gbr01.safelinks.protection.outlook.com/?url=http%3A%2F%2Fsemat
> > > ext.com%2F&data=05%7C01%7Cpeter.lancaster%40findmypast.com%7C52e71
> > > d1ca9294234c62808da58eaa4a0%7C75e41e0807c2445db397039b2b54c244%7C0%7C0
> > > %7C637920062049080011%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQ
> > > IjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C3000%7C%7C%7C&sdata=5
> > > T28n9NppzIpUi9MaWeF1ZYcQuh%2FreGX2iVvsDczleI%3D&reserved=0
> > >
> > >
> > > On Tue, Jun 28, 2022 at 7:55 AM Satya Nand  > > .invalid>
> > > wrote:
> > >
> > > > Hi,
> > > >
> > > > I have an 8 shards 

replicate blendedInfixSuggesterIndexDir

2022-07-15 Thread r ohara
Hi all,
Is there a way to replicate the suggester (AnalyzingInfixSuggester) index
or does that have to be built on every node? Our index is quite large and
it takes a few days to build.

We're using Solr 8.11 in cloud mode.

Thanks in advance.


Re: Suggester index replication

2023-03-20 Thread r ohara
Would it work if we just copied over the directory? In my case the
blendedInfixSuggesterIndexDir?

Thanks

On Thu, Mar 2, 2023 at 7:17 PM Walter Underwood 
wrote:

> When we were using old style replication, I did have the suggester lexicon
> replicated along with other config files, and I think I triggered a
> suggester build
> on replication or maybe commit (which happens with every replication).
> I remember it being kind of fussy to set up. You might want to set up an
> extra
> downstream machine to play with until you get it right.
>
> wunder
> Walter Underwood
> wun...@wunderwood.org
> http://observer.wunderwood.org/  (my blog)
>
> > On Mar 2, 2023, at 10:42 AM, gnandre  wrote:
> >
> > Thanks! I am using non-cloud mode at the moment. So, there is no way to
> > just index it to the index node and get it replicated to the search
> nodes?
> > Do I have to index to each search node?
> >
> > Do you know why the suggester indexing does not follow the usual search
> > indexing model?
> >
> > On Thu, Mar 2, 2023, 12:22 PM Walter Underwood 
> > wrote:
> >
> >> You need to send a build request to each node. I used to have some code
> to
> >> dig out the nodes from a cluster status, then send a build to each one,
> but
> >> I think that is marooned at my previous company. It isn’t super hard,
> just
> >> dig it out of the JSON.
> >>
> >> wunder
> >> Walter Underwood
> >> wun...@wunderwood.org
> >> http://observer.wunderwood.org/  (my blog)
> >>
> >>> On Mar 2, 2023, at 9:03 AM, gnandre  wrote:
> >>>
> >>> Can anybody please answer this? Many thanks in advance!
> >>>
> >>> On Wed, Feb 16, 2022 at 12:52 AM gnandre 
> >> wrote:
> >>>
>  Is there a way to get suggester index replicated to all search nodes
> >> from
>  index node? Do I need to build suggester index for each search node
>  separately?
> 
> >>
> >>
>
>


solr backups failing

2023-03-22 Thread r ohara
Hi all,
We are using solrcloud 8.11 with 10 shards. We've been getting errors
trying to back up our index using the collections API. Here's the error we
see in some of the nodes when we run the REQUESTSTATUS api.

Failed to backup core=collection_02_replica_t21 because
java.io.IOException: Input/output error

We ran it successfully once, but since then at least one node, but not
always the same one fails. It is a large index with almost 100 million
documents and 2T total in size. What could cause this error?

Thanks in advance


Re: solr backups failing

2023-04-05 Thread r ohara
The logs did not show anything that would explain the error. I believe
the issue was because I had added a shard since the last backup. Once I
deleted the backup, it worked as expected and subsequent backups worked as
well.

Thanks for the comments.

On Wed, Mar 29, 2023 at 7:10 AM Vivaldi  wrote:

> Are you using HDFS to backup or shared network directory? In the case of
> shared directory the same location should be writable (accessible and has
> write permissions) from all solr nodes.
>
>
> > On 28 Mar 2023, at 19:22, Houston Putman 
> wrote:
> >
> > Can you provide more information, such as the request you send to solr
> and
> > logs that might explain the IOException?
> >
> > - Houston
> >
> >> On Wed, Mar 22, 2023 at 10:51 AM r ohara  wrote:
> >>
> >> Hi all,
> >> We are using solrcloud 8.11 with 10 shards. We've been getting errors
> >> trying to back up our index using the collections API. Here's the error
> we
> >> see in some of the nodes when we run the REQUESTSTATUS api.
> >>
> >> Failed to backup core=collection_02_replica_t21 because
> >> java.io.IOException: Input/output error
> >>
> >> We ran it successfully once, but since then at least one node, but not
> >> always the same one fails. It is a large index with almost 100 million
> >> documents and 2T total in size. What could cause this error?
> >>
> >> Thanks in advance
> >>
>
>


Replication and generation number

2023-06-14 Thread r ohara
Hi Solr folks,
We are using solr 8.11 in solrcloud mode with one TLOG and two PULL replica
setup. We're having issues not getting the latest updates on our PULL
replicas. We have autocommit set to 10 minutes and we're seeing that every
once in a while the generation id gets updated, but not the version number
and therefore the replication does not get kicked off until another update
comes in, sometimes several hours later, which changes the version number
and subsequently gets kicked off during the next polling.

Does anyone know what the generation id is for and when/how that gets
updated? And why the version number might not get updated?

Thanks in advance!


BlendedInfixSuggester replication

2023-07-24 Thread r ohara
Hello all,

We are using Solr 8.11.2 in solrcloud mode and using the
BlendedInfixSuggester for autocomplete for our site. We have a very large
index and it takes almost 2 days to finish building so during this time,
autosuggest isn't available. It's a TLOG/PULL replica setup, so we tried to
build on the TLOG, and copy over the blendedInfixSuggesterIndexDir to the
PULL replicas but we just get empty results back. I found this ticket (
https://issues.apache.org/jira/browse/SOLR-866)
 which implies that
replication is not supported. Is there a good way to deal with this? We
have continuous updates so we would like to build at least once a week.

Thanks in advance


Prevent traffic to solr node

2023-08-03 Thread r ohara
Hello!
Is there a way to mark a solr node (pull) as 'sick' to prevent traffic
going to it? We're using solrcloud 8.9. I'm wondering if there's a way
using the cluster node management api?

Thanks in advance