Re: Deleting document on wrong shard?

2023-05-25 Thread Thomas Corthals
Hi Walter Deleting multiple IDs at once with JSON is mentioned here: https://solr.apache.org/guide/solr/latest/indexing-guide/indexing-with-update-handlers.html#sending-json-update-commands Or a list of document IDs: > > { "delete":["id1","id2"] } > > Thomas Op wo 24 mei 2023 om 22:23 schreef Wa
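
For illustration, a minimal Python sketch of building the delete-by-ID body quoted above. The helper name build_delete_by_ids is hypothetical, and the sketch only builds the JSON text; sending it to a collection's update handler is left out.

```python
import json


def build_delete_by_ids(ids):
    """Return the JSON body for a delete command that lists several document IDs."""
    # The body has the shape quoted in the message: { "delete": ["id1", "id2"] }
    return json.dumps({"delete": list(ids)})


payload = build_delete_by_ids(["id1", "id2"])
print(payload)
```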

join query parser performance

2023-05-25 Thread Ron Haines
I've been using the 'join' query parser to 'filter out' related documents that should not be part of the result set. Functionally, it is working fine. However, when we throw a 'real' level of customer traffic at it, it pretty much brings Solr to its knees. CPU increases a lot. Close to 3X, when

Re: join query parser performance

2023-05-25 Thread Joel Bernstein
If you are using a recent version of Solr, try adding the parameter method=topLevelDV. Let us know how this affects performance in your use case. What matters most here is the number of documents the from side of the join matches. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, May 25, 202
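
A small Python sketch of how the suggested parameter could be added to a join filter string. The helper join_filter is hypothetical. The collection and field names (primary_rollup, group_id_mv, group_member_id) come from the negative filter quoted elsewhere in this thread. The sketch leaves out the score=none local parameter from that filter; how method=topLevelDV interacts with it, and whether it helps at all, would need to be checked against the Solr version in use.

```python
def join_filter(from_index, from_field, to_field, query, method=None, negate=False):
    """Build a {!join ...} filter string from plain string arguments."""
    local_params = [
        "fromIndex=" + from_index,
        "from=" + from_field,
        "to=" + to_field,
    ]
    # Only add the method local parameter when one is requested.
    if method is not None:
        local_params.append("method=" + method)
    prefix = "-" if negate else ""
    return prefix + "{!join " + " ".join(local_params) + "}" + query


fq = join_filter(
    "primary_rollup", "group_id_mv", "group_member_id", "${q}",
    method="topLevelDV", negate=True,
)
print(fq)
```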

Re: join query parser performance

2023-05-25 Thread Joel Bernstein
One thing to understand about the topLevelDV approach is you'll need to warm both sides of the join. You can do this by adding a static warming query that facets on 'group_id_mv' and 'group_member_id' in both collections. Joel Bernstein http://joelsolr.blogspot.com/ On Thu, May 25, 2023 at 9:

Re: join query parser performance

2023-05-25 Thread Deepak Goel
Ron, please post actual figures: 1. Cpu, mem, disk, network utilisation 2. Response times 3. Load 4. Hardware config of server 5. Software config of server On Thu, 25 May 2023, 18:51 Joel Bernstein, wrote: > One thing to understand about the topLevelDv approach is you'll need to > warm both s

Redindex Collection API stalled

2023-05-25 Thread Rajani Maski
Hi Solr Users, The reindex collection API stops importing after it imports around 20% of the total docs from the source collection. How can I restart it? I could find "start", "abort" and "status" commands on the documentation page, but there is no command to restart from where it left off. The status is runn

Re: join query parser performance

2023-05-25 Thread Andy Lester
> On May 25, 2023, at 7:51 AM, Ron Haines wrote: > > So, when this feature is enabled, this negative &fq gets added: > -{!join fromIndex=primary_rollup from=group_id_mv to=group_member_id > score=none}${q} Can we see collection definitions of both the source collection and the join? Also, a

Re: Redindex Collection API stalled

2023-05-25 Thread Ed K
It has happened to me in the past and I've had to restart it from the beginning. On Thursday, May 25, 2023 at 06:46:04 AM PDT, Rajani Maski wrote: Hi Solr Users, The reindex collection api stops importing after it imports around 20% of total docs from source collection, how to restart i

Re: Deleting document on wrong shard?

2023-05-25 Thread Walter Underwood
Hmm. That doesn’t seem consistent with the format change added in SOLR-5890, where each ID can have a _route_ attribute. { "id":"ID", "_route_":"route" } https://issues.apache.org/jira/browse/SOLR-5890?focusedCommentId=14234208&page=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpa

Re: Redindex Collection API stalled

2023-05-25 Thread ufuk yılmaz
Are you specifying a list of field names when calling the api? -ufuk yilmaz — > On 25 May 2023, at 17:47, Ed K wrote: > > it has happened to me in the past and I've had to restart it from beginning. > >On Thursday, May 25, 2023 at 06:46:04 AM PDT, Rajani Maski > wrote: > > Hi Solr Us

Question on Solr filter syntax

2023-05-25 Thread Wei
Sent on behalf of my colleague, as his email does not seem to be showing up on the list. Hi Team, I was reading about special filter syntax in Solr documentation - https://solr.apache.org/guide/8_3/the-standard-query-parser.html#differences-between-lucenes-classic-query-parser-and-solrs-standard-query-p

Question on solr filter syntax

2023-05-25 Thread Wei
Sent on behalf of my colleague, as his email does not seem to be showing up on the list. Hi Team, I was reading about special filter syntax in Solr documentation - https://solr.apache.org/guide/8_3/the-standard-query-parser.html#differences-between-lucenes-classic-query-parser-and-solrs-standard-query-par

Re: Redindex Collection API stalled

2023-05-25 Thread Rajani Maski
No, whatever the default is, assuming it is fl=*. Would it help if I list the field names? It would be a large list, about 50-60 fields, so I am leaving it as the default. All the fields are stored. On Thu, May 25, 2023 at 11:09 AM ufuk yılmaz wrote: > Are you specifying a list of field names when calling th

Vector Search Parser(!vp) : calculate cosine

2023-05-25 Thread Rajani Maski
Hi Solr Users, Is there a vector search parser that allows computing the distance between a document's vector field and a query vector passed as a query param? The Jira issues SOLR-14397 and SOLR-12890 sugges

Re: Redindex Collection API stalled

2023-05-25 Thread ufuk yılmaz
AFAIK the reindexcollection command uses streaming expressions and the export handler under the hood, so the fields need to have docValues. I need to check the source code to see if it also works with just stored fields. If you have copyFields in your schema, it may also interfere because they would

Re: Redindex Collection API stalled

2023-05-25 Thread rajani m
Oh ok, it may be the silent timeout then. It has reindexed over a million records that mostly have stored fields. It stalled after that. The source collection has 20m records, so it may be a large collection and a silent timeout. The endpoint created a checkpoint collection that has one document in

Re: Vector Search Parser(!vp) : calculate cosine

2023-05-25 Thread kumar gaurav
Hi Rajani, Read here: https://solr.apache.org/guide/solr/latest/query-guide/dense-vector-search.html The knn query parser can be used. &q={!knn f=vector topK=10}[1.0, 2.0, 3.0, 4.0] Thanks Kumar Gaurav On Fri, 26 May 2023 at 00:14, Rajani Maski wrote: > Hi Solr Users, > >Is there a v
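
As a rough sketch, the query string shown above can be assembled from a field name and a list of numbers in Python. The helper knn_query is hypothetical; the field name "vector" and topK=10 are simply the values from the example in the message.

```python
def knn_query(field, vector, top_k=10):
    """Build a {!knn ...} query string for the given field and query vector."""
    # Render every component as a float so integers become e.g. "1.0".
    components = ", ".join(str(float(v)) for v in vector)
    return "{!knn f=%s topK=%d}[%s]" % (field, top_k, components)


q = knn_query("vector", [1, 2, 3, 4])
print(q)
```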

Re: join query parser performance

2023-05-25 Thread Mikhail Khludnev
Ron, how often are both indices updated? Presumably, if they are static, the filter cache may help. It's worth making sure that the app gives the filter cache a chance. To better understand the problem, it is worth taking a few thread dumps under load: a deep stack gives a clue for a hotspot (or just take a s

Re: Vector Search Parser(!vp) : calculate cosine

2023-05-25 Thread rajani m
That also invokes knn, correct? I just need the vector math response, a cosine value of the query vector and document vector. For example, the streaming api endpoint vector-math.html#dot-product-and-cosine-similarity
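
For reference, the cosine value being asked about is the dot product of the two vectors divided by the product of their lengths. The Python sketch below computes it on the client side; it is only an illustration of the math, not a Solr feature, and the function name cosine_similarity is hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Return the cosine of the angle between two equal-length vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same number of dimensions")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("cosine similarity is undefined for a zero vector")
    return dot / (norm_a * norm_b)


query_vector = [1.0, 2.0, 3.0, 4.0]
document_vector = [1.0, 2.0, 3.0, 4.0]
print(cosine_similarity(query_vector, document_vector))
```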

Re: Question on Solr filter syntax

2023-05-25 Thread Shawn Heisey
On 5/25/23 11:00, Wei wrote: For e.g. - Current FQ - fq: planName:all_features OR zipCode:1234 (in this case entire FQ is cached) I don't want to cache the entire FQ given it's very unique , but I want to cache the first part (plan_name:all_features) to improve performance . How do I achieve th