Creating custom analyzer

2023-02-02 Thread Neeraj giri
Hi all, I have a use case: the data I have is *mc burger*, and I want to get this result when querying *mcburger*. Can you please suggest the tokenizer and filters I can use while indexing and querying to achieve this? Regards, Neeraj giri

Re: Creating custom analyzer

2023-02-02 Thread Mikhail Khludnev
Hi, could DictionaryCompoundWordTokenFilter be what you need? See https://lucene.apache.org/core/9_0_0/analysis/common/org/apache/lucene/analysis/compound/package-summary.html On Thu, Feb 2, 2023 at 12:41 PM Neeraj giri wrote: > Hi all, > I have a use case, the data I have is *mc burger* and I want to get th
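
The dictionary-based decompounding idea suggested above can be illustrated with a small pure-Python sketch. This is not Lucene's implementation: the function name `decompound`, its parameters, their default values, and the two-word dictionary are all assumptions made for illustration. It only shows how a query token such as *mcburger* could be expanded into sub-words that also occur in the indexed text *mc burger*.

```python
def decompound(token, dictionary, min_word_size=5,
               min_subword_size=2, max_subword_size=15):
    """Return the original token followed by every dictionary word found inside it.

    Hypothetical helper: a simplified stand-in for dictionary-based
    decompounding, not the Lucene filter itself.
    """
    tokens = [token]
    # Short tokens are passed through untouched.
    if len(token) < min_word_size:
        return tokens
    lowered = token.lower()
    # Try every start offset and every allowed sub-word length.
    for start in range(len(lowered) - min_subword_size + 1):
        for length in range(min_subword_size, max_subword_size + 1):
            if start + length > len(lowered):
                break
            candidate = lowered[start:start + length]
            if candidate in dictionary:
                tokens.append(candidate)
    return tokens


# Assumed dictionary: the words expected to appear in compounds.
dictionary = {"mc", "burger"}

query_tokens = decompound("mcburger", dictionary)
indexed_tokens = "mc burger".split()

# The query now shares terms with the indexed text.
overlap = [t for t in query_tokens if t in indexed_tokens]
print(query_tokens, overlap)
```

In this sketch the expansion happens on the query side only, so the indexed text can keep a plain whitespace-style tokenization; whether that split between index-time and query-time analysis suits the real use case is a separate design decision.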

Re: Slow / unresponsive admin ui since solr 9.1 update

2023-02-02 Thread Jan Høydahl
Hi, Following up on this. I'd still say that the issue here seems to be that your ZooKeeper config lists 0.0.0.0 as the IP address for client connections. >>> The problem is related to the fact that we run solr and the zookeeper >>> ensemble dockerized. As we cannot bind zookeeper from docker to its

[Dense Vectors][Streaming] Store dense vector of a query in a streaming variable and use it for vector math

2023-02-02 Thread Gabriel Magno
Hi. I'm exploring the streaming expressions feature together with the Dense Vectors feature introduced in Solr 9. I am wondering whether it is possible to write a streaming expression that queries for a specific ID, stores its vector field in a variable, then uses this variable as an arr

facet.field reported as individual words and not the complete string

2023-02-02 Thread Matthew Castrigno
Hello community, thank you for taking the time to read my question. Your insights are most appreciated. I am making queries utilizing the facet.field parameter. The field I provide is multivalued. It is indexed using a dynamic field: It is indexed with the name "facets_ss". The values are of

Re: facet.field reported as individual words and not the complete string

2023-02-02 Thread Jeremy Buckley - IQ-C
The culprit here is text_general. Your field is getting tokenized at index time, and each token gets returned as a facet value. Fields that you plan to use for faceting (or sorting) should be string or some numeric type. Common practice is to define a second field of type string and use a copyFie
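
The effect described above, where a tokenized field yields one facet value per word while a string field yields the whole value, can be sketched in plain Python. This is only an illustration: `text_like` approximates a tokenized text field with lowercasing and whitespace splitting (the real text_general analysis chain does more), and the sample values are invented, not taken from the original question.

```python
from collections import Counter


def facet_counts(docs, analyze):
    """Count, per indexed term, how many documents contain it."""
    counts = Counter()
    for values in docs:
        terms = set()
        for value in values:
            terms.update(analyze(value))
        # A document contributes at most once to each term's count.
        counts.update(terms)
    return counts


def text_like(value):
    # Rough approximation of a tokenized text field: lowercase + split.
    return value.lower().split()


def string_like(value):
    # A string field indexes the whole value as a single term.
    return [value]


# Hypothetical multivalued field contents for two documents.
docs = [["Heart Care", "Urgent Care"], ["Heart Care"]]

tokenized = facet_counts(docs, text_like)
whole = facet_counts(docs, string_like)
print(dict(tokenized))
print(dict(whole))
```

With the tokenized variant the facet values are individual words such as "care"; with the string variant they are the complete strings, which is the behaviour the copyField-to-string approach is meant to provide.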

Re: facet.field reported as individual words and not the complete string

2023-02-02 Thread Matthew Castrigno
Thank you Jeremy. Can I use a dynamic field for the source field in a copyField directive? Is the order in which these statements appear relevant? Thank you! Matthew Castrigno IHT Developer II St. Luke’s Health System • 208-859-4276 • castr

Re: facet.field reported as individual words and not the complete string

2023-02-02 Thread Jeremy Buckley - IQ-C
Yes, you should be able to use a dynamicField as the source. That's really just a shorthand to keep you from having to enumerate all possible field names in the schema. I don't think order matters, but I tend to put copyField directives after all the field and dynamicField definitions, for readab

Re: Slow / unresponsive admin ui since solr 9.1 update

2023-02-02 Thread michael dürr
Hi Jan, Thanks for answering! > I don't know how you run these zk's dockerized, but I'd look for a workaround where you can configure the correct address in zk's configuration. Then Solr will be happy. There exists a workaround where you can assign certain address ranges to a docker host to run