Can the issue described in LUCENE-9981 be encountered with simple wildcard searches?

2021-10-19 Thread Andy C
Hi, We are using Solr 8.8.2. An automated scan indicated that this version of Solr is vulnerable to the issue described in https://issues.apache.org/jira/browse/LUCENE-9981 where certain searches can be extremely slow. Does anyone know if this issue will only occur with true regular expression qu

Re: RE: Index for text with space

2021-10-22 Thread Andy C
The issue appears to be the use of EdgeNGramFilterFactory in your field type. You have configured it with minGramSize="3" and have not specified preserveOriginal="true". So words shorter than 3 characters will not be indexed and therefore can't be searched. See https://solr.apache.org/guid
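To make the effect of minGramSize and preserveOriginal concrete, here is a minimal Python sketch of edge n-gram expansion for a single token. It is a simplified model for illustration, not Solr's implementation; the function name and the max_gram value of 10 are assumptions, since the original configuration's maxGramSize is not shown.

```python
def edge_ngrams(token, min_gram=3, max_gram=10, preserve_original=False):
    """Simplified model of edge n-gram expansion for one token (not Solr's code)."""
    # Emit every prefix whose length lies between min_gram and max_gram.
    grams = [token[:n] for n in range(min_gram, min(max_gram, len(token)) + 1)]
    # With preserve_original, a token shorter than min_gram or longer than
    # max_gram is also emitted unchanged, so it stays searchable.
    if preserve_original and (len(token) < min_gram or len(token) > max_gram):
        grams.append(token)
    return grams


for word in ["al", "abbas"]:
    print(word, edge_ngrams(word), edge_ngrams(word, preserve_original=True))
# al [] ['al']
# abbas ['abb', 'abba', 'abbas'] ['abb', 'abba', 'abbas']
```

In this model a two-character token produces no grams at all unless preserve_original is set, which matches the behavior described above.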

Re: Index for text with space

2021-10-25 Thread Andy C
t 7:28 AM, son hoang wrote: > >>>> > >>>> Yep, words less than 3 chars will not be indexed. But if "Al Abbas" > text can be separated into a token "Abbas" (and "Al" but it is not counted > as a token as it has 2 chars only) then we can

Re: Solr and CVE-2021-44228

2021-12-10 Thread Andy C
The https://solr.apache.org/security.html page states that all 7.X and all 8.X versions are vulnerable. However, looking at my 7.3.1 Solr instance, I am still finding the 1.2.17 version of the log4j jar. I found https://issues.apache.org/jira/browse/SOLR-7887, which indicates that th

Re: Solr and CVE-2021-44228

2021-12-10 Thread Andy C
Drob wrote: > Andy - you are correct, we will update the notice on the site. Thank you > for checking the details. > > On Fri, Dec 10, 2021 at 4:08 PM Andy C wrote: > > > The statement on the https://solr.apache.org/security.html page states > > that > > all 7.X and a

Re: Zookeeper and Solr and CVE-2021-44228

2021-12-13 Thread Andy C
Zookeeper has not yet migrated to log4j2. Even their latest releases (3.6.3, 3.7.0) are still using version 1.2.17 of log4j. So I would think that Zookeeper would be in the same situation as the pre-7.4.0 Solr releases as described here: https://solr.apache.org/security.html#apache-solr-affected-b

Re: Range query on TextField

2022-01-07 Thread Andy C
The behavior of the range query would depend on how the fieldType used by the staffName_txt is configured. I believe you will find that TextField is not the fieldType, but the base class your fieldType is implemented on. To use an example from one of the provided example schemas, the "_text" fiel

Re: Range query on TextField

2022-01-12 Thread Andy C
dTokenizerFactory". I restart my Solr and repeat two range > queries: > > q: staffName_txt:["Gross Bob" TO "Lindmar Deborah"] result: "Lindmar > Deborah", "Mr Kenyon John" and " Saab Jerry" > > q: staffName_txt:[* TO "

Re: Range query on TextField

2022-01-12 Thread Andy C
tions). - Andy - On Wed, Jan 12, 2022 at 2:02 PM Andy C wrote: > How are you changing the managed-schema? I have never used the managed > schema feature myself, but according to the documentation ( > https://solr.apache.org/guide/8_5/overview-of-documents-fields-and-schema-design.html#sol

Re: Customize sort-behaviour on solr.StrField for German language

2022-02-01 Thread Andy C
I haven't used it personally, but for fine grained control you might want to look at using either ICUCollationField or CollationField as documented here: https://solr.apache.org/guide/8_10/language-analysis.html#sorting-text-with-custom-rules There is a write up about implementing a custom sort or

Re: unexpected wildcard results

2022-02-18 Thread Andy C
I think the issue is the double quotes around your query string. Try searching for text:witch* instead. It appears that when surrounded by double quotes the * is treated as text and not a syntax character (wildcard). You can verify this by performing the query in the Solr Admin UI and checking the "

Re: Search query with uppercase getting different results

2022-04-01 Thread Andy C
I suspect that the fieldType of the field you are searching against is configured to use the Word Delimiter Graph Filter ( https://solr.apache.org/guide/8_11/filter-descriptions.html#word-delimiter-graph-filter) or perhaps the older variant of this filter, the Word Delimiter Filter. If that is the

Re: facet.field reported as individual words and not the complete string

2023-02-03 Thread Andy C
You can also use a dynamic field as the destination. So with the configuration below, if your document had a field 'foo_ss' it would create a 'foo_facets' <dynamicField name="*_facets" type="string" indexed="true" stored="true" required="false" multiValued="true" /> - Andy - On Thu, Feb 2, 20
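The name mapping described above can be sketched in Python. This is only an illustration of how a wildcard source pattern maps onto a wildcard destination; the "*_ss" source pattern and the function name are assumptions, because the copyField rule itself is not visible in the snippet.

```python
def copy_destination(field_name, source_pattern="*_ss", dest_pattern="*_facets"):
    """Return the destination field name, or None if the source does not match.

    Assumes both patterns have the form "*<suffix>" (hypothetical defaults).
    """
    suffix = source_pattern[1:]
    if not source_pattern.startswith("*") or not field_name.endswith(suffix):
        return None
    # The part matched by "*" in the source replaces "*" in the destination.
    stem = field_name[: len(field_name) - len(suffix)]
    return dest_pattern.replace("*", stem, 1)


print(copy_destination("foo_ss"))     # foo_facets
print(copy_destination("title_txt"))  # None
```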

Re: Filtering facets

2023-02-28 Thread Andy C
Have you looked at https://solr.apache.org/guide/8_11/faceting.html#limiting-facet-with-certain-terms Is something like *facet.field={!terms='user1,user2'}field1* what you want? - Andy - On Tue, Feb 28, 2023 at 3:36 AM Deepak Michael wrote: > Hi > > I have a multi-valued field, containing usern
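As a rough sketch, the facet.field value suggested above could be assembled from a list of terms like this. The helper name, the q and rows values, and the host and collection in the URL are all hypothetical; terms that themselves contain commas or quotes would need escaping that this sketch does not attempt.

```python
from urllib.parse import urlencode


def facet_terms_params(field, terms):
    """Build request parameters that limit a field facet to the given terms."""
    local_params = "{!terms='%s'}" % ",".join(terms)
    return {
        "q": "*:*",   # assumed: match all documents
        "rows": 0,    # assumed: only facet counts are wanted
        "facet": "true",
        "facet.field": local_params + field,
    }


params = facet_terms_params("field1", ["user1", "user2"])
print(params["facet.field"])  # {!terms='user1,user2'}field1

# Hypothetical host and collection name; urlencode percent-encodes the braces and quotes.
print("http://localhost:8983/solr/mycollection/select?" + urlencode(params))
```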

Re: phonetic search and accents

2023-03-16 Thread Andy C
A perhaps simplistic option would be to map accented letters to their unaccented versions using either the ASCII Folding Filter or the ICU Folding Filter. - Andy - On Thu, Mar 16, 2023 at 10:36 AM Luoni Cornelia wrote: > Hi, > > > > I'm using Solr for a search in a name database and get the bes
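For a feel of what accent folding does to names, here is a small Python approximation using Unicode decomposition. It is not the ASCII Folding Filter or the ICU Folding Filter, only a stand-in for illustration; in particular, letters without a Unicode decomposition (such as "ø" or "ß") pass through this sketch unchanged, whereas the Solr filters have explicit mappings for many of them.

```python
import unicodedata


def fold_accents(text):
    """Strip combining marks after NFKD decomposition (approximation only)."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


print(fold_accents("Müller"), fold_accents("José"), fold_accents("Renée"))
# Muller Jose Renee
```

Applying the same folding at index time and query time means "Muller" and "Müller" end up as the same term before any phonetic encoding is applied.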

Re: Newbie Help: Replicating Between Two SolrCloud Instances (Solr 9.2.1)

2023-10-20 Thread Andy C
Hi Dave, The zoo.cfg does not reference Solr at all. Each Zookeeper instance has 3 ports of note: - The "client port" that accepts requests from external clients (in this case Solr) - Two ports used for internal zookeeper to zookeeper communication The client port is configured by the
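To illustrate the three ports, the following Python sketch reads them out of a sample zoo.cfg. The sample uses the standard ZooKeeper property names (clientPort and server.N=host:port:port), but the hostnames, paths, and port numbers are made up for illustration and are not taken from Dave's setup; the parser also assumes the plain host:port:port form without any extra suffixes.

```python
SAMPLE_ZOO_CFG = """\
tickTime=2000
dataDir=/var/lib/zookeeper
clientPort=2181
server.1=zk1.example.com:2888:3888
server.2=zk2.example.com:2888:3888
server.3=zk3.example.com:2888:3888
"""


def parse_ports(cfg_text):
    """Return (client_port, {server_id: (host, quorum_port, election_port)})."""
    client_port = None
    servers = {}
    for line in cfg_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, value = line.split("=", 1)
        if key == "clientPort":
            # Port that external clients (here, Solr) connect to.
            client_port = int(value)
        elif key.startswith("server."):
            # The two remaining ports carry ZooKeeper-to-ZooKeeper traffic.
            host, quorum_port, election_port = value.split(":")[:3]
            server_id = int(key.split(".", 1)[1])
            servers[server_id] = (host, int(quorum_port), int(election_port))
    return client_port, servers


client_port, servers = parse_ports(SAMPLE_ZOO_CFG)
print(client_port)   # 2181
print(servers[1])    # ('zk1.example.com', 2888, 3888)
```

Nothing in this file mentions Solr; Solr only needs the host names and client ports of the ensemble on its side.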