solr query sanitizer?

2024-05-29 Thread Dmitri Maziuk
Hi all, our website has a search box that essentially passes its contents to Solr without any massaging. This works fine 99% of the time, the other 1% is when a misbehaving bot hits it and tries stuffing all sorts of crap in there. Then bad things happen: Java's overly verbose exception stac

Re: Seeking Advice: Setting up SSL in Solr 9.5 on Centos 7

2024-05-29 Thread Lee Daniel
Yh you're right, I did some more reading last night. I tried a few different domains last night and even disabled the SNI Check but no luck. I believe the issue is the 2 step process they have in the documentation for generating a self-signed certificate. There is more to the process and the

Re: solr query sanitizer?

2024-05-29 Thread Thomas Corthals
Solarium (a PHP client for Solr) has a helper method to escape search terms that uses a regex to escape special characters. https://github.com/solariumphp/solarium/blob/c2744ff706a2f0be148a45d702700fc346429679/src/Core/Query/Helper.php#L82 Thomas Op wo 29 mei 2024 om 16:11 schreef Dmitri Maziuk

Re: solr query sanitizer?

2024-05-29 Thread Mikhail Khludnev
Hello Dima You didn't mention the query parser. Perhaps https://solr.apache.org/guide/solr/latest/query-guide/other-parsers.html#simple-query-parser might be suitable. Regarding stacktraces in logs. I believe stack trances might be disabled via log config pls check https://logging.apache.org/log4j/

Re: solr query sanitizer?

2024-05-29 Thread Walter Underwood
I’ve done three kinds of sanity checks/fixes to avoid performance problems. 1. Prevent deep paging. Have to do this every time. When a request comes in for a page past 50, it gets rewritten to the 50th page. 2. Limit the size of queries. With homework help, we had people pasting in 800 word que

Re: solr query sanitizer?

2024-05-29 Thread Dmitri Maziuk
On 5/29/24 11:43, Walter Underwood wrote: I’ve done three kinds of sanity checks/fixes to avoid performance problems. 1. Prevent deep paging. Have to do this every time. When a request comes in for a page past 50, it gets rewritten to the 50th page. 2. Limit the size of queries. With homework

Re: solr query sanitizer?

2024-05-29 Thread Walter Underwood
Honestly, there is a missing feature here. Solr should have a free text query parser. Run the query through standard tokenizer, ignore all the syntax, and make a bunch of word/phrase queries. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On May 29, 2

Re: Performance Suggestion for Dense Vectors

2024-05-29 Thread David Smiley
There *is* a Solr blog site that just launched: https://solr.apache.org/blog.html On Thu, Mar 28, 2024 at 3:49 PM rajani m wrote: > > @Alessandro, > Is there a solr blog site where we can submit work/articles or are you > suggesting to post on my own site and share a link here? I prefer the > fo

[ANNOUNCE] Apache Solr 9.6.1 released

2024-05-29 Thread Houston Putman
The Solr PMC is pleased to announce the release of Apache Solr 9.6.1. Solr is the popular, blazing fast, open source NoSQL search platform from the Apache Solr project. Its major features include powerful full-text search, hit highlighting, faceted search, dynamic clustering, database integration,

Solr query on multivalued field , removing duplicate results and getting distinct results

2024-05-29 Thread Natarajan, Rajeswari
Hi , Looking to get distinct results from the query to multi value field. Field collapsing will not work on multi value field and not inclined to use faceting. Is there any other way this can be achieved. Thanks, Rajeswari

Re: Solr query on multivalued field , removing duplicate results and getting distinct results

2024-05-29 Thread Mikhail Khludnev
For some of these cases JSON Facets were an answer. However, topdocs https://issues.apache.org/jira/browse/SOLR-7830 isn't available. On Thu, May 30, 2024 at 5:41 AM Natarajan, Rajeswari wrote: > Hi , > > Looking to get distinct results from the query to multi value field. Field > collapsing wil