It is standard practice in search to improve recall by transforming the inputs. This transformation is known as an analysis chain. Common operations are:
- Lowercase everything
- Throw out punctuation
- Split on whitespace
- Split on case change or alphabetic-to-numeric transition (RedCar23 -> red car 23)

There are lots of other possibilities, and what's appropriate depends on your use case. You can add analysis steps, or remove one or more of the defaults, by configuring the analyzer for your fields in the schema.

In your case, the standalone * in doc2 is most likely being thrown out as punctuation when the document is indexed, so the index holds no token for an escaped \* to match.

To see how analysis affects your document text and your query text, enter the corresponding text on the analysis admin screen:

https://solr.apache.org/guide/solr/latest/indexing-guide/analysis-screen.html

On Mon, Nov 10, 2025 at 3:13 AM Michal Steinberger <[email protected]> wrote:

> Hi all,
>
> I created a new Solr service and added the following files to it:
>
> [{
>   "id":"doc1",
>   "title":["First Document"],
>   "content":["This is the content of the first document."],
>   "_version_":1848387427708698624,
>   "_root_":"doc1"
> },{
>   "id":"doc2",
>   "title":["Second Document"],
>   "content":["This is the content * of the first document."],
>   "_version_":1848387447438704640,
>   "_root_":"doc2"
> },{
>   "id":"doc3",
>   "title":["Third Document"],
>   "content":["This is the content ? of the first document."],
>   "_version_":1848387463546929152,
>   "_root_":"doc3"
> },{
>   "id":"doc4",
>   "title":["Four Document"],
>   "content":["This is the (content) of the first document."],
>   "_version_":1848387754669375488,
>   "_root_":"doc4"
> }]
>
> I ran this query:
>
> http://localhost:8983/solr/new_collection/select?_=1762759655755&indent=true&q=*&q.op=OR&useParams=
>
> and I got all the documents back. I tried to use \* to get only doc2, but
> nothing is returned. What am I missing?
>
> Michal

--
http://www.needhamsoftware.com (work)
https://a.co/d/b2sZLD9 (my fantasy fiction book)
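
P.S. In case it helps to see the idea concretely, here is a rough Python sketch of the four operations listed above. It is only an illustration: the function name analyze and the ASCII-only regular expression are my own assumptions, and this is not how Solr implements analysis (Solr builds the chain from tokenizers and filters configured on the field type). It does show why a bare * leaves nothing behind to search for.

```python
import re

# Hypothetical helper, not part of Solr: a toy analysis chain.
# The pattern keeps runs of letters or digits and breaks on case changes
# and letter/digit transitions; everything else (punctuation) is dropped.
_PART = re.compile(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|[0-9]+")


def analyze(text: str) -> list[str]:
    """Return the tokens a simple analysis chain would produce for text."""
    tokens = []
    for chunk in text.split():                    # split on whitespace
        parts = _PART.findall(chunk)              # split on case/digit change, drop punctuation
        tokens.extend(p.lower() for p in parts)   # lowercase everything
    return tokens


if __name__ == "__main__":
    print(analyze("RedCar23"))
    print(analyze("This is the content * of the first document."))
    print(analyze("*"))
```

Under this toy chain the content of doc1 and doc2 produces exactly the same tokens, and a query consisting only of * produces none at all. The analysis admin screen will show what your real field type does with the same inputs.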
