Re: Index for text with space

2021-10-25 Thread Andy C
I would think your problem goes beyond 1 and 2 characters words not being indexed. With your current field type definition, if someone searches for "can" it will retrieve documents that contain any word that start with "can". So "candidate", canadian", "cantina", etc. Is this really the desired s

Re: Index for text with space

2021-10-25 Thread Dave
You can pre process the query to remove anything not indexed (less than 3 characters) but that initial scheme decision was a mistake, and should be remedied and reindexed. > On Oct 25, 2021, at 8:36 AM, son hoang wrote: > > Is there any way in the query so that I do not need to reindex the w

Re: Index for text with space

2021-10-25 Thread son hoang
Is there any way in the query so that I do not need to reindex the whole data? On 2021/10/23 15:39:18, Walter Underwood wrote: > Agreed. There is a simple fix. Index all the words. Also, stop using > EdgeNgramFilter. > That is only used for completion, not word search. > > wunder > Walter Unde

Re: Index for text with space

2021-10-23 Thread Walter Underwood
Agreed. There is a simple fix. Index all the words. Also, stop using EdgeNgramFilter. That is only used for completion, not word search. wunder Walter Underwood wun...@wunderwood.org http://observer.wunderwood.org/ (my blog) > On Oct 23, 2021, at 4:31 AM, Dave wrote: > > Why ever would you no

Re: Index for text with space

2021-10-23 Thread Dave
Why ever would you not index less than three characters? “To be or not to be” Seems like a significant search > On Oct 23, 2021, at 7:28 AM, son hoang wrote: > > Yep, words less than 3 chars will not be indexed. But if "Al Abbas" text can > be separated into a token "Abbas" (and "Al" but it

Re: RE: Index for text with space

2021-10-23 Thread son hoang
Yep, words less than 3 chars will not be indexed. But if "Al Abbas" text can be separated into a token "Abbas" (and "Al" but it is not counted as a token as it has 2 chars only) then we can apply OR condition in the query? On 2021/10/22 14:37:51, Andy C wrote: > The issue looks to me to be

Re: RE: Index for text with space

2021-10-22 Thread Andy C
The issue looks to me to be with the use of EdgeNGramFilterFactory in your field type. You have configured it with minGramSize="3" and have not specified preserveOriginal="true". So words less than 3 characters will not be indexed, and therefore can't be searched. See https://solr.apache.org/guid

Re: RE: Index for text with space

2021-10-22 Thread son hoang
Thanks, Thamiz It seems that I have index=StandardTokenizerFactory causing the issue I do not want to re-index. Is there any solution ? Should I have query "OR" so that the search can return "Al Abbas" when I have "Al Abbas" in the query field (eg: there is a OR match "Abbas" ? Thanks On 2

RE: Index for text with space

2021-10-21 Thread Thamizhazhagan B
Hi, Create a copy field as below and use this copyfield in your query.. Thanks, Thamizh -Original Message- From: son hoang Sent: Thursday, October 21, 2021 8:19 AM To: users@solr.apache.org Subject: Index fo

Re: Index for text with space

2021-10-20 Thread Aroop Ganguly
Can you share your query syntax in both cases please? > On Oct 20, 2021, at 5:19 PM, son hoang wrote: > > Hello > > I have a config like this: > > > > > > > maxGramSize="15"/> > > > >