I would think your problem goes beyond 1 and 2 characters words not being
indexed.
With your current field type definition, if someone searches for "can" it
will retrieve documents that contain any word that start with "can". So
"candidate", canadian", "cantina", etc.
Is this really the desired s
You can pre process the query to remove anything not indexed (less than 3
characters) but that initial scheme decision was a mistake, and should be
remedied and reindexed.
> On Oct 25, 2021, at 8:36 AM, son hoang wrote:
>
> Is there any way in the query so that I do not need to reindex the w
Is there any way in the query so that I do not need to reindex the whole data?
On 2021/10/23 15:39:18, Walter Underwood wrote:
> Agreed. There is a simple fix. Index all the words. Also, stop using
> EdgeNgramFilter.
> That is only used for completion, not word search.
>
> wunder
> Walter Unde
Agreed. There is a simple fix. Index all the words. Also, stop using
EdgeNgramFilter.
That is only used for completion, not word search.
wunder
Walter Underwood
wun...@wunderwood.org
http://observer.wunderwood.org/ (my blog)
> On Oct 23, 2021, at 4:31 AM, Dave wrote:
>
> Why ever would you no
Why ever would you not index less than three characters?
“To be or not to be”
Seems like a significant search
> On Oct 23, 2021, at 7:28 AM, son hoang wrote:
>
> Yep, words less than 3 chars will not be indexed. But if "Al Abbas" text can
> be separated into a token "Abbas" (and "Al" but it
Yep, words less than 3 chars will not be indexed. But if "Al Abbas" text can be
separated into a token "Abbas" (and "Al" but it is not counted as a token as
it has 2 chars only) then we can apply OR condition in the query?
On 2021/10/22 14:37:51, Andy C wrote:
> The issue looks to me to be
The issue looks to me to be with the use of EdgeNGramFilterFactory in your
field type. You have configured it with minGramSize="3" and have not
specified preserveOriginal="true".
So words less than 3 characters will not be indexed, and therefore can't be
searched.
See
https://solr.apache.org/guid
Thanks, Thamiz
It seems that I have index=StandardTokenizerFactory causing the issue
I do not want to re-index. Is there any solution ? Should I have query "OR" so
that the search can return "Al Abbas" when I have "Al Abbas" in the query
field (eg: there is a OR match "Abbas" ?
Thanks
On 2
Hi,
Create a copy field as below and use this copyfield in your query..
Thanks,
Thamizh
-Original Message-
From: son hoang
Sent: Thursday, October 21, 2021 8:19 AM
To: users@solr.apache.org
Subject: Index fo
Can you share your query syntax in both cases please?
> On Oct 20, 2021, at 5:19 PM, son hoang wrote:
>
> Hello
>
> I have a config like this:
>
>
>
>
>
>
> maxGramSize="15"/>
>
>
>
>
10 matches
Mail list logo