Re: Ignoring “de la” at index or search time

2019-02-23 Thread baris.kazar
In this case search string is c b and then search query has 8 combos including two cases with c b ~ which means find all containing c And b and c Or b ( two separate queries having ~ ) and then i can find a b but not a de la b without French stopwords. Thanks > On Feb 23, 2019, at 6:52 PM, Erick

Re: Ignoring “de la” at index or search time

2019-02-23 Thread Erick Erickson
Lucene won’t ignore these unless you tell it to via stopwords. This is a problem no matter how you look at it. If you do put in stopwords, the word _positions_ are retained. In your example, word position a 1 de 2 la 3 b 4 If you remove “de” and “la” via s

Re: Ignoring “de la” at index or search time

2019-02-23 Thread baris.kazar
Thanks Erick there is a pattern i cant catch in my results such as: a de la b i catch “a b” though. I though Lucene might ignore those automatically while creating index. > On Feb 23, 2019, at 12:29 PM, Erick Erickson wrote: > > Use stopwords, although it's becoming less of a concern, why do yo

Re: Ignoring “de la” at index or search time

2019-02-23 Thread Erick Erickson
Use stopwords, although it's becoming less of a concern, why do you think you need to? On Sat, Feb 23, 2019, 08:42 baris.kazar wrote: > Hi,- > What is the (most efficient) way to > ignore “de la” kinda connectors > in a string at index or search time? > Thanks > > ---

Ignoring “de la” at index or search time

2019-02-23 Thread baris.kazar
Hi,- What is the (most efficient) way to ignore “de la” kinda connectors in a string at index or search time? Thanks - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h.