Re: Synonyms and searching

2025-04-21 Thread Anh Dũng Bùi
[act as web server] generates: > > [work] > [act] > [like] > [as] > [internet] > [web] > [host] > [server] > > and the input: [act_as_web_server] generates: > > [work] > [act] > [act_] > [like] > [as] > [as_] > [internet] > [web] > [web_

RE: Synonyms and searching

2025-03-10 Thread Trevor Nicholls
phrases. But maybe my expectations are too high, or maybe I'm just doing it wrong. cheers T -Original Message- From: Uwe Schindler Sent: Monday, 10 March 2025 23:38 To: java-user@lucene.apache.org Subject: Re: Synonyms and searching Hi, Another way to do this is using Word Delimite

Re: Synonyms and searching

2025-03-10 Thread Uwe Schindler
Hi, Another way to do this is using Word Delimiter Filter and use "catenate" options. Be aware that you need special text tokenization (not use standard tokenizer, but instead WhitespaceTokenizer). This approach is common for product numbers. To no break you "normal" analysis, it is often a

Re: Synonyms and searching

2025-03-05 Thread Michael Sokolov
One thing to check is whether the synonyms are configured as bidirectional, or which direction they go (eg is "a b" being expanded to "ab" but "ab" is not being expanded to "a b"??) On Wed, Mar 5, 2025 at 2:20 PM Mikhail Khludnev wrote: > > Hello Trevor. > > Maintaining such a synonym map is too

Re: Synonyms and searching

2025-03-05 Thread Mikhail Khludnev
Hello Trevor. Maintaining such a synonym map is too much of a burden. One idea: sticks words together with "" separator with https://lucene.apache.org/core/8_0_0/analyzers-common/org/apache/lucene/analysis/shingle/ShingleFilter.html Another idea, the opposite breaks user's words via dictionary htt

Re: synonyms

2017-07-25 Thread Christian Kaufhold
Yep, you hit the point. Thank you so much! Output is now #Hits: 1 : Hochschule #Hits: 3 : Hello #Hits: 1 : people #Hits: 1 : universität term: hello term: hochschule term: people term: universität term: world >>> Alan Woodward 07/25/17 9:14 AM >>> You have a LowercaseFilter before y

Re: synonyms

2017-07-25 Thread Alan Woodward
You have a LowercaseFilter before your SynonymFilter, which means that the entities in your SynonymMap need to be all lowercase or they won’t be matched. Alan Woodward www.flax.co.uk > On 25 Jul 2017, at 07:52, Christian Kaufhold > wrote: > > Hi, > > I am not able to add synonyms to the luc

Re: Synonyms and Ranking

2008-01-03 Thread Michael Stoppelman
Hi all, Would this approach be recommended for stemmed words as well. For example let say the original word is 'mower', I want matches on 'mow', 'mowing' and 'mowers' but the most relevance would obviously be matches for 'mower'. Should I index my documents unstemmed and then stem at the query wor

Re: Synonyms and Ranking

2007-12-28 Thread Grant Ingersoll
Yes, the Payload stuff should work for this, but you will have to set it up during indexing. The simpler approach is probably a separate field for synonyms, but this means analyzing the same content twice (or trying out the TeeTokenFilter, but this is advanced usage at this point, since it

Re: Synonyms and Ranking

2007-12-28 Thread Frank Schima
Hi Grant, Grant Ingersoll-6 wrote: > > You can use the payload functionality (have a look at > BoostingTermQuery and Michael B. excellent ApacheCon talk at > http://people.apache.org/~buschmi/apachecon/). Other option is to put > the synonyms into a separate field and boost that less than

Re: Synonyms and Ranking

2007-12-27 Thread Grant Ingersoll
You can use the payload functionality (have a look at BoostingTermQuery and Michael B. excellent ApacheCon talk at http://people.apache.org/~buschmi/apachecon/). Other option is to put the synonyms into a separate field and boost that less than the main field. -Grant On Dec 27, 2007, at 4

Re: Synonyms in Lucene 2.2

2007-12-27 Thread Erick Erickson
Oh, now I get it . While I thoroughly approve of the strong typing in generics, I continually trip over code written for the 1.4 code base not being approved by the 1.5 compiler... Best Erick On Dec 27, 2007 12:29 PM, Frank Schima <[EMAIL PROTECTED]> wrote: > > Hi Erick, > > > > Erick Erickson w

Re: Synonyms in Lucene 2.2

2007-12-27 Thread Frank Schima
Hi Erick, Erick Erickson wrote: > > I don't think this has anything to do with Lucene, the problem > seems to be that your compiler can't find the Java Stack > class. > > You need to set your classpath to include wherever > java.utils is on your disk. > I agree it's a Java issue. I'm just u

Re: Synonyms in Lucene 2.2

2007-12-27 Thread Erick Erickson
I don't think this has anything to do with Lucene, the problem seems to be that your compiler can't find the Java Stack class. You need to set your classpath to include wherever java.utils is on your disk. Erick On Dec 27, 2007 10:56 AM, Frank Schima <[EMAIL PROTECTED]> wrote: > > Hello all,

Re: Synonyms and Aliases query

2007-03-27 Thread Erick Erickson
See below... On 3/27/07, daveburns <[EMAIL PROTECTED]> wrote: Hi, afriad I'm a noobie at Luncene but read Otis/Eriks book and was hoping someone can answer a quick question on the AliasAnalyzer (Chap 4). I want to build a search for names (Companies/surname, firstname etc) but need to match th

Re: Synonyms and Aliases query

2007-03-27 Thread daveburns
Thanks for the quick reply I'm using the synonym engine from LIA for both parsing queries and building the index. Do you have the code for a synonym engine that would work for all matches. I'm using ver 2.1 of lucene core. Thanks again Dave -- View this message in context: http://www.nabbl

Re: Synonyms and Aliases query

2007-03-27 Thread sandeep chawla
in a synonym Engine... suppose synonyms of word x is syn(x)... then if y = syn(x) then x = syn(y) doesn't hold true always .( you might not get any synonyms of y..it depends on the data of synonym engine) so your synonym engine might be providing alias of bob as robert, rob, bobby...

RE: Synonyms ...

2006-05-09 Thread Ziv Gome
You are free to take a look at the thread about synonym query from mars, initiated by Andrew Schetinin and myself. This code (suggestion) tries to handle synonym as a query expansion, rather than injection at indexing time, while fix the problems a simple expansion creates (mainly results of IDF).

RE: Synonyms ...

2006-04-22 Thread Colin Young
The mock synonym code in LIA will work just fine with multiple words, but you need to pass them to the synonym engine as a single string containing multiple words, which means treating some phrases in the input text as single tokens, and that's likely where you'll have to put in the work. I think

Re: Synonyms ...

2006-04-21 Thread Yonik Seeley
On 4/21/06, Dragon Fly <[EMAIL PROTECTED]> wrote: > I don't think the SynonymAnalyzer described in LIA would work because > some of my "synonyms" contain multiple words. The SynonymFilter in Solr can handle multi-word synonyms. http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters http://in