[act as web server] generates:
>
> [work]
> [act]
> [like]
> [as]
> [internet]
> [web]
> [host]
> [server]
>
> and the input: [act_as_web_server] generates:
>
> [work]
> [act]
> [act_]
> [like]
> [as]
> [as_]
> [internet]
> [web]
> [web_
phrases. But maybe my expectations are too high,
or maybe I'm just doing it wrong.
cheers
T
-Original Message-
From: Uwe Schindler
Sent: Monday, 10 March 2025 23:38
To: java-user@lucene.apache.org
Subject: Re: Synonyms and searching
Hi,
Another way to do this is using Word Delimite
Hi,
Another way to do this is using Word Delimiter Filter and use "catenate"
options. Be aware that you need special text tokenization (not use
standard tokenizer, but instead WhitespaceTokenizer). This approach is
common for product numbers.
To no break you "normal" analysis, it is often a
One thing to check is whether the synonyms are configured as
bidirectional, or which direction they go (eg is "a b" being expanded
to "ab" but "ab" is not being expanded to "a b"??)
On Wed, Mar 5, 2025 at 2:20 PM Mikhail Khludnev wrote:
>
> Hello Trevor.
>
> Maintaining such a synonym map is too
Hello Trevor.
Maintaining such a synonym map is too much of a burden.
One idea: sticks words together with "" separator with
https://lucene.apache.org/core/8_0_0/analyzers-common/org/apache/lucene/analysis/shingle/ShingleFilter.html
Another idea, the opposite breaks user's words via dictionary
htt
Yep, you hit the point.
Thank you so much!
Output is now
#Hits: 1 : Hochschule
#Hits: 3 : Hello
#Hits: 1 : people
#Hits: 1 : universität
term: hello
term: hochschule
term: people
term: universität
term: world
>>> Alan Woodward 07/25/17 9:14 AM >>>
You have a LowercaseFilter before y
You have a LowercaseFilter before your SynonymFilter, which means that the
entities in your SynonymMap need to be all lowercase or they won’t be matched.
Alan Woodward
www.flax.co.uk
> On 25 Jul 2017, at 07:52, Christian Kaufhold
> wrote:
>
> Hi,
>
> I am not able to add synonyms to the luc
Hi all,
Would this approach be recommended for stemmed words as well. For example
let say the original word is
'mower', I want matches on 'mow', 'mowing' and 'mowers' but the most
relevance would obviously be matches
for 'mower'. Should I index my documents unstemmed and then stem at the
query wor
Yes, the Payload stuff should work for this, but you will have to set
it up during indexing. The simpler approach is probably a separate
field for synonyms, but this means analyzing the same content twice
(or trying out the TeeTokenFilter, but this is advanced usage at this
point, since it
Hi Grant,
Grant Ingersoll-6 wrote:
>
> You can use the payload functionality (have a look at
> BoostingTermQuery and Michael B. excellent ApacheCon talk at
> http://people.apache.org/~buschmi/apachecon/). Other option is to put
> the synonyms into a separate field and boost that less than
You can use the payload functionality (have a look at
BoostingTermQuery and Michael B. excellent ApacheCon talk at
http://people.apache.org/~buschmi/apachecon/). Other option is to put
the synonyms into a separate field and boost that less than the main
field.
-Grant
On Dec 27, 2007, at 4
Oh, now I get it . While I thoroughly approve of the strong
typing in generics, I continually trip over code written for the 1.4
code base not being approved by the 1.5 compiler...
Best
Erick
On Dec 27, 2007 12:29 PM, Frank Schima <[EMAIL PROTECTED]> wrote:
>
> Hi Erick,
>
>
>
> Erick Erickson w
Hi Erick,
Erick Erickson wrote:
>
> I don't think this has anything to do with Lucene, the problem
> seems to be that your compiler can't find the Java Stack
> class.
>
> You need to set your classpath to include wherever
> java.utils is on your disk.
>
I agree it's a Java issue. I'm just u
I don't think this has anything to do with Lucene, the problem
seems to be that your compiler can't find the Java Stack
class.
You need to set your classpath to include wherever
java.utils is on your disk.
Erick
On Dec 27, 2007 10:56 AM, Frank Schima <[EMAIL PROTECTED]> wrote:
>
> Hello all,
See below...
On 3/27/07, daveburns <[EMAIL PROTECTED]> wrote:
Hi,
afriad I'm a noobie at Luncene but read Otis/Eriks book and was hoping
someone can answer a quick question on the AliasAnalyzer (Chap 4). I want
to
build a search for names (Companies/surname, firstname etc) but need to
match th
Thanks for the quick reply
I'm using the synonym engine from LIA for both parsing queries and building
the index. Do you have the code for a synonym engine that would work for all
matches.
I'm using ver 2.1 of lucene core.
Thanks again
Dave
--
View this message in context:
http://www.nabbl
in a synonym Engine...
suppose synonyms of word x is syn(x)...
then if y = syn(x) then x = syn(y) doesn't hold true always .(
you might not get any synonyms of y..it depends on the data of synonym
engine)
so your synonym engine might be providing alias of bob as robert,
rob, bobby...
You are free to take a look at the thread about synonym query from mars,
initiated by Andrew Schetinin and myself. This code (suggestion) tries
to handle synonym as a query expansion, rather than injection at
indexing time, while fix the problems a simple expansion creates (mainly
results of IDF).
The mock synonym code in LIA will work just fine with multiple words,
but you need to pass them to the synonym engine as a single string
containing multiple words, which means treating some phrases in the
input text as single tokens, and that's likely where you'll have to put
in the work.
I think
On 4/21/06, Dragon Fly <[EMAIL PROTECTED]> wrote:
> I don't think the SynonymAnalyzer described in LIA would work because
> some of my "synonyms" contain multiple words.
The SynonymFilter in Solr can handle multi-word synonyms.
http://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters
http://in
20 matches
Mail list logo