ver will find matches to any of those forms. But
> because my search method boosts exactText matches, the exact form they
> search for will be scored higher.
>
> Because the custom analyzer produces word tokens I have incorporated
> synonyms (when indexing). The synonyms are bidirectional.
, the exact form they search for will be scored
higher.
Because the custom analyzer produces word tokens I have incorporated synonyms
(when indexing). The synonyms are bidirectional.
OK, that's all background.
The problem I am trying to fix at the moment is that searching for these
compoun
uct the index and the search so that if the
user searches for e.g. "http proxyserver" they also find a match. I thought
it would be sufficient to add an entry to the synonym map specifying that
"http proxy" and "httpproxy" are synonyms, and likewise "proxy server&qu
One thing to check is whether the synonyms are configured as
bidirectional, or which direction they go (eg is "a b" being expanded
to "ab" but "ab" is not being expanded to "a b"??)
On Wed, Mar 5, 2025 at 2:20 PM Mikhail Khludnev wrote:
>
> Hello Tre
and all will
> match.
>
>
>
> However, I am trying to construct the index and the search so that if the
> user searches for e.g. "http proxyserver" they also find a match. I thought
> it would be sufficient to add an entry to the synonym map specifying that
> &qu
t would be sufficient to add an entry to the synonym map specifying that
"http proxy" and "httpproxy" are synonyms, and likewise "proxy server" and
"proxyserver". (When adding multiple-word phrases the spaces are replaced by
SynonymMap.WORD_SEPARATOR).
Th
Hi Lucene users,
I recently came across SynonymQuery and found out that it only supports
single-term synonyms (since it accepts a list of Term which will be
considered as synonyms). We have some multi-term synonyms like "internet
device" <-> "wifi router" or "dns&q
hich produces nice
> looking graphs right from the get-go (tokenizer), with compound words also
> properly decompounded so both options are indexed/searched.
>
> History: we created SynonymGraphFilter, along with other important
> QueryParser (e.g. http://issues.apache.org/jira/browse/L
th options are indexed/searched.
History: we created SynonymGraphFilter, along with other important
QueryParser (e.g. http://issues.apache.org/jira/browse/LUCENE-7603) and
Query improvements, to get multi-term synonyms working correctly, finally
in Lucene. With the old SynonymFilter, position
Hello,
I'm working on a project that involves search in Japanese and uses
synonyms. The Japanese tokenizer creates an analysis graph, but the
SynonymGraphFilter states it cannot take a graph as input. After a few
tests I've seen it can create some unusual outputs if given a graph as
PM baris.kazar wrote:
> i should have asked this way as Mike made clear for MultiPhraseQuery:
> is PhraseQuery ok to account for synonyms?
> Best
>
> > On Sep 20, 2018, at 2:02 PM, baris.ka...@oracle.com wrote:
> >
> > Hi,-
> >
> > should i use MultiPhrase
i should have asked this way as Mike made clear for MultiPhraseQuery:
is PhraseQuery ok to account for synonyms?
Best
> On Sep 20, 2018, at 2:02 PM, baris.ka...@oracle.com wrote:
>
> Hi,-
>
> should i use MultiPhraseQuery or PhraseQuery to take synonyms into account?
&g
Hi,-
should i use MultiPhraseQuery or PhraseQuery to take synonyms into
account?
Best regards
baris
-
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h
2018年6月21日(木) 20:09 egorlex :
> Hi,
>
> I am new in Lucene and I need add synonyms to my search.
>
> I tried use SynonymFilterFactory for this purpose, but it deprecated for
> Lucene 7.3.0
>
> Please help me with some examples or documentation. What is the b
Hi,
I am new in Lucene and I need add synonyms to my search.
I tried use SynonymFilterFactory for this purpose, but it deprecated for
Lucene 7.3.0
Please help me with some examples or documentation. What is the best way to
realized it?
--
Sent from: http://lucene.472066.n3.nabble.com
Hello,
I'm trying to implement a weighed synonyms analyzer using
SynonymGraphFilter and a custom payload analyzer.
My synonyms file looks like:
obama,president^0.9,barry^0.5
speaks,greets^0.8,congratulates^0.9,informs
media,press,journal,tv^0.8
I want user searching for "obama greets
I am using Lucene 4.8 (.net flavour) and cannot find a decent working example
to answer my issue.
In our source data we have lots of similar items that can be described in the
same way - for example "lawnmower", "lawn mower" & "grass cutter".
Obviously we have no control over how people choose
rcaseFilter before your SynonymFilter, which means that the
entities in your SynonymMap need to be all lowercase or they won’t be matched.
Alan Woodward
www.flax.co.uk
> On 25 Jul 2017, at 07:52, Christian Kaufhold
> wrote:
>
> Hi,
>
> I am not able to add synonyms to the lu
You have a LowercaseFilter before your SynonymFilter, which means that the
entities in your SynonymMap need to be all lowercase or they won’t be matched.
Alan Woodward
www.flax.co.uk
> On 25 Jul 2017, at 07:52, Christian Kaufhold
> wrote:
>
> Hi,
>
> I am not able to a
Hi,
I am not able to add synonyms to the lucene index.
I condensed my problem into the following class which is based on a Hello World
example.
The idea behind the code was to add a document with universität and the synonym
'Hochschule' (highschool)
so that lucene finds universität we
Hi Daniel,
Since you are restricting inOrder=true and proximity=0 in the top level query,
there is no problem in your particular example.
If you weren't restricting, injecting synonyms with plain OR, sometimes cause
'query drift': injection/addition of one term changes result l
I'm investigating various ways of supporting synonyms in Lucene.
One such approach that looks potentially interesting is to do a kind of
"query expansion".
For example, if the user searches for "us 1888", one might expand the
query as follows:
Span
From: venkatesham.gu...@igate.com
Sent: Saturday, May 17, 2014 2:28 AM
To: java-user@lucene.apache.org
Subject: A work around to get matching terms from document - Stemmed and
Synonyms
I am looking for a feature in SOLR that will give me all matched terms in
the
document when I search with a
java-user@lucene.apache.org
Subject: A work around to get matching terms from document - Stemmed and
Synonyms
I am looking for a feature in SOLR that will give me all matched terms in
the
document when I search with a query term, My SOLR field uses Stemming and
Synonym filters as a result of this I
program
which will take each document from the result and apply tokenizer, stemming
and synonym and match with query term(apply stemming and synonyms) to
identify the exact terms from the document, before applying the stemming and
synonym making a copy of the original terms.Please suggest is there any
words - in this example operation-
Thanks
Venkatesham Gundu
--
View this message in context:
http://lucene.472066.n3.nabble.com/Matched-words-from-document-Stemmed-and-Synonyms-tp4079968.html
Sent from the Lucene - Java Users mailing list archive at
Hi everyone!
I tried to inject the synonyms during indexing. I created a custom analyzer.
It pretty much does the same as a standard analyzer with an addition of
having synonym filter. During indexing and searching, I pass this custom
analyzer to both indexwriter and indexsearcher, but the
tomAnalyzer as part of my querying, then if I
enter the query as
myFieldName: manager
it expands the query with synonyms for manager.
But, I want the synonyms to be part of only my index and I don't want my
query to be expanded with synonyms.
So, when I removed the SynonymFilter f
Ah, you're expanding synonyms at query time. Missed that.
How is the indexing done in the first place? What analyzer is used? Have you
even tried using Luke to look into your index and see what's actually in there?
What is shown when you do an explain?
Most often this kind of thing i
edevil)
2011/2/13 Erick Erickson
> At a guess make is a synonym for one of your search terms. doc.get
> returns the original content, not synonyms.
>
>
> So what are your synonyms that might be a factor here?
>
> Best
> Erick
>
> On Sat, Feb 12, 2011 at 6:04 A
At a guess make is a synonym for one of your search terms. doc.get
returns the original content, not synonyms.
So what are your synonyms that might be a factor here?
Best
Erick
On Sat, Feb 12, 2011 at 6:04 AM, Gong Li wrote:
> Hi,
>
> I am tying WordNet synonyms into an SynonymAnalyz
Hi,
I am tying WordNet synonyms into an SynonymAnalyzer. But I find there is a
wrong in the search result as following:
input keywords: *browned fox*
query.toString(): (content:browned content:brown) (content:fox content:trick
content:throw content:slyboots content:fuddle content:fob
obert Muir a écrit :
> On Thu, Jan 6, 2011 at 11:53 AM, Pulkit Singhal
> wrote:
>> Hello,
>>
>> What's a good source to get dictionaries (for spellcorrections) and/or
>> thesaurus (for synonyms) that can be used with Lucene for non-English
>> languages s
On Thu, Jan 6, 2011 at 11:53 AM, Pulkit Singhal wrote:
> Hello,
>
> What's a good source to get dictionaries (for spellcorrections) and/or
> thesaurus (for synonyms) that can be used with Lucene for non-English
> languages such as Fresh, Chinese, Korean etc?
if you can&
is useful in search context. You can may uses synsets
(synonyms) like a suggestion dictionary. But stopwords, stem and analyzer
dictionaries are dependant to associate modules.
Best,
---
Hong-Thai
-Message d'origine-
De : Pulkit Singhal [mailto:pulkitsing...@gmail.com]
Hello,
What's a good source to get dictionaries (for spellcorrections) and/or
thesaurus (for synonyms) that can be used with Lucene for non-English
languages such as Fresh, Chinese, Korean etc?
For example, the wordnet contrib module is based on the data set
provided by the Princeton
Larry, you should look at the SynonymFilter in Lucene Contrib Analysis.
simon
On Mon, May 24, 2010 at 9:40 PM, Larry Hendrix wrote:
> Does anyone know of any classes available that allow you to define and use
> your own synonyms when searching with Lucene? I read some about WordPress but
Does anyone know of any classes available that allow you to define and use your
own synonyms when searching with Lucene? I read some about WordPress but it
seems those synonyms are predefined English words. The application I am working
with searches for the names of contacts and companies. I
Hi,
The first answer by Sven is more efficient and generaly used.
Abhay
@Sven
f you add the synonyms at query time you can assign a boost factor to
the added synonyms that would boost the matches to a particular term
down. -> something in the interval [0,1]
On Wed, Aug 26, 2009 at 3:40
Hi Sven,
While I have no idea bout the example in LiA I can give you some
quick pointers.
if you add the synonyms at query time you can assign a boost factor to
the added synonyms that would boost the matches to a particular term
down. -> something in the interval [0,1]
if you add the synonyms
Hi,
I implemented a synonym search by using the chapter 4.6's example from
the Lucene in Action book.
Now I want to extend the example in that way, that synonyms are boosted
less that the original word the user searched for. Is there a way to do
it? If it is, I would like to get any he
Just wanted to add this my original question:
FYI, The synonyms in my application are totally custom and not from English
dictionary...ie. "Global Leader in Finance" could also mean "Top Investment
Bank" or "Fortune 500 Finance company" etc etc.
Anshum-2 wrot
rs to draw
On Sat, Aug 8, 2009 at 11:14 AM, mitu2009 wrote:
>
> Hi,
>
> What is the best way to handle synonyms (phrases) using Lucene? Especially,
> when I need to execute queries like :a OR b OR c NOT d
>
> How about adding a new field called "synonyms" to eac
Hi,
What is the best way to handle synonyms (phrases) using Lucene? Especially,
when I need to execute queries like :a OR b OR c NOT d
How about adding a new field called "synonyms" to each document while
indexing? This field's value would have a list of all synonyms. It woul
I have my own synonyms, which are differnt from the ones in wordNet.
For every word, I have synonyms and the score - for how close the synonym to
its word.
I would like to 'elaborate' the query, to expand it so it will include also
the synonyms of the words given in the query.
Thanks
What are you trying to do? I think you'd get a better response ifyou
explained what higher-level task/feature you're trying to
implement.
Best
Erick
On Mon, Jul 13, 2009 at 4:54 AM, liat oren wrote:
> Hi all,
>
> I have a list of synonyms for every word.
> Is there a
Hi all,
I have a list of synonyms for every word.
Is there a good way to use these synonyms?
Currently I use a boost query so if 'a' is the queried word, and 'b' (0.5)
and 'c' (0.2) are its synonyms, I query for:
a^1 + b^0.5 + c^0.2.
Is there a better way of doing it?
Thanks,
Liat
Hi,
I saw a very old thread that suggests an implementation for Synonyms that
takes into account differnt weight to differnt synonyms and gives a penalty
factor to synonyms, to avoid getting documents with the synonyms prior to
documents with the original words.
http://mail-archives.apache.org
or other problems in queries that depend on order of words).
Uwe
-
Uwe Schindler
H.-H.-Meier-Allee 63, D-28213 Bremen
http://www.thetaphi.de
eMail: u...@thetaphi.de
-Original Message-
From: Michael McCandless [mailto:luc...@mikemccandless.com]
Sent: Monday, March 02, 2009 4:07 PM
T
ler
>> H.-H.-Meier-Allee 63, D-28213 Bremen
>> http://www.thetaphi.de
>> eMail: u...@thetaphi.de
>>
>>> -Original Message-
>>> From: Michael McCandless [mailto:luc...@mikemccandless.com]
>>> Sent: Monday, March 02, 2009 4:07 PM
>&g
4:07 PM
To: java-user@lucene.apache.org
Subject: Re: Indexing synonyms for multiple words
Shouldn't WORD2's position be 1 more than your SYN?
Ie, don't you want these positions?:
WORD1 2
WORD2 3
SYN 2
The position is the starting position of the token; Lucene doesn
n
http://www.thetaphi.de
eMail: u...@thetaphi.de
> -Original Message-
> From: Michael McCandless [mailto:luc...@mikemccandless.com]
> Sent: Monday, March 02, 2009 4:07 PM
> To: java-user@lucene.apache.org
> Subject: Re: Indexing synonyms for multiple words
>
>
&
>
> Hi,
>
> I'm fairly new to Lucene. I'd like to know how we can index synonyms for
> multiple words.
>
> This is the scenario:
>
> Consider a sentence: AAA BBB WORD1 WORD2 EEE FFF GGG.
>
> Now assume the two words combined WORD1 WORD2 can be replace
ene. I'd like to know how we can index synonyms
for
multiple words.
This is the scenario:
Consider a sentence: AAA BBB WORD1 WORD2 EEE FFF GGG.
Now assume the two words combined WORD1 WORD2 can be replaced by
another
word SYN.
If I place SYN after WORD1 with positionIncrement set to 0
Lucene. I'd like to know how we can index synonyms for
> multiple words.
>
> This is the scenario:
>
> Consider a sentence: AAA BBB WORD1 WORD2 EEE FFF GGG.
>
> Now assume the two words combined WORD1 WORD2 can be replaced by another
> word SYN.
>
> If I place
Hi,
I'm fairly new to Lucene. I'd like to know how we can index synonyms for
multiple words.
This is the scenario:
Consider a sentence: AAA BBB WORD1 WORD2 EEE FFF GGG.
Now assume the two words combined WORD1 WORD2 can be replaced by another
word SYN.
If I place SYN after
How can I see the senses of a word with wordnet ??? And How could I select
the most populars ???
Is there a way to make queries ignoring the synonyms I have added to the
index ???
I hope you can help me.
Regards
Ariel
On Wed, Feb 4, 2009 at 7:46 PM, Manu Konchady wrote:
>
>
>
> -
Well, I have the luke 0.8, I opened my index with that tool but there is not
any clue of synonyms in the field I have indexed with the synonym analyzer.
I don't know how can I see the group of synonyms of each term, sould
somebody tell me hot to do that ???
On Wed, Feb 4, 2009 at 5:09 PM,
is unanswerable as it
stands.
Best
Erick
On Wed, Feb 4, 2009 at 3:27 PM, Ariel wrote:
> Hi every body:
>
> I am using wordnet to index my document taking in account the synonyms
> with wordnet.
> After I indexed the whole documents collections I made a query with
> the word "
Hi every body:
I am using wordnet to index my document taking in account the synonyms
with wordnet.
After I indexed the whole documents collections I made a query with
the word "snort" but documents that contain the word bird are
retrieved, I don't understand this because snort a
token synonyms there has been a few discussions on the
forum about this before I came up with ShingleMatrixFilter, perhaps
some of these might work better for you:
http://www.nabble.com/multi-word-synonyms-to17294842.html#a17305359
Still, here is the explaination of what ShingleMatrixFilter
The unit tests don't really show how I could use it for synonyms at
index time- does anyone have sample code? Is it possible?
-Original Message-
From: Otis Gospodnetic [mailto:otis_gospodne...@yahoo.com]
Sent: Tuesday, January 13, 2009 3:06 PM
To: java-user@lucene.apache.org
Subjec
ene.apache.org
> Sent: Tuesday, January 13, 2009 2:39:11 PM
> Subject: ShingleMatrixFilter for synonyms
>
> Does anyone have an example using this?
>
>
>
> I have a SynonymEngine that returns a an array list of strings, some of
> which may be multiple words. How c
Does anyone have an example using this?
I have a SynonymEngine that returns a an array list of strings, some of
which may be multiple words. How can I incorporate this with my
SynonymEngine at index time?
Also, the javadoc for the ShingleMatrixFilter class says:
Without a spac
18 maj 2008 kl. 19.17 skrev Paul Elschot:
Now I'm thinking it could be solved using token payloads and a brand
new MultiDimensionalSpanQuery. Not too different from what you
suggested way back in
http://www.nabble.com/Using-Lucene-for-searching-tokens%2C-not-storin
g-them.-to3918462.html#a394401
Op Sunday 18 May 2008 16:30:26 schreef Karl Wettin:
> 18 maj 2008 kl. 00.01 skrev Paul Elschot:
> > Op Saturday 17 May 2008 20:28:40 schreef Karl Wettin:
> >> As far as I know Lucene only handle single word synonyms at index
> >> time. My life would be much simple
18 maj 2008 kl. 00.01 skrev Paul Elschot:
Op Saturday 17 May 2008 20:28:40 schreef Karl Wettin:
As far as I know Lucene only handle single word synonyms at index
time. My life would be much simpler if it was possible to add
synonyms that spanned over multiple tokens, such as "lucene in
a
Op Saturday 17 May 2008 20:28:40 schreef Karl Wettin:
> As far as I know Lucene only handle single word synonyms at index
> time. My life would be much simpler if it was possible to add
> synonyms that spanned over multiple tokens, such as "lucene in
> action"="lia"
As far as I know Lucene only handle single word synonyms at index
time. My life would be much simpler if it was possible to add synonyms
that spanned over multiple tokens, such as "lucene in action"="lia". I
have a couple of workarounds that are OK but it really isn
my documents unstemmed and then stem at the
query words with a lower
weighting?
-M
On Dec 28, 2007 10:39 AM, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
> Yes, the Payload stuff should work for this, but you will have to set
> it up during indexing. The simpler approach is probably a se
Yes, the Payload stuff should work for this, but you will have to set
it up during indexing. The simpler approach is probably a separate
field for synonyms, but this means analyzing the same content twice
(or trying out the TeeTokenFilter, but this is advanced usage at this
point, since
Hi Grant,
Grant Ingersoll-6 wrote:
>
> You can use the payload functionality (have a look at
> BoostingTermQuery and Michael B. excellent ApacheCon talk at
> http://people.apache.org/~buschmi/apachecon/). Other option is to put
> the synonyms into a separate field and
You can use the payload functionality (have a look at
BoostingTermQuery and Michael B. excellent ApacheCon talk at
http://people.apache.org/~buschmi/apachecon/). Other option is to put
the synonyms into a separate field and boost that less than the main
field.
-Grant
On Dec 27, 2007, at
e higher than the synonym(s)?
TIA,
Frank
--
View this message in context:
http://www.nabble.com/Synonyms-and-Ranking-tp14518753p14518753.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
-
To unsubs
M";
>
> private Stack synonymStack;
> private SynonymEngine engine;
>
> public SynonymFilter(TokenStream in, SynonymEngine engine) {
>super(in);
>synonymStack = new Stack();
>this.engine = engine;
> }
> ...
>
>
> Cheers!
&g
nonymFilter(TokenStream in, SynonymEngine engine) {
super(in);
synonymStack = new Stack();
this.engine = engine;
}
...
Cheers!
Frank
--
View this message in context:
http://www.nabble.com/Synonyms-in-Lucene-2.2-tp14514848p14515933.html
Sent
;
>
> private Stack synonymStack;
> private SynonymEngine engine;
>
> public SynonymFilter(TokenStream in, SynonymEngine engine) {
>super(in);
>synonymStack = new Stack();
>this.engine = engine;
> }
> ....
>
> But "Stack" is not recogn
ilter(TokenStream in, SynonymEngine engine) {
super(in);
synonymStack = new Stack();
this.engine = engine;
}
But "Stack" is not recognized. What am I doing wrong?
--
View this message in context:
http://www.nabble.com/Synonyms-in-Lucene-2.2-tp14514848p14514848.html
Sen
it will be difficult to use features that rely on the
>> > term list, e.g. the spellchecker. It will not be able to suggest
>> > "mainstrasse", as that's not in the index.
>> >
>> > Regards
>> > Daniel
>> >
>>
in the index.
>
> Regards
> Daniel
>
> --
> http://www.danielnaber.de
>
> -----
> To unsubscribe, e-mail: [EMAIL PROTECTED]
> For additional comma
lchecker. It will not be able to suggest
> "mainstrasse", as that's not in the index.
>
> Regards
> Daniel
>
> --
> http://www.danielnaber.de
>
> -
> To unsubscribe, e-mail: [EMAI
On Monday 21 May 2007 22:53, bhecht wrote:
> If someone searches for mainstrasse, my tools will split it again to
> main and strasse, and then lucene will be able to find it.
"strasse" will match "mainstrasse" but the phrase query "schöne strasse"
will not match "schöne mainstrasse". However, th
el Naber-5 wrote:
>
> On Monday 21 May 2007 22:05, bhecht wrote:
>
>> Is there any point for me to start creating custom analyzers with filter
>> for stop words, synonyms, and implementing my own "sub string" filter,
>> for separating tokens into "sub words&q
On Monday 21 May 2007 22:05, bhecht wrote:
> Is there any point for me to start creating custom analyzers with filter
> for stop words, synonyms, and implementing my own "sub string" filter,
> for separating tokens into "sub words" (like "mainstrasse"=>
m (substitution mechanism) that can
also be configured according to location within a phrase. It can also be
configured to find synonyms taking into account spelling mistakes. Although
it doesn't expand but only transforms to one certain replacement.It can find
replacements for sub strings as well
th enclosing multiple-word entries in quotes...
Best
Erick
Thanks,
Dave.
--
View this message in context:
http://www.nabble.com/Synonyms-and-Aliases-query-tf3473040.html#a9692225
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
---
ext:
http://www.nabble.com/Synonyms-and-Aliases-query-tf3473040.html#a9693265
Sent from the Lucene - Java Users mailing list archive at Nabble.com.
-
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
in a synonym Engine...
suppose synonyms of word x is syn(x)...
then if y = syn(x) then x = syn(y) doesn't hold true always .(
you might not get any synonyms of y..it depends on the data of synonym
engine)
so your synonym engine might be providing alias of bob as robert,
rob,
s analyzer to match I.B.M=IBM=International Business Machines? i.e.
multiple words to a single word.
Thanks,
Dave.
--
View this message in context:
http://www.nabble.com/Synonyms-and-Aliases-query-tf3473040.html#a9692225
Sent from the Lucene - Java Users mailing list archive at
On 8/6/06, Michael J. Prichard <[EMAIL PROTECTED]> wrote:
I created some indexes that use a SynonymAnalyzer and now I want to be
able to offer a choice as to search the synonyms or not.
The simplest method is to index the field more than once... one with
synonyms, one without.
-Yoni
Howdy,
I created some indexes that use a SynonymAnalyzer and now I want to be
able to offer a choice as to search the synonyms or not. If I search
now it will find all docs since the analyzer created tokens in the same
position. How do I tell my IndexSearcher to not look at those tokens
Thanks, Otis. I think the SynonymAnalyzer is the way to go, injecting the
synonyms while removing the stop words.
Andrew
-Original Message-
From: Otis Gospodnetic [mailto:[EMAIL PROTECTED]
Sent: Wednesday, July 26, 2006 3:19 PM
To: java-user@lucene.apache.org
Subject: Re: Newbie
Hi Andrew,
There is othing built into Lucene for synonyms, but you can grab the code from
Lucene in Action to see how they can be handled (plus:
http://www.lucenebook.com/search?query=synonyms for some context)
Otis
- Original Message
From: "Lee, Andrew J (CA - Toronto)&quo
Sorry if this question has already been answered, but it is regarding
synonyms.
I am using the WordNet/Synonyms index and using the following algorithm
to create synonym searches (this is a dumbed down version):
Look up a the "base" word in the synonym index
In my search string, r
i am currently adding synonyms at index time (and not expanding the query), i
fear that there is a problem with this implementation:
is there a way to lower the score of a document if it was found due to a
synonyms match and not due to a match of the word queried. from what i
understand
As you might have already seen, Andrew Schetinin and I have published (at
http://mail-archives.apache.org/mod_mbox/lucene-java-user/200603.mbox/[EMAIL
PROTECTED]) a source code that handles synonyms at search time (query
expansion).
This code includes also a de-boost factor for synonyms
nt: Friday, April 21, 2006 8:49 PM
To: java-user@lucene.apache.org
Subject: Synonyms ...
Hi,
What is the best way to implement the following?
Document 1 contains the following text:
"THE CZECH REPUBLIC ORGANIZATION"
Document 2 contains the following text:
"THE CZE ORGANISAT
.
I think you will need to create a custom tokenizer to deal with phrases
you'd like to keep together. I have done something similar (e.g. United
Kingdom has Britain, England and UK as synonyms), but in my case I'm
indexing only one or two word "documents" not huge blocks of
On 4/21/06, Dragon Fly <[EMAIL PROTECTED]> wrote:
> I don't think the SynonymAnalyzer described in LIA would work because
> some of my "synonyms" contain multiple words.
The SynonymFilter in Solr can handle multi-word synonyms.
http://wiki.apache.org/solr/Analyzers
G"
I don't think the SynonymAnalyzer described in LIA would work because
some of my "synonyms" contain multiple words. Thank you.
_
Dont just search. Find. Check out the new MSN Search!
http://search.msn.click-url.com/
1 - 100 of 111 matches
Mail list logo