Re: Synonyms and searching

2025-04-21 Thread Anh Dũng Bùi
ver will find matches to any of those forms. But > because my search method boosts exactText matches, the exact form they > search for will be scored higher. > > Because the custom analyzer produces word tokens I have incorporated > synonyms (when indexing). The synonyms are bidirectional.

RE: Synonyms and searching

2025-03-10 Thread Trevor Nicholls
, the exact form they search for will be scored higher. Because the custom analyzer produces word tokens I have incorporated synonyms (when indexing). The synonyms are bidirectional. OK, that's all background. The problem I am trying to fix at the moment is that searching for these compoun

Re: Synonyms and searching

2025-03-10 Thread Uwe Schindler
uct the index and the search so that if the user searches for e.g. "http proxyserver" they also find a match. I thought it would be sufficient to add an entry to the synonym map specifying that "http proxy" and "httpproxy" are synonyms, and likewise "proxy server&qu

Re: Synonyms and searching

2025-03-05 Thread Michael Sokolov
One thing to check is whether the synonyms are configured as bidirectional, or which direction they go (eg is "a b" being expanded to "ab" but "ab" is not being expanded to "a b"??) On Wed, Mar 5, 2025 at 2:20 PM Mikhail Khludnev wrote: > > Hello Tre

Re: Synonyms and searching

2025-03-05 Thread Mikhail Khludnev
and all will > match. > > > > However, I am trying to construct the index and the search so that if the > user searches for e.g. "http proxyserver" they also find a match. I thought > it would be sufficient to add an entry to the synonym map specifying that > &qu

Synonyms and searching

2025-03-05 Thread Trevor Nicholls
t would be sufficient to add an entry to the synonym map specifying that "http proxy" and "httpproxy" are synonyms, and likewise "proxy server" and "proxyserver". (When adding multiple-word phrases the spaces are replaced by SynonymMap.WORD_SEPARATOR). Th

Multi-term synonyms in SynonymQuery

2022-12-28 Thread Anh Dũng Bùi
Hi Lucene users, I recently came across SynonymQuery and found out that it only supports single-term synonyms (since it accepts a list of Term which will be considered as synonyms). We have some multi-term synonyms like "internet device" <-> "wifi router" or "dns&q

Re: Correct usage of synonyms with Japanese

2021-05-19 Thread Geoffrey Lawson
hich produces nice > looking graphs right from the get-go (tokenizer), with compound words also > properly decompounded so both options are indexed/searched. > > History: we created SynonymGraphFilter, along with other important > QueryParser (e.g. http://issues.apache.org/jira/browse/L

Re: Correct usage of synonyms with Japanese

2021-05-18 Thread Michael McCandless
th options are indexed/searched. History: we created SynonymGraphFilter, along with other important QueryParser (e.g. http://issues.apache.org/jira/browse/LUCENE-7603) and Query improvements, to get multi-term synonyms working correctly, finally in Lucene. With the old SynonymFilter, position

Correct usage of synonyms with Japanese

2021-05-18 Thread Geoffrey Lawson
Hello, I'm working on a project that involves search in Japanese and uses synonyms. The Japanese tokenizer creates an analysis graph, but the SynonymGraphFilter states it cannot take a graph as input. After a few tests I've seen it can create some unusual outputs if given a graph as

Re: MultiPhraseQuery or PhraseQuery to take the synonyms into account?

2018-09-22 Thread Michael McCandless
PM baris.kazar wrote: > i should have asked this way as Mike made clear for MultiPhraseQuery: > is PhraseQuery ok to account for synonyms? > Best > > > On Sep 20, 2018, at 2:02 PM, baris.ka...@oracle.com wrote: > > > > Hi,- > > > > should i use MultiPhrase

Re: MultiPhraseQuery or PhraseQuery to take the synonyms into account?

2018-09-20 Thread baris.kazar
i should have asked this way as Mike made clear for MultiPhraseQuery: is PhraseQuery ok to account for synonyms? Best > On Sep 20, 2018, at 2:02 PM, baris.ka...@oracle.com wrote: > > Hi,- > > should i use MultiPhraseQuery or PhraseQuery to take synonyms into account? &g

MultiPhraseQuery or PhraseQuery to take the synonyms into account?

2018-09-20 Thread baris . kazar
Hi,-  should i use MultiPhraseQuery or PhraseQuery to take synonyms into account? Best regards baris - To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h

Re: Lucene add synonyms to serch

2018-06-21 Thread Tomoko Uchida
2018年6月21日(木) 20:09 egorlex : > Hi, > > I am new in Lucene and I need add synonyms to my search. > > I tried use SynonymFilterFactory for this purpose, but it deprecated for > Lucene 7.3.0 > > Please help me with some examples or documentation. What is the b

Lucene add synonyms to serch

2018-06-21 Thread egorlex
Hi, I am new in Lucene and I need add synonyms to my search. I tried use SynonymFilterFactory for this purpose, but it deprecated for Lucene 7.3.0 Please help me with some examples or documentation. What is the best way to realized it? -- Sent from: http://lucene.472066.n3.nabble.com

Weighed Synonyms - Lucene 7.1

2017-11-16 Thread benafia salem
Hello, I'm trying to implement a weighed synonyms analyzer using SynonymGraphFilter and a custom payload analyzer. My synonyms file looks like: obama,president^0.9,barry^0.5 speaks,greets^0.8,congratulates^0.9,informs media,press,journal,tv^0.8 I want user searching for "obama greets

Synonyms with multiple alternatives

2017-11-15 Thread Chris . Hill
I am using Lucene 4.8 (.net flavour) and cannot find a decent working example to answer my issue. In our source data we have lots of similar items that can be described in the same way - for example "lawnmower", "lawn mower" & "grass cutter". Obviously we have no control over how people choose

Re: synonyms

2017-07-25 Thread Christian Kaufhold
rcaseFilter before your SynonymFilter, which means that the entities in your SynonymMap need to be all lowercase or they won’t be matched. Alan Woodward www.flax.co.uk > On 25 Jul 2017, at 07:52, Christian Kaufhold > wrote: > > Hi, > > I am not able to add synonyms to the lu

Re: synonyms

2017-07-25 Thread Alan Woodward
You have a LowercaseFilter before your SynonymFilter, which means that the entities in your SynonymMap need to be all lowercase or they won’t be matched. Alan Woodward www.flax.co.uk > On 25 Jul 2017, at 07:52, Christian Kaufhold > wrote: > > Hi, > > I am not able to a

synonyms

2017-07-24 Thread Christian Kaufhold
Hi, I am not able to add synonyms to the lucene index. I condensed my problem into the following class which is based on a Hello World example. The idea behind the code was to add a document with universität and the synonym 'Hochschule' (highschool) so that lucene finds universität we

Re: Query Expansion for Synonyms

2016-04-28 Thread Ahmet Arslan
Hi Daniel, Since you are restricting inOrder=true and proximity=0 in the top level query, there is no problem in your particular example. If you weren't restricting, injecting synonyms with plain OR, sometimes cause 'query drift': injection/addition of one term changes result l

Query Expansion for Synonyms

2016-04-28 Thread Daniel Bigham
I'm investigating various ways of supporting synonyms in Lucene. One such approach that looks potentially interesting is to do a kind of "query expansion". For example, if the user searches for "us 1888", one might expand the query as follows: Span

Re: A work around to get matching terms from document - Stemmed and Synonyms

2014-05-17 Thread Jack Krupansky
From: venkatesham.gu...@igate.com Sent: Saturday, May 17, 2014 2:28 AM To: java-user@lucene.apache.org Subject: A work around to get matching terms from document - Stemmed and Synonyms I am looking for a feature in SOLR that will give me all matched terms in the document when I search with a

Re: A work around to get matching terms from document - Stemmed and Synonyms

2014-05-17 Thread Jack Krupansky
java-user@lucene.apache.org Subject: A work around to get matching terms from document - Stemmed and Synonyms I am looking for a feature in SOLR that will give me all matched terms in the document when I search with a query term, My SOLR field uses Stemming and Synonym filters as a result of this I

A work around to get matching terms from document - Stemmed and Synonyms

2014-05-16 Thread venkatesham.gu...@igate.com
program which will take each document from the result and apply tokenizer, stemming and synonym and match with query term(apply stemming and synonyms) to identify the exact terms from the document, before applying the stemming and synonym making a copy of the original terms.Please suggest is there any

Matched words from document - Stemmed and Synonyms

2013-07-24 Thread venkatesham.gu...@igate.com
words - in this example operation- Thanks Venkatesham Gundu -- View this message in context: http://lucene.472066.n3.nabble.com/Matched-words-from-document-Stemmed-and-Synonyms-tp4079968.html Sent from the Lucene - Java Users mailing list archive at

Injecting custom synonyms in Lucene

2013-07-12 Thread gtkesh
Hi everyone! I tried to inject the synonyms during indexing. I created a custom analyzer. It pretty much does the same as a standard analyzer with an addition of having synonym filter. During indexing and searching, I pass this custom analyzer to both indexwriter and indexsearcher, but the

Why doesn't this code run - Adding synonyms from Wordnet to Lucene Index

2013-04-18 Thread Abhishek Shivkumar
tomAnalyzer as part of my querying, then if I enter the query as myFieldName: manager it expands the query with synonyms for manager. But, I want the synonyms to be part of only my index and I don't want my query to be expanded with synonyms. So, when I removed the SynonymFilter f

Re: About WordNet synonyms search

2011-02-12 Thread Erick Erickson
Ah, you're expanding synonyms at query time. Missed that. How is the indexing done in the first place? What analyzer is used? Have you even tried using Luke to look into your index and see what's actually in there? What is shown when you do an explain? Most often this kind of thing i

Re: About WordNet synonyms search

2011-02-12 Thread Gong Li
edevil) 2011/2/13 Erick Erickson > At a guess make is a synonym for one of your search terms. doc.get > returns the original content, not synonyms. > > > So what are your synonyms that might be a factor here? > > Best > Erick > > On Sat, Feb 12, 2011 at 6:04 A

Re: About WordNet synonyms search

2011-02-12 Thread Erick Erickson
At a guess make is a synonym for one of your search terms. doc.get returns the original content, not synonyms. So what are your synonyms that might be a factor here? Best Erick On Sat, Feb 12, 2011 at 6:04 AM, Gong Li wrote: > Hi, > > I am tying WordNet synonyms into an SynonymAnalyz

About WordNet synonyms search

2011-02-12 Thread Gong Li
Hi, I am tying WordNet synonyms into an SynonymAnalyzer. But I find there is a wrong in the search result as following: input keywords: *browned fox* query.toString(): (content:browned content:brown) (content:fox content:trick content:throw content:slyboots content:fuddle content:fob

Re: Where to find non-English dictionaries, thesaurus, synonyms

2011-01-07 Thread Paul Libbrecht
obert Muir a écrit : > On Thu, Jan 6, 2011 at 11:53 AM, Pulkit Singhal > wrote: >> Hello, >> >> What's a good source to get dictionaries (for spellcorrections) and/or >> thesaurus (for synonyms) that can be used with Lucene for non-English >> languages s

Re: Where to find non-English dictionaries, thesaurus, synonyms

2011-01-07 Thread Robert Muir
On Thu, Jan 6, 2011 at 11:53 AM, Pulkit Singhal wrote: > Hello, > > What's a good source to get dictionaries (for spellcorrections) and/or > thesaurus (for synonyms) that can be used with Lucene for non-English > languages such as Fresh, Chinese, Korean etc? if you can&

RE: Where to find non-English dictionaries, thesaurus, synonyms

2011-01-06 Thread Hong-Thai Nguyen
is useful in search context. You can may uses synsets (synonyms) like a suggestion dictionary. But stopwords, stem and analyzer dictionaries are dependant to associate modules. Best, --- Hong-Thai -Message d'origine- De : Pulkit Singhal [mailto:pulkitsing...@gmail.com]

Where to find non-English dictionaries, thesaurus, synonyms

2011-01-06 Thread Pulkit Singhal
Hello, What's a good source to get dictionaries (for spellcorrections) and/or thesaurus (for synonyms) that can be used with Lucene for non-English languages such as Fresh, Chinese, Korean etc? For example, the wordnet contrib module is based on the data set provided by the Princeton

Re: Using synonyms with Lucene without WordPress

2010-05-24 Thread Simon Willnauer
Larry, you should look at the SynonymFilter in Lucene Contrib Analysis. simon On Mon, May 24, 2010 at 9:40 PM, Larry Hendrix wrote: > Does anyone know of any classes available that allow you to define and use > your own synonyms when searching with Lucene? I read some about WordPress but

Using synonyms with Lucene without WordPress

2010-05-24 Thread Larry Hendrix
Does anyone know of any classes available that allow you to define and use your own synonyms when searching with Lucene? I read some about WordPress but it seems those synonyms are predefined English words. The application I am working with searches for the names of contacts and companies. I

Re: how to down-weight synonyms

2009-08-26 Thread abhay kumar
Hi, The first answer by Sven is more efficient and generaly used. Abhay @Sven f you add the synonyms at query time you can assign a boost factor to the added synonyms that would boost the matches to a particular term down. -> something in the interval [0,1] On Wed, Aug 26, 2009 at 3:40

Re: how to down-weight synonyms

2009-08-26 Thread Simon Willnauer
Hi Sven, While I have no idea bout the example in LiA I can give you some quick pointers. if you add the synonyms at query time you can assign a boost factor to the added synonyms that would boost the matches to a particular term down. -> something in the interval [0,1] if you add the synonyms

how to down-weight synonyms

2009-08-26 Thread Sven Fischer
Hi, I implemented a synonym search by using the chapter 4.6's example from the Lucene in Action book. Now I want to extend the example in that way, that synonyms are boosted less that the original word the user searched for. Is there a way to do it? If it is, I would like to get any he

Re: Handling synonyms using Lucene

2009-08-09 Thread mitu2009
Just wanted to add this my original question: FYI, The synonyms in my application are totally custom and not from English dictionary...ie. "Global Leader in Finance" could also mean "Top Investment Bank" or "Fortune 500 Finance company" etc etc. Anshum-2 wrot

Re: Handling synonyms using Lucene

2009-08-07 Thread Anshum
rs to draw On Sat, Aug 8, 2009 at 11:14 AM, mitu2009 wrote: > > Hi, > > What is the best way to handle synonyms (phrases) using Lucene? Especially, > when I need to execute queries like :a OR b OR c NOT d > > How about adding a new field called "synonyms" to eac

Handling synonyms using Lucene

2009-08-07 Thread mitu2009
Hi, What is the best way to handle synonyms (phrases) using Lucene? Especially, when I need to execute queries like :a OR b OR c NOT d How about adding a new field called "synonyms" to each document while indexing? This field's value would have a list of all synonyms. It woul

Re: Use of Synonyms

2009-07-13 Thread liat oren
I have my own synonyms, which are differnt from the ones in wordNet. For every word, I have synonyms and the score - for how close the synonym to its word. I would like to 'elaborate' the query, to expand it so it will include also the synonyms of the words given in the query. Thanks

Re: Use of Synonyms

2009-07-13 Thread Erick Erickson
What are you trying to do? I think you'd get a better response ifyou explained what higher-level task/feature you're trying to implement. Best Erick On Mon, Jul 13, 2009 at 4:54 AM, liat oren wrote: > Hi all, > > I have a list of synonyms for every word. > Is there a

Use of Synonyms

2009-07-13 Thread liat oren
Hi all, I have a list of synonyms for every word. Is there a good way to use these synonyms? Currently I use a boost query so if 'a' is the queried word, and 'b' (0.5) and 'c' (0.2) are its synonyms, I query for: a^1 + b^0.5 + c^0.2. Is there a better way of doing it? Thanks, Liat

Reply to "Search for synonyms - implemenetation for review"

2009-04-20 Thread liat oren
Hi, I saw a very old thread that suggests an implementation for Synonyms that takes into account differnt weight to differnt synonyms and gives a penalty factor to synonyms, to avoid getting documents with the synonyms prior to documents with the original words. http://mail-archives.apache.org

Re: Indexing synonyms for multiple words

2009-03-03 Thread Michael McCandless
or other problems in queries that depend on order of words). Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Michael McCandless [mailto:luc...@mikemccandless.com] Sent: Monday, March 02, 2009 4:07 PM T

Re: Indexing synonyms for multiple words

2009-03-02 Thread Sumukh
ler >> H.-H.-Meier-Allee 63, D-28213 Bremen >> http://www.thetaphi.de >> eMail: u...@thetaphi.de >> >>> -Original Message- >>> From: Michael McCandless [mailto:luc...@mikemccandless.com] >>> Sent: Monday, March 02, 2009 4:07 PM >&g

Re: Indexing synonyms for multiple words

2009-03-02 Thread Michael McCandless
4:07 PM To: java-user@lucene.apache.org Subject: Re: Indexing synonyms for multiple words Shouldn't WORD2's position be 1 more than your SYN? Ie, don't you want these positions?: WORD1 2 WORD2 3 SYN 2 The position is the starting position of the token; Lucene doesn

RE: Indexing synonyms for multiple words

2009-03-02 Thread Uwe Schindler
n http://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Michael McCandless [mailto:luc...@mikemccandless.com] > Sent: Monday, March 02, 2009 4:07 PM > To: java-user@lucene.apache.org > Subject: Re: Indexing synonyms for multiple words > > &

Indexing synonyms for multiple words

2009-03-02 Thread Sumukh
> > Hi, > > I'm fairly new to Lucene. I'd like to know how we can index synonyms for > multiple words. > > This is the scenario: > > Consider a sentence: AAA BBB WORD1 WORD2 EEE FFF GGG. > > Now assume the two words combined WORD1 WORD2 can be replace

Re: Indexing synonyms for multiple words

2009-03-02 Thread Michael McCandless
ene. I'd like to know how we can index synonyms for multiple words. This is the scenario: Consider a sentence: AAA BBB WORD1 WORD2 EEE FFF GGG. Now assume the two words combined WORD1 WORD2 can be replaced by another word SYN. If I place SYN after WORD1 with positionIncrement set to 0

Re: Indexing synonyms for multiple words

2009-03-02 Thread Erick Erickson
Lucene. I'd like to know how we can index synonyms for > multiple words. > > This is the scenario: > > Consider a sentence: AAA BBB WORD1 WORD2 EEE FFF GGG. > > Now assume the two words combined WORD1 WORD2 can be replaced by another > word SYN. > > If I place

Indexing synonyms for multiple words

2009-03-02 Thread Sumukh
Hi, I'm fairly new to Lucene. I'd like to know how we can index synonyms for multiple words. This is the scenario: Consider a sentence: AAA BBB WORD1 WORD2 EEE FFF GGG. Now assume the two words combined WORD1 WORD2 can be replaced by another word SYN. If I place SYN after

Re: How to index correctly taking in account the synonyms using Wordnet ???

2009-02-04 Thread Ariel
How can I see the senses of a word with wordnet ??? And How could I select the most populars ??? Is there a way to make queries ignoring the synonyms I have added to the index ??? I hope you can help me. Regards Ariel On Wed, Feb 4, 2009 at 7:46 PM, Manu Konchady wrote: > > > > -

Re: How to index correctly taking in account the synonyms using Wordnet ???

2009-02-04 Thread Ariel
Well, I have the luke 0.8, I opened my index with that tool but there is not any clue of synonyms in the field I have indexed with the synonym analyzer. I don't know how can I see the group of synonyms of each term, sould somebody tell me hot to do that ??? On Wed, Feb 4, 2009 at 5:09 PM,

Re: How to index correctly taking in account the synonyms using Wordnet ???

2009-02-04 Thread Erick Erickson
is unanswerable as it stands. Best Erick On Wed, Feb 4, 2009 at 3:27 PM, Ariel wrote: > Hi every body: > > I am using wordnet to index my document taking in account the synonyms > with wordnet. > After I indexed the whole documents collections I made a query with > the word "

How to index correctly taking in account the synonyms using Wordnet ???

2009-02-04 Thread Ariel
Hi every body: I am using wordnet to index my document taking in account the synonyms with wordnet. After I indexed the whole documents collections I made a query with the word "snort" but documents that contain the word bird are retrieved, I don't understand this because snort a

Re: ShingleMatrixFilter for synonyms

2009-01-14 Thread Karl Wettin
token synonyms there has been a few discussions on the forum about this before I came up with ShingleMatrixFilter, perhaps some of these might work better for you: http://www.nabble.com/multi-word-synonyms-to17294842.html#a17305359 Still, here is the explaination of what ShingleMatrixFilter

RE: ShingleMatrixFilter for synonyms

2009-01-13 Thread Angel, Eric
The unit tests don't really show how I could use it for synonyms at index time- does anyone have sample code? Is it possible? -Original Message- From: Otis Gospodnetic [mailto:otis_gospodne...@yahoo.com] Sent: Tuesday, January 13, 2009 3:06 PM To: java-user@lucene.apache.org Subjec

Re: ShingleMatrixFilter for synonyms

2009-01-13 Thread Otis Gospodnetic
ene.apache.org > Sent: Tuesday, January 13, 2009 2:39:11 PM > Subject: ShingleMatrixFilter for synonyms > > Does anyone have an example using this? > > > > I have a SynonymEngine that returns a an array list of strings, some of > which may be multiple words. How c

ShingleMatrixFilter for synonyms

2009-01-13 Thread Angel, Eric
Does anyone have an example using this? I have a SynonymEngine that returns a an array list of strings, some of which may be multiple words. How can I incorporate this with my SynonymEngine at index time? Also, the javadoc for the ShingleMatrixFilter class says: Without a spac

Re: multi word synonyms

2008-05-18 Thread Karl Wettin
18 maj 2008 kl. 19.17 skrev Paul Elschot: Now I'm thinking it could be solved using token payloads and a brand new MultiDimensionalSpanQuery. Not too different from what you suggested way back in http://www.nabble.com/Using-Lucene-for-searching-tokens%2C-not-storin g-them.-to3918462.html#a394401

Re: multi word synonyms

2008-05-18 Thread Paul Elschot
Op Sunday 18 May 2008 16:30:26 schreef Karl Wettin: > 18 maj 2008 kl. 00.01 skrev Paul Elschot: > > Op Saturday 17 May 2008 20:28:40 schreef Karl Wettin: > >> As far as I know Lucene only handle single word synonyms at index > >> time. My life would be much simple

Re: multi word synonyms

2008-05-18 Thread Karl Wettin
18 maj 2008 kl. 00.01 skrev Paul Elschot: Op Saturday 17 May 2008 20:28:40 schreef Karl Wettin: As far as I know Lucene only handle single word synonyms at index time. My life would be much simpler if it was possible to add synonyms that spanned over multiple tokens, such as "lucene in a

Re: multi word synonyms

2008-05-17 Thread Paul Elschot
Op Saturday 17 May 2008 20:28:40 schreef Karl Wettin: > As far as I know Lucene only handle single word synonyms at index > time. My life would be much simpler if it was possible to add > synonyms that spanned over multiple tokens, such as "lucene in > action"="lia"

multi word synonyms

2008-05-17 Thread Karl Wettin
As far as I know Lucene only handle single word synonyms at index time. My life would be much simpler if it was possible to add synonyms that spanned over multiple tokens, such as "lucene in action"="lia". I have a couple of workarounds that are OK but it really isn

Re: Synonyms and Ranking

2008-01-03 Thread Michael Stoppelman
my documents unstemmed and then stem at the query words with a lower weighting? -M On Dec 28, 2007 10:39 AM, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > Yes, the Payload stuff should work for this, but you will have to set > it up during indexing. The simpler approach is probably a se

Re: Synonyms and Ranking

2007-12-28 Thread Grant Ingersoll
Yes, the Payload stuff should work for this, but you will have to set it up during indexing. The simpler approach is probably a separate field for synonyms, but this means analyzing the same content twice (or trying out the TeeTokenFilter, but this is advanced usage at this point, since

Re: Synonyms and Ranking

2007-12-28 Thread Frank Schima
Hi Grant, Grant Ingersoll-6 wrote: > > You can use the payload functionality (have a look at > BoostingTermQuery and Michael B. excellent ApacheCon talk at > http://people.apache.org/~buschmi/apachecon/). Other option is to put > the synonyms into a separate field and

Re: Synonyms and Ranking

2007-12-27 Thread Grant Ingersoll
You can use the payload functionality (have a look at BoostingTermQuery and Michael B. excellent ApacheCon talk at http://people.apache.org/~buschmi/apachecon/). Other option is to put the synonyms into a separate field and boost that less than the main field. -Grant On Dec 27, 2007, at

Synonyms and Ranking

2007-12-27 Thread Frank Schima
e higher than the synonym(s)? TIA, Frank -- View this message in context: http://www.nabble.com/Synonyms-and-Ranking-tp14518753p14518753.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. - To unsubs

Re: Synonyms in Lucene 2.2

2007-12-27 Thread Erick Erickson
M"; > > private Stack synonymStack; > private SynonymEngine engine; > > public SynonymFilter(TokenStream in, SynonymEngine engine) { >super(in); >synonymStack = new Stack(); >this.engine = engine; > } > ... > > > Cheers! &g

Re: Synonyms in Lucene 2.2

2007-12-27 Thread Frank Schima
nonymFilter(TokenStream in, SynonymEngine engine) { super(in); synonymStack = new Stack(); this.engine = engine; } ... Cheers! Frank -- View this message in context: http://www.nabble.com/Synonyms-in-Lucene-2.2-tp14514848p14515933.html Sent

Re: Synonyms in Lucene 2.2

2007-12-27 Thread Erick Erickson
; > > private Stack synonymStack; > private SynonymEngine engine; > > public SynonymFilter(TokenStream in, SynonymEngine engine) { >super(in); >synonymStack = new Stack(); >this.engine = engine; > } > .... > > But "Stack" is not recogn

Synonyms in Lucene 2.2

2007-12-27 Thread Frank Schima
ilter(TokenStream in, SynonymEngine engine) { super(in); synonymStack = new Stack(); this.engine = engine; } But "Stack" is not recognized. What am I doing wrong? -- View this message in context: http://www.nabble.com/Synonyms-in-Lucene-2.2-tp14514848p14514848.html Sen

Re: stop words, synonyms... what's in it for me?

2007-05-21 Thread bhecht
it will be difficult to use features that rely on the >> > term list, e.g. the spellchecker. It will not be able to suggest >> > "mainstrasse", as that's not in the index. >> > >> > Regards >> > Daniel >> > >>

Re: stop words, synonyms... what's in it for me?

2007-05-21 Thread Erick Erickson
in the index. > > Regards > Daniel > > -- > http://www.danielnaber.de > > ----- > To unsubscribe, e-mail: [EMAIL PROTECTED] > For additional comma

Re: stop words, synonyms... what's in it for me?

2007-05-21 Thread bhecht
lchecker. It will not be able to suggest > "mainstrasse", as that's not in the index. > > Regards > Daniel > > -- > http://www.danielnaber.de > > - > To unsubscribe, e-mail: [EMAI

Re: stop words, synonyms... what's in it for me?

2007-05-21 Thread Daniel Naber
On Monday 21 May 2007 22:53, bhecht wrote: > If someone searches for mainstrasse, my tools will split it again to > main and strasse, and then lucene will be able to find it. "strasse" will match "mainstrasse" but the phrase query "schöne strasse" will not match "schöne mainstrasse". However, th

Re: stop words, synonyms... what's in it for me?

2007-05-21 Thread bhecht
el Naber-5 wrote: > > On Monday 21 May 2007 22:05, bhecht wrote: > >> Is there any point for me to start creating custom analyzers with filter >> for stop words, synonyms, and implementing my own "sub string" filter, >> for separating tokens into "sub words&q

Re: stop words, synonyms... what's in it for me?

2007-05-21 Thread Daniel Naber
On Monday 21 May 2007 22:05, bhecht wrote: > Is there any point for me to start creating custom analyzers with filter > for stop words, synonyms, and implementing my own "sub string" filter, > for separating tokens into "sub words" (like "mainstrasse"=>

stop words, synonyms... what's in it for me?

2007-05-21 Thread bhecht
m (substitution mechanism) that can also be configured according to location within a phrase. It can also be configured to find synonyms taking into account spelling mistakes. Although it doesn't expand but only transforms to one certain replacement.It can find replacements for sub strings as well

Re: Synonyms and Aliases query

2007-03-27 Thread Erick Erickson
th enclosing multiple-word entries in quotes... Best Erick Thanks, Dave. -- View this message in context: http://www.nabble.com/Synonyms-and-Aliases-query-tf3473040.html#a9692225 Sent from the Lucene - Java Users mailing list archive at Nabble.com. ---

Re: Synonyms and Aliases query

2007-03-27 Thread daveburns
ext: http://www.nabble.com/Synonyms-and-Aliases-query-tf3473040.html#a9693265 Sent from the Lucene - Java Users mailing list archive at Nabble.com. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: Synonyms and Aliases query

2007-03-27 Thread sandeep chawla
in a synonym Engine... suppose synonyms of word x is syn(x)... then if y = syn(x) then x = syn(y) doesn't hold true always .( you might not get any synonyms of y..it depends on the data of synonym engine) so your synonym engine might be providing alias of bob as robert, rob,

Synonyms and Aliases query

2007-03-27 Thread daveburns
s analyzer to match I.B.M=IBM=International Business Machines? i.e. multiple words to a single word. Thanks, Dave. -- View this message in context: http://www.nabble.com/Synonyms-and-Aliases-query-tf3473040.html#a9692225 Sent from the Lucene - Java Users mailing list archive at

Re: Search w/o looking at synonyms?

2006-08-06 Thread Yonik Seeley
On 8/6/06, Michael J. Prichard <[EMAIL PROTECTED]> wrote: I created some indexes that use a SynonymAnalyzer and now I want to be able to offer a choice as to search the synonyms or not. The simplest method is to index the field more than once... one with synonyms, one without. -Yoni

Search w/o looking at synonyms?

2006-08-06 Thread Michael J. Prichard
Howdy, I created some indexes that use a SynonymAnalyzer and now I want to be able to offer a choice as to search the synonyms or not. If I search now it will find all docs since the analyzer created tokens in the same position. How do I tell my IndexSearcher to not look at those tokens

RE: Newbie synonyms question

2006-07-26 Thread Lee, Andrew J \(CA - Toronto\)
Thanks, Otis. I think the SynonymAnalyzer is the way to go, injecting the synonyms while removing the stop words. Andrew -Original Message- From: Otis Gospodnetic [mailto:[EMAIL PROTECTED] Sent: Wednesday, July 26, 2006 3:19 PM To: java-user@lucene.apache.org Subject: Re: Newbie

Re: Newbie synonyms question

2006-07-26 Thread Otis Gospodnetic
Hi Andrew, There is othing built into Lucene for synonyms, but you can grab the code from Lucene in Action to see how they can be handled (plus: http://www.lucenebook.com/search?query=synonyms for some context) Otis - Original Message From: "Lee, Andrew J (CA - Toronto)&quo

Newbie synonyms question

2006-07-24 Thread Lee, Andrew J \(CA - Toronto\)
Sorry if this question has already been answered, but it is regarding synonyms. I am using the WordNet/Synonyms index and using the following algorithm to create synonym searches (this is a dumbed down version): Look up a the "base" word in the synonym index In my search string, r

lowering score of doc if synonyms matched (synonyms indexed)

2006-05-10 Thread zzzzz shalev
i am currently adding synonyms at index time (and not expanding the query), i fear that there is a problem with this implementation: is there a way to lower the score of a document if it was found due to a synonyms match and not due to a match of the word queried. from what i understand

RE: lowering score of doc if synonyms matched (synonyms indexed)

2006-05-10 Thread Ziv Gome
As you might have already seen, Andrew Schetinin and I have published (at http://mail-archives.apache.org/mod_mbox/lucene-java-user/200603.mbox/[EMAIL PROTECTED]) a source code that handles synonyms at search time (query expansion). This code includes also a de-boost factor for synonyms

RE: Synonyms ...

2006-05-09 Thread Ziv Gome
nt: Friday, April 21, 2006 8:49 PM To: java-user@lucene.apache.org Subject: Synonyms ... Hi, What is the best way to implement the following? Document 1 contains the following text: "THE CZECH REPUBLIC ORGANIZATION" Document 2 contains the following text: "THE CZE ORGANISAT

RE: Synonyms ...

2006-04-22 Thread Colin Young
. I think you will need to create a custom tokenizer to deal with phrases you'd like to keep together. I have done something similar (e.g. United Kingdom has Britain, England and UK as synonyms), but in my case I'm indexing only one or two word "documents" not huge blocks of

Re: Synonyms ...

2006-04-21 Thread Yonik Seeley
On 4/21/06, Dragon Fly <[EMAIL PROTECTED]> wrote: > I don't think the SynonymAnalyzer described in LIA would work because > some of my "synonyms" contain multiple words. The SynonymFilter in Solr can handle multi-word synonyms. http://wiki.apache.org/solr/Analyzers

Synonyms ...

2006-04-21 Thread Dragon Fly
G" I don't think the SynonymAnalyzer described in LIA would work because some of my "synonyms" contain multiple words. Thank you. _ Don’t just search. Find. Check out the new MSN Search! http://search.msn.click-url.com/

  1   2   >