Re: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Robert Muir
1) >> 0.5 = fieldNorm(field=field, doc=0) >> >> On Wed, Oct 21, 2009 at 3:16 PM, Paul Libbrecht >> wrote: >> >> Can the dictionary have weights? >>> >>> überwachungsgesetz alone probably needs a higher rank than überwachung >>>

Re: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Paul Libbrecht
are combinations of other sub-compounds. Thanks! -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Wednesday, October 21, 2009 11:49 AM To: java-user@lucene.apache.org Subject: Re: Using org.apache.lucene.analysis.compound yes, your dictionary :) if überwachungsgesetz

Re: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Robert Muir
hanks! >> >> -Original Message- >> From: Robert Muir [mailto:rcm...@gmail.com] >> Sent: Wednesday, October 21, 2009 11:49 AM >> To: java-user@lucene.apache.org >> Subject: Re: Using org.apache.lucene.analysis.compound >> >> yes, your dictionary :) >

Re: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Robert Muir
--Original Message- > From: Robert Muir [mailto:rcm...@gmail.com] > Sent: Wednesday, October 21, 2009 11:49 AM > To: java-user@lucene.apache.org > Subject: Re: Using org.apache.lucene.analysis.compound > > yes, your dictionary :) > > if überwachungsgesetz is a real word, add

Re: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Paul Libbrecht
they are combinations of other sub-compounds. Thanks! -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Wednesday, October 21, 2009 11:49 AM To: java-user@lucene.apache.org Subject: Re: Using org.apache.lucene.analysis.compound yes, your dictionary :) if

RE: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Benjamin Douglas
@lucene.apache.org Subject: Re: Using org.apache.lucene.analysis.compound yes, your dictionary :) if überwachungsgesetz is a real word, add it to your dictionary. for example, if your dictionary is { "Rind", "Fleisch", "Draht", "Schere", "Geset

Re: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Robert Muir
yes, your dictionary :) if überwachungsgesetz is a real word, add it to your dictionary. for example, if your dictionary is { "Rind", "Fleisch", "Draht", "Schere", "Gesetz", "Aufgabe", "Überwachung" }, and you index Rindfleischüberwachungsgesetz, then all 3 queries will have the same score. but i

RE: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Benjamin Douglas
fleischgesetz"? -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Wednesday, October 21, 2009 5:12 AM To: java-user@lucene.apache.org Subject: Re: Using org.apache.lucene.analysis.compound Paul, there are two implementations in compounds, one is dictionary-base

Re: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Robert Muir
Paul, there are two implementations in compounds, one is dictionary-based, the other is hyphenation-grammar + dictionary (it restricts the decompounding based on hyphenation rules). You could also subclass the compound base class and implement your own. I haven't seen any user-measures (relevance,

Re: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Paul Libbrecht
I'm interested to this analyzer.. it had escaped me and solves an old problem! Could you report about its usage: - did you have to feed words in a dictionary? - does anyone have user-measures already? ... and the last question for the research fun: is there any approach towards preferring Üb

Re: Using org.apache.lucene.analysis.compound

2009-10-20 Thread Robert Muir
hi, it will work because it will also decompound "Rindfleish" into Rind and fleish, with posIncr=0 so if you index Rindfleischüberwachungsgesetz, then query with "Rindfleish", its matching because Rindfleish also gets decompounded into Rind and fleish. On Tue, Oct 20, 2009 at 8:35 PM, Benjamin Do