Re: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Robert Muir
1) >> 0.5 = fieldNorm(field=field, doc=0) >> >> On Wed, Oct 21, 2009 at 3:16 PM, Paul Libbrecht >> wrote: >> >> Can the dictionary have weights? >>> >>> überwachungsgesetz alone probably needs a higher rank than überwachung >>>

Re: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Paul Libbrecht
are combinations of other sub-compounds. Thanks! -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Wednesday, October 21, 2009 11:49 AM To: java-user@lucene.apache.org Subject: Re: Using org.apache.lucene.analysis.compound yes, your dictionary :) if überwachungsgesetz

Re: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Robert Muir
hanks! >> >> -Original Message- >> From: Robert Muir [mailto:rcm...@gmail.com] >> Sent: Wednesday, October 21, 2009 11:49 AM >> To: java-user@lucene.apache.org >> Subject: Re: Using org.apache.lucene.analysis.compound >> >> yes, your dictionary :) >

Re: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Robert Muir
--Original Message- > From: Robert Muir [mailto:rcm...@gmail.com] > Sent: Wednesday, October 21, 2009 11:49 AM > To: java-user@lucene.apache.org > Subject: Re: Using org.apache.lucene.analysis.compound > > yes, your dictionary :) > > if überwachungsgesetz is a real word, add

Re: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Paul Libbrecht
they are combinations of other sub-compounds. Thanks! -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Wednesday, October 21, 2009 11:49 AM To: java-user@lucene.apache.org Subject: Re: Using org.apache.lucene.analysis.compound yes, your dictionary :) if

RE: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Benjamin Douglas
@lucene.apache.org Subject: Re: Using org.apache.lucene.analysis.compound yes, your dictionary :) if überwachungsgesetz is a real word, add it to your dictionary. for example, if your dictionary is { "Rind", "Fleisch", "Draht", "Schere", "Geset

Re: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Robert Muir
yes, your dictionary :) if überwachungsgesetz is a real word, add it to your dictionary. for example, if your dictionary is { "Rind", "Fleisch", "Draht", "Schere", "Gesetz", "Aufgabe", "Überwachung" }, and you index Rindfleischüberwachungsgesetz, then all 3 queries will have the same score. but i

RE: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Benjamin Douglas
fleischgesetz"? -Original Message- From: Robert Muir [mailto:rcm...@gmail.com] Sent: Wednesday, October 21, 2009 5:12 AM To: java-user@lucene.apache.org Subject: Re: Using org.apache.lucene.analysis.compound Paul, there are two implementations in compounds, one is dictionary-base

Re: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Robert Muir
Paul, there are two implementations in compounds, one is dictionary-based, the other is hyphenation-grammar + dictionary (it restricts the decompounding based on hyphenation rules). You could also subclass the compound base class and implement your own. I haven't seen any user-measures (relevance,

Re: Using org.apache.lucene.analysis.compound

2009-10-21 Thread Paul Libbrecht
I'm interested to this analyzer.. it had escaped me and solves an old problem! Could you report about its usage: - did you have to feed words in a dictionary? - does anyone have user-measures already? ... and the last question for the research fun: is there any approach towards preferring Üb

Re: Using org.apache.lucene.analysis.compound

2009-10-20 Thread Robert Muir
hi, it will work because it will also decompound "Rindfleish" into Rind and fleish, with posIncr=0 so if you index Rindfleischüberwachungsgesetz, then query with "Rindfleish", its matching because Rindfleish also gets decompounded into Rind and fleish. On Tue, Oct 20, 2009 at 8:35 PM, Benjamin Do

Using org.apache.lucene.analysis.compound

2009-10-20 Thread Benjamin Douglas
Hello, I've found a number of posts in different places talking about how to perform decompounding, but I haven't found too many discussing how to use the results of decompounding. If anyone can answer this question or point me to an existing discussion it would be very helpful. In the descrip