Re: Indexing of virtual "made up" documents

2005-05-03 Thread Erik Hatcher
On Apr 30, 2005, at 7:01 AM, Daniel Stephan wrote: Erik, thank you very much for your help! I am not in the position to build the indexing (other features are in line before that), yet, but I will try Lucene for it. Looks very good :) What I did not ask the last time, because it just occurred to

Re: Indexing of virtual "made up" documents

2005-04-30 Thread Daniel Stephan
Erik, thank you very much for your help! I am not in the position to build the indexing (other features are in line before that), yet, but I will try Lucene for it. Looks very good :) What I did not ask the last time, because it just occurred to me, was: I have in my application a metric of "fit"

Re: Indexing of virtual "made up" documents

2005-04-27 Thread Doug Cutting
Morus Walter wrote: Alternatively it should be able to write a query that does such a scoring directly (without the document start anchor) by the same means proximity query uses. Proximity query uses positional information so it should be possible to use that information for scoring based on docum

Re: Indexing of virtual "made up" documents

2005-04-26 Thread Morus Walter
Erik Hatcher writes: > > > > There are some information retrieval settings which tend to say that > > things that appear early in the document should be considered with > > greater score... is there nothing such in Lucene's scoring ? > > No, Lucene doesn't have that feature, at least not explici

Re: Indexing of virtual "made up" documents

2005-04-26 Thread Erik Hatcher
On Apr 26, 2005, at 4:46 PM, Paul Libbrecht wrote: Le 26 avr. 05, à 15:00, Erik Hatcher a écrit : I am not sure how Lucenes uses the placement information, but in the described case where I concatenate all my features to a whitespace-delimited text, I fear that Lucene uses the placement of features

Re: Indexing of virtual "made up" documents

2005-04-26 Thread Paul Libbrecht
Le 26 avr. 05, à 15:00, Erik Hatcher a écrit : I am not sure how Lucenes uses the placement information, but in the described case where I concatenate all my features to a whitespace-delimited text, I fear that Lucene uses the placement of features in this made-up text and comes to some wrong concl

Re: Indexing of virtual "made up" documents

2005-04-26 Thread Erik Hatcher
On Apr 26, 2005, at 3:21 AM, Daniel Stephan wrote: lets see if somebody listens on this list :-D I doubt many are on this list, yet. But your question is probably best asked on the [EMAIL PROTECTED] list rather than here. I'll CC java-user this time to loop those folks in. I wonder if the foll