Re: How to customize scoring using user feedback?

2007-03-15 Thread xiong
Peter W. marketingbrokers.com> writes: > > Hello, > > This is not currently in Lucene. > > Sounds like you are looking for a voting > system to generate float scores that would be > inserted as a sortable field at index time. > > Regards, > > Peter W. Hi Peter, But the voting is query depe

Re: How to customize scoring using user feedback?

2007-03-15 Thread Peter W.
Hello, This is not currently in Lucene. Sounds like you are looking for a voting system to generate float scores that would be inserted as a sortable field at index time. Gathering user feedback on search results is hard because you need to introduce a layer which logs the click then redirects

Re: How to customize scoring using user feedback?

2007-03-15 Thread karl wettin
16 mar 2007 kl. 02.13 skrev xiong: karl wettin gmail.com> writes: 15 mar 2007 kl. 11.21 skrev xiong: Just like google: the more user clicks of search results, the higher rank they are. Are you really sure Google does this? It would surprise me if they did. I'm not sure, actually. B

Re: How to customize scoring using user feedback?

2007-03-15 Thread xiong
daniel rosher hotonline.com> writes: > > Hi, > > This can be achieved by implementing your own implementation of the > SortComparitorSource interface. > > Section 6.1 of Lucene in Action will help you here. > > We currently use this method to alter the ranking of documents depending > on the

Re: How to customize scoring using user feedback?

2007-03-15 Thread xiong
karl wettin gmail.com> writes: > > > 15 mar 2007 kl. 11.21 skrev xiong: > > > Just like google: the more user clicks of search results, > > the higher rank they are. > > Are you really sure Google does this? It would surprise me if they did. > I'm not sure, actually. But using user feedbac

Re: IndexWriter.deleteDocuments(Term) vs IndexReader.deleteDocuments(Term)

2007-03-15 Thread Michael McCandless
"Antony Bowesman" <[EMAIL PROTECTED]> wrote: > The writer method does not return the number of deleted documents. Is > there a > technical reason why this is not done. > > I am planning to see about converting my batch deletions using > IndexReader to > IndexWriter, but I'm currently using the

IndexWriter.deleteDocuments(Term) vs IndexReader.deleteDocuments(Term)

2007-03-15 Thread Antony Bowesman
The writer method does not return the number of deleted documents. Is there a technical reason why this is not done. I am planning to see about converting my batch deletions using IndexReader to IndexWriter, but I'm currently using the return value to record stats. Does the following give th

Re: SpellChecker and Lucene 2.1

2007-03-15 Thread Ryan O'Hara
Thanks a ton, Hoss. I just did an ant on the contrib/spellchecker directory and it produced a jar file in the LUCENE_HOME/build/ directory. Replacing the old jar file with the new jar file fixed my errrors as I suspected. Thanks, again. -Ryan On Mar 15, 2007, at 1:38 PM, Chris Hostetter

off topic: ASF support vector implementation?

2007-03-15 Thread karl wettin
I'm looking for an Apache licensed SVM, preferably logistic. Is there such a thing? Any suggestion for a better forum to ask this question? -- karl - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [

search timeout

2007-03-15 Thread Sean Timm
Nutch recently added a search query timeout (NUTCH-308). Are there any plans to add such functionality to the Lucene HitCollector directly? Or is there some reason that this is a bad idea? I'm using Solr which doesn't seem to support search timeouts. It seems that it would make sense to add

RE: Is Lucene Java trunk still stable for production code?

2007-03-15 Thread Chris Hostetter
: Thanks for your insight, it matches my thoughts. The only reason I'm : wondering is because the latest change in the lucene_2_1 branch is 4 : weeks old. So either no bugs were found since then (yeah!) or the bugs : gets fixed in the trunk... The general policy that Doug has encouraged in teh

RE: Is Lucene Java trunk still stable for production code?

2007-03-15 Thread Jean-Philippe Robichaud
That's really wonderful. Everything gets cleaner now. Thanks, I mean really Thanks, for all the hard work that goes in Lucene code + Doc + Processes + Mailing list. Lucene is really something I refer other as "what (open source) software development should be". I'll go with lucene_2_1 ! Jp -

Re: SpellChecker and Lucene 2.1

2007-03-15 Thread Chris Hostetter
: My question basically was is there a spellchecker-2.*1*-rc1-dev.jar? There *should* be a spellchecker 2.1 jar in the contrib/spellchecker/ directory of the 2.1 release. -Hoss - To unsubscribe, e-mail: [EMAIL PROTECTED] Fo

RE: Is Lucene Java trunk still stable for production code?

2007-03-15 Thread Doron Cohen
> An index created with a formal version (major or minor), would be readable > (and hence upgradable) by a later version. In fact this is not guaranteed for every "later version" - only for those up to one major step ahead. The Wiki's statement is very accurate here: File formats are back-comp

RE: Is Lucene Java trunk still stable for production code?

2007-03-15 Thread Doron Cohen
A good reason for applications to only use formal versions (vs. trunk) is backwards compatibility (see "BackwardsCompatibility" in Lucene Wiki). An index created with a formal version (major or minor), would be readable (and hence upgradable) by a later version. But for interim trunk states this i

Re: SpellChecker and Lucene 2.1

2007-03-15 Thread karl wettin
15 mar 2007 kl. 15.33 skrev Ryan O'Hara: My question basically was is there a spellchecker-2.*1*-rc1-dev.jar? There has been changes since 2.0. You can also take a look at this: https://issues.apache.org/jira/browse/LUCENE-632 The problems I'm having... Exception: java.lang.NullPointerE

Re: Please suggest how to use the different analyse to accomodate number and alphanumeric search

2007-03-15 Thread Erick Erickson
I think you'd figure it all out if you just printed out the parsed query with toString(); And have you looked at your index with Luke to see what you've actually stored? And perhaps queried with Luke which, among other things, will show you the query as parsed by various analyzers? If you don't h

Re: Performance between Filter and HitCollector?

2007-03-15 Thread Erik Hatcher
On Mar 15, 2007, at 12:27 AM, Antony Bowesman wrote: Thanks for the detailed reponse Hoss. That's the sort of in depth golden nugget I'd like to see in a copy of LIA 2 when it becomes available... NOTED! :) Erik -

RE: Is Lucene Java trunk still stable for production code?

2007-03-15 Thread Jean-Philippe Robichaud
Thanks for your insight, it matches my thoughts. The only reason I'm wondering is because the latest change in the lucene_2_1 branch is 4 weeks old. So either no bugs were found since then (yeah!) or the bugs gets fixed in the trunk... I'll search JIRA and decides after! Thanks! Jp -Origi

Please suggest how to use the different analyse to accomodate number and alphanumeric search

2007-03-15 Thread Gaurav Srivastava
How to search numbers using Lucene API I am using a demo application so as to search the documents but when i search the numbers or alphanumeric text it appends an empty space and no hits are returned any help would be appreciated i am developing a new serach engine like Google .Currently it use

Re: SpellChecker and Lucene 2.1

2007-03-15 Thread Ryan O'Hara
My question basically was is there a spellchecker-2.*1*-rc1-dev.jar? The problems I'm having... Exception: java.lang.NullPointerException at org.apache.lucene.search.spell.SpellChecker.indexDictionary (SpellChecker.java:298) at DidYouMeanIndexer.createSpellIndex(DidYouMeanInd

Re: Is Lucene Java trunk still stable for production code?

2007-03-15 Thread Grant Ingersoll
Well, I think the standard disclaimer applies here: The trunk is not guaranteed to be stable (but, hey, what really is, right?). Releases are meant to be "production ready" to the best of our knowledge at the time. That being said, we do make efforts to keep the trunk stable. I guess t

Re: how to get approximate total matching

2007-03-15 Thread senthil kumaran
Hi, Sorry for my delayed reply. I can't reproduce that OOM error with MultiSearcher now.I'll try to reproduce that error and send you the details that you asked. I've tested to search my index with Multisearcher & separate IndexSearcher.The search time by using Multisearcher is nearly equa

Re: How to customize scoring using user feedback?

2007-03-15 Thread daniel rosher
Hi, This can be achieved by implementing your own implementation of the SortComparitorSource interface. Section 6.1 of Lucene in Action will help you here. We currently use this method to alter the ranking of documents depending on the age of the document by multiplying the current score by a ca

Re: How to customize scoring using user feedback?

2007-03-15 Thread karl wettin
15 mar 2007 kl. 11.21 skrev xiong: Just like google: the more user clicks of search results, the higher rank they are. Are you really sure Google does this? It would surprise me if they did. How to implement this in lucene? I've read the javadoc of org.apache.lucene.search package, but

How to customize scoring using user feedback?

2007-03-15 Thread xiong
Hi there, Just like google: the more user clicks of search results, the higher rank they are. How to implement this in lucene? I've read the javadoc of org.apache.lucene.search package, but still dont know how. Some sample code will be great. Thanks in advance, Xiong -

Re: Performance between Filter and HitCollector?

2007-03-15 Thread eks dev
great! a few words to refresh my memory, it's been a while... - This patch lays only groundwork and should not cause any performance changes in existing code per se, that is what we have tested extensively some months ago (compatibility), it applied cleanly, passed all Junit tests and our in