Re: search performance enhancement

2005-09-20 Thread John Wang
Hi Paul and other gurus: In a related topic, seems lucene is scoring documents that would hit in a "prohibited" boolean clause, e.g. NOT field:value. It doesn't seem to make sense to score a document that is to be excluded from the result. Is this a difficult thing to fix? Also in Paul's ealie

Re: Stale NFS file handle Exception

2005-09-20 Thread John Wang
How do you "update" the index? -John On 9/12/05, Harini Raghavan <[EMAIL PROTECTED]> wrote: > > Hi All, > I have 2 servers in the production environment, one running some Quartz > jobs and the other one running the application. There is a common NFS > mount which has the lucene index directory.

Re: "Best-practice" in a web application AND live update of index used by Tomcat

2005-09-20 Thread Ramanathan Krishnan
Hi, I am a newbie to lucene but I guess the IOException will occur only in windows as window os doesn't close the files properly. It shud work fine in Linux. I had a similar problem when recreating the same index. The approach I took was to delete all documents and start adding new ones, inste

Re: Storing HashMap as an UnIndexed Field

2005-09-20 Thread markharw00d
Or using XMLEncoder: HashMap map=new HashMap(); map.put("foo","bar"); ByteArrayOutputStream baos=new ByteArrayOutputStream(); XMLEncoder encoder =new XMLEncoder(baos); encoder.writeObject(map); encoder.flush(); System.out.println(baos.toString());

Re: How the default ranking works

2005-09-20 Thread Erik Hatcher
On Sep 20, 2005, at 4:19 PM, tirupathi reddy wrote: I don't have that book. Is it available in the internet? You can order it online and have it shipped to you from many online bookstores - or buy the PDF directly from Manning online. Links at the top of this page: http://www.luce

Re: How the default ranking works

2005-09-20 Thread Otis Gospodnetic
--- tirupathi reddy <[EMAIL PROTECTED]> wrote: > I don't have that book. Is it available in the internet? Of course, here it is: http://www.manning.com/books/hatcher2 (Add to Cart link) Otis - To unsubscribe, e-mail: [EMAIL

Re: How the default ranking works

2005-09-20 Thread tirupathi reddy
hello erik, I don't have that book. Is it available in the internet? Thanx, MTREDDY Tirupati Reddy Manyam 24-06-08, Sundugaullee-24, 79110 Freiburg GERMANY. Phone: 00497618811257 cell : 004917624649007 - Yahoo! for Good Click here to

Re: Storing HashMap as an UnIndexed Field

2005-09-20 Thread jian chen
well, certainly you can serialize into a byte stream and encode it using base64. Jian On 9/20/05, Mordo, Aviran (EXP N-NANNATEK) <[EMAIL PROTECTED]> wrote: > > I can't think of a way you can use serialization, since lucene only > works with strings. > > -Original Message- > From: Trici

Re: How the default ranking works

2005-09-20 Thread Erik Hatcher
On Sep 20, 2005, at 4:03 PM, tirupathi reddy wrote: How the hits are ranked in default case. If I have say some query like this: title:"measurement procedure" AND id:ep6289* Say I have some 10 documents matched with that query, how my hits will be displayed. Which record will display fi

How the default ranking works

2005-09-20 Thread tirupathi reddy
Hello, How the hits are ranked in default case. If I have say some query like this: title:"measurement procedure" AND id:ep6289* Say I have some 10 documents matched with that query, how my hits will be displayed. Which record will display first and how it will do the ranking in default

RE: Storing HashMap as an UnIndexed Field

2005-09-20 Thread Mordo, Aviran (EXP N-NANNATEK)
I can't think of a way you can use serialization, since lucene only works with strings. -Original Message- From: Tricia Williams [mailto:[EMAIL PROTECTED] Sent: Tuesday, September 20, 2005 3:30 PM To: java-user@lucene.apache.org Subject: RE: Storing HashMap as an UnIndexed Field Do you

Re: Storing HashMap as an UnIndexed Field

2005-09-20 Thread Erik Hatcher
On Sep 20, 2005, at 3:29 PM, Tricia Williams wrote: Do you think there is anyway that I could use the serialization already built into the HashMap data structure? A Document, when reconstituted from Hits, is essentially a glorified HashMap-like structure. I recommend you simply iterate yo

Re: "Best-practice" in a web application AND live update of index used by Tomcat

2005-09-20 Thread Matthias Bräuer
Hello, I have a question regarding your answers to two previous posts: >For best performance, use a single IndexSearcher instance across your entire application. >DelayCloseIndexSearcher overrides the close() method so it does not >close immediately: it only decrements the usage counter. [.

RE: Storing HashMap as an UnIndexed Field

2005-09-20 Thread Tricia Williams
Do you think there is anyway that I could use the serialization already built into the HashMap data structure? On Tue, 20 Sep 2005, Mordo, Aviran (EXP N-NANNATEK) wrote: > You can store the values as a coma separated string (which then you'll > need to parse manually back to a HashMap) > > -O

RE: Storing HashMap as an UnIndexed Field

2005-09-20 Thread Mordo, Aviran (EXP N-NANNATEK)
You can store the values as a coma separated string (which then you'll need to parse manually back to a HashMap) -Original Message- From: Tricia Williams [mailto:[EMAIL PROTECTED] Sent: Tuesday, September 20, 2005 3:14 PM To: java-user@lucene.apache.org Subject: Storing HashMap as an UnI

Storing HashMap as an UnIndexed Field

2005-09-20 Thread Tricia Williams
Hi, I'd like to store a HashMap for some extra data to be used when a given document is retrieved as a Hit for a query. To add an UnIndexed Field to an index takes only Strings as parameters. Does anyone have any suggestions on how I might convert the HashMap to a String that is efficiently r

Re: Sort by relevance+distance

2005-09-20 Thread markharw00d
To avoid caching 10,025 docs when you only want to see 10,000 to 10,025 (and assuming the user was paging through results) you might have to remember the lowest score used in the previous page of results to avoid adding those 10,000 docs with score > lastLowScore to the HitQueue again.

Re: (TermInfosReader, SegmentTermEnum) Out Of Memory Exception patch

2005-09-20 Thread Daniel Naber
On Tuesday 20 September 2005 12:13, kieran wrote: > I've thought of submitting this to the project as a patch, but the > lucene bugzilla account is disabled at the moment so...see the diff, > below, and the attached, patched, file: We've moved to JIRA but the link on our web page isn't updated ye

Re: date keyword

2005-09-20 Thread Erik Hatcher
On Sep 20, 2005, at 12:55 PM, haipeng du wrote: I understand that. But from Field API, there is a method of Keyword which accepts Date object as value. When I use that method to index, I can not get real date back. I also use other method to pass a string value to index. That works great. D

Re: date keyword

2005-09-20 Thread haipeng du
I understand that. But from Field API, there is a method of Keyword which accepts Date object as value. When I use that method to index, I can not get real date back. I also use other method to pass a string value to index. That works great. Does that mean I can not use that method to index keyw

RE: date keyword

2005-09-20 Thread Mordo, Aviran (EXP N-NANNATEK)
Lucene only uses strings to store and search, you should convert any objects to string. For dates you have a special Date field that you should use which converts dated to a searchable strings Aviran http://www.aviransplace.com -Original Message- From: haipeng du [mailto:[EMAIL PROTECTED

date keyword

2005-09-20 Thread haipeng du
I use lucene to index a key word with date object. When I search document, how could I process the searching result for that field? For example: index date with Field field = Field.Keyword("created", new Date); . .. When I search that, I get that field back Field f = doc.getField("created

RE: live update of index used by Tomcat

2005-09-20 Thread Vanlerberghe, Luc
You should keep your IndexReader open until the merge has finished, but also until there are no more Hits Objects that depend on it (tricky in multithreaded environments like tomcat). The fact that the files cannot be deleted immediately after the merge is no problem. The filenames will be stored

Re: Relevance Feedback

2005-09-20 Thread Grant Ingersoll
I have implemented (more or less) Rochio rel. feedback. You have to make some minor modifications b/c Lucene doesn't support boost values less than 0, but other than it is pretty straightforward using the TermVector support. At feedback time, get the TermVector for the top X documents and constr

Re: "Best-practice" in a web application

2005-09-20 Thread Erik Hatcher
On Sep 20, 2005, at 2:24 AM, Magne Skjeret wrote: I am using lucene to index all my data, and it is working just great. I will now add search to a web application, so the index can actually be used, not just sit there. Good idea... it'd be a shame for the index to sit unsearched! :) 1. C

(TermInfosReader, SegmentTermEnum) Out Of Memory Exception patch

2005-09-20 Thread kieran
We've been experiencing terrible memory problems on our production search server, running lucene (1.4.3). Our live app regularly opens new indexes and, in doing so, releases old IndexReaders for garbage collection. But...there appears to be a memory leak in org.apache.lucene.index.TermInfosR