Re: how newer documents have a better score

2005-11-15 Thread gekkokid
thanks :) _gk - Original Message - From: "Chris Hostetter" <[EMAIL PROTECTED]> To: Sent: Wednesday, November 16, 2005 6:10 AM Subject: Re: how newer documents have a better score : Hi, can anyone give me some pointers on making newer documents have a : better ranking/score? i.e. do

Re: how newer documents have a better score

2005-11-15 Thread Chris Hostetter
: Hi, can anyone give me some pointers on making newer documents have a : better ranking/score? i.e. documents i indexed today have a higher : ranking/score in the index than documents that were index yesterday etc This topic has come up more then once in the past, a good starting point at an app

Re: How to give weight to document when adding to the index?

2005-11-15 Thread gekkokid
boosting the document using the method setBoost(double), i think the param is a double but not sure, it works on both Document and Field objects lil'example: Document d = new Document(); d.add(Field.Keyword("name","gekkokid")); d.setBoost(1.1); // default is 1.0 i believe writer.ad

Re: Memory Usage

2005-11-15 Thread Daniel Noll
Marvin Humphrey wrote: The formatting of the results turned up a little screwy in my email reader, so here's a reformatted version... I noticed the same thing on Thunderbird, although viewing the source showed that the original was okay, and KMail didn't seem to have the same issue. Howeve

How to give weight to document when adding to the index?

2005-11-15 Thread Supheakmungkol SARIN
Dear Luceners, I'd like to give different weight to each of the documents in the index. Is it possible to do this at the time we add the document to the index? Regards, Mungkol - Yahoo! FareChase - Search multiple travel sites in

Re: Memory Usage

2005-11-15 Thread Marvin Humphrey
Good stuff, Daniel... Thanks for taking the time to tabulate the results and present them. If your results hold, it may have a significant impact on my application. I'm working on a Perl/XS port, and I think a lot of people who want to run it won't be running mod_perl, so startup times

how newer documents have a better score

2005-11-15 Thread gekkokid
Hi, can anyone give me some pointers on making newer documents have a better ranking/score? i.e. documents i indexed today have a higher ranking/score in the index than documents that were index yesterday etc Thanks _gk

Re: Memory Usage

2005-11-15 Thread Daniel Noll
Doug Cutting wrote: Marvin Humphrey wrote: You *can't* set it on the reader end. If you could set it, the reader would get out of sync and break. The value is set per-segment at write time, and the reader has to be able to adapt on the fly. It would actually not be too hard to change

Re: De-duping MultiSearcher results

2005-11-15 Thread Daniel Noll
Jason Calabrese wrote: Maybe I'm missing something simple, but I don't see how this will work. It looks like this filter will just filter out documents that don't have guid field, but in my case every document has a guid. Actually, it filters out every document except the *first* which h

Re: Searching Special Characters

2005-11-15 Thread Daniel Noll
Mordo, Aviran (EXP N-NANNATEK) wrote: You can use your own Analyzer to support special characters. Just process the special characters in your analyzer That's one option. The "correct" solution would be, since this is presumably HTML or XML, replacing entities with their actual string valu

RE: Searching Special Characters

2005-11-15 Thread Mordo, Aviran (EXP N-NANNATEK)
You can use your own Analyzer to support special characters. Just process the special characters in your analyzer Aviran http://www.aviransplace.com -Original Message- From: Lucene User [mailto:[EMAIL PROTECTED] Sent: Tuesday, November 15, 2005 11:00 AM To: java-user@lucene.apache.org S

Re: Items in multiple category: distinct search?

2005-11-15 Thread John Powers
I got the category thing to work. I'm using a string "category" that looks like: Cat1[980455981]{4}|SubCatA[980456122]{1}| Then I can search by id, name and when I get a result list I can put them in sequence (the soft bracket number) However, my problem now is distinct... I want to put items

Re: Items in multiple category: distinct search?

2005-11-15 Thread Jeff Rodenburg
Hi John - It sounds like you're thinking of your index in terms of sql constructs -- multiple rows for the same record. We do this very same thing with categories; if you have a record that lives in multiple categories, just add additional category field/value pairs for your original record. It's

Re: FileNotFoundException while indexing data

2005-11-15 Thread amolb
Hi everybody, I ran the same code on linux and it has worked very well. It could be related to OS resource issue, but I am not sure as did not try to debug on windows. I hope this help others in case of such problems. thanks Amol amolb wrote: Hi everybody, I am trying to index arround 10 la

IndexOutOfBoundsException

2005-11-15 Thread Nestel, Frank IZ/HZA-IOL
Hello, we're running into strange Lucene problems here right now: Occassionally certain lists of hits do not build, but we end with an Exception. See below. While this error appears often, it is not determinstic, i.e. you repeat the identical search with the same result, you might get no such exc

Searching Special Characters

2005-11-15 Thread Lucene User
Hi Our index contains articles with special characters. For instance, the string P&O is indexed as P&O. The correct entity codes are indexed for all the special characters we use. My question is that a typical user searching for the above will enter P&O but that will not match P&O. I know I coul

Items in multiple category: distinct search?

2005-11-15 Thread John Powers
Hello,  I have done a search for this issue, but I didnt really see an answer. if i want to store things in a hierarchy, but i want items to be in multiple categories at any level.   i'd like to be able to search for an item and only show it once, but maybe (not required) show what categories the

Memory fault

2005-11-15 Thread MALCOLM CLARK
I'm currently trying to index another collection. I am suffering a problem with writer.close.Basically at the end of indexing it only works if I remove the writer.close.It simple can't find the routine despite being able to find writer.optimize. Has anyone else discovered this problem and what

Re: Search on multiple columns

2005-11-15 Thread Manoj Kr. Sheoran
100 Thanks Tarjei - Original Message - From: "Tarjei Lægreid" <[EMAIL PROTECTED]> To: Sent: Tuesday, November 15, 2005 5:01 PM Subject: Re: Search on multiple columns Hi, you can use the MultiFieldQueryParser, i.e. Query query = MultiFieldQueryParser.parse(query, fields, analyzer). Re

Re: Search on multiple columns

2005-11-15 Thread Tarjei Lægreid
Hi, you can use the MultiFieldQueryParser, i.e. Query query = MultiFieldQueryParser.parse(query, fields, analyzer). Regards, Tarjei On 11/15/05, Manoj Kr. Sheoran <[EMAIL PROTECTED]> wrote: > > Hi, > Is there any way to search on multiple columns. > > i.e. > Query qry = QueryParser.parse(line,"

Search on multiple columns

2005-11-15 Thread Manoj Kr. Sheoran
Hi, Is there any way to search on multiple columns. i.e. Query qry = QueryParser.parse(line,"contents",anlz); In above statement I can search on contents field only but I would like to search data on two Fields at a time. Is there any way to do it ? Regards, Manoj - Original Message

Deprecated API in BooleanQuery broken in Lucene from CVS?

2005-11-15 Thread Patrick Kimber
Hi I have checked out the latest version of Lucene from CVS and have found a change in the results compared to version 1.4.3. The issue is with the deprecated API in the BooleanQuery class. The deprecated function: "public void add(Query query, boolean required, boolean prohibited)" is returning d

RE: Memory Usage

2005-11-15 Thread Vanlerberghe, Luc
Since an IndexReader can't know what indexInterval was used and that each segment could have a different indexInterval, wouldn't it be better too have a parameter that sets an average indexInterval that should be used? The fraction you talk about could then be calculated by the IndexReader per segm