Re: Fields with phrases

2006-09-11 Thread Chris Hostetter
: I have a field called "location" on my index. For example, this string: "A : B" "A C" D was stored on my index : When I search for "location: ", these are the results that I'd like to : retrieve: : 1) location: D -- 1 hit : 2) location: A -- no hits : 3) location: "A B" -- 1 hit : 4) location:

Re: Changing the Scoring api

2006-09-11 Thread Chris Hostetter
: I want to override the default scoring when it comes to queries : containing the OR operator. this mesages seems to be an exact repost of your question from last friday ... was theresomething wrong with teh suggestions i included in my reply to it? http://www.nabble.com/Changing-the-Scoring-ap

Fields with phrases

2006-09-11 Thread Leandro Saad
Hi all, I have a field called "location" on my index. For example, this string: "A B" "A C" D was stored on my index When I search for "location: ", these are the results that I'd like to retrieve: 1) location: D -- 1 hit 2) location: A -- no hits 3) location: "A B" -- 1 hit 4) location: "A C" -

Re: Fields with phrases

2006-09-11 Thread Erick Erickson
I know of no way of doing this with the standard analyzers, unless you do some fooling around.. I think you'd have to write your own analyzer/tokenizer that you use both at indexing time and query parsing time that broke the input streams up the way you want. In this case, A B would be a SINGLE t

Fields with phrases

2006-09-11 Thread Leandro Saad
Hi all, I have a field called "location" on my index. For example, this string: "A B" "A C" D was stored on my index When I search for "location: ", these are the results that I'd like to retrieve: 1) location: D -- 1 hit 2) location: A -- no hits 3) location: "A B" -- 1 hit 4) location: "A C" -

Re: Highligher Example

2006-09-11 Thread Till Kinstler
Mark Miller schrieb: > Highlighting a PDF document, last time I looked (quite a while ago), > involves supplying an xml file that describes offsets for highlighting. > You can specify the file in the URL. PDFBox (http://www.pdfbox.org/), which is also convenient for parsing PDFs, can generate tho

Re: Highligher Example

2006-09-11 Thread Shane Perry
Not sure if this is something of interest, but there is an open source project called File2XLIFF4j on Sourceforge.net (http://file2xliff4j.sourceforge.net/). The project converts many common file formats to XLIFF. It may be useful for getting a common format, highlighting, and the recreating

Re: duplicate fields

2006-09-11 Thread Erick Erickson
I'm not at all sure what you're asking. I believe you can use a TermEnum with an empty term ("") to get all the terms in a particular field. If you're asking "how can I find all the fields in a document", well, that's tricky. Since there's no requirement that every document have the same fields,

Changing the Scoring api

2006-09-11 Thread Marcus Falck
Hi everyone, I want to override the default scoring when it comes to queries containing the OR operator. For example if I got the following headlines in my index : "Sun sues Microsoft" "Microsoft want to buy Tiscali" ".NU domain sues Microsoft" "The sun is shining" "Sun brings antitrus