Highlight Wildcard Queries

2011-01-25 Thread Wulf Berschin
Hi, I'm just migrating our small search customization from Lucene version 2.3 to the current version (3.0.3) and wonder why, in contrast to the old version, wildcard queries (which are our default, since we surround the search string with asterisks) are no longer highlighted. We're using the

RE: Highlight Wildcard Queries

2011-01-26 Thread Wulf Berschin
Thank you, Alexander and Uwe, for your help. I read Mark's explanation, but it seems to me that his changes are not contained in Lucene 3.0.3. So I commented out the rewrite, changed QueryTermScorer back to QueryScorer, and now the wildcard queries are highlighted again. Wulf

Re: Highlight Wildcard Queries: Scores

2011-01-26 Thread Wulf Berschin
e now? Wulf On 26.01.2011 11:06, Wulf Berschin wrote: Thank you, Alexander and Uwe, for your help. I read Mark's explanation, but it seems to me that his changes are not contained in Lucene 3.0.3. So I commented out the rewrite, changed QueryTermScorer back to QueryScorer and now I got the wil

Re: Highlight Wildcard Queries: Scores

2011-01-26 Thread Wulf Berschin
Sorry for bothering, that was my fault: in my subclass of QueryParser, which wraps * around the terms, I had not yet considered the new multiTermRewriteMethod. After adding this, scoring seems to work and even the rewrite is possible again. Wulf On 26.01.2011 15:10, Wulf Berschin wrote: Now

Re: Highlight Wildcard Queries: Scores

2011-01-26 Thread Wulf Berschin
h considering before assuming that scoring is necessary on wildcard terms. Best Erick On Wed, Jan 26, 2011 at 9:10 AM, Wulf Berschin wrote: Now I have the highlighted wildcards but obviously the scoring is lost. I see that a rewrite of the wildcard query produces a constant sc

Re: Highlight Wildcard Queries: Scores

2011-01-26 Thread Wulf Berschin
men http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message- From: Wulf Berschin [mailto:bersc...@dosco.de] Sent: Wednesday, January 26, 2011 3:56 PM To: java-user@lucene.apache.org Subject: Re: SPAM(5.0) Re: Highlight Wildcard Queries: Scores Hi Erick, good points, but:

How to index part numbers

2011-01-28 Thread Wulf Berschin
Hi, I'm poking in the dark and hope someone has some light... We need to retrieve part numbers from technical documentation. For now we use a (long) regular expression to find them in a string. The part numbers contain letters, digits and (redundant) whitespace. Furthermore, authors often used a
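A minimal sketch of one way to deal with the redundant whitespace mentioned above: strip it before indexing and before querying, so that differently spaced spellings of the same part number compare equal. The class name, the method name and the sample part number are hypothetical and do not come from the thread, and the rest of the original question is cut off, so this only covers the whitespace aspect.

```java
class PartNumberNormalizer {
    // Removes all whitespace so that "AB 12  34 C" and "AB1234C" yield the same term.
    // Assumption: whitespace carries no meaning inside a part number.
    static String normalize(String raw) {
        return raw.replaceAll("\\s+", "");
    }

    public static void main(String[] args) {
        // "AB 12  34 C" is an invented example value.
        System.out.println(normalize("AB 12  34 C"));
    }
}
```

The same normalization would have to be applied on both the indexing side and the query side for the terms to match.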

Re: How to index part numbers

2011-01-28 Thread Wulf Berschin
with? It's because I also work with technical documentation and we do exactly what you're asking for, but it is XML data. On Fri, Jan 28, 2011 at 1:05 PM, Wulf Berschin wrote: Hi, I'm poking in the dark and hope someone has some light... We have part numbers in technical docum

Undo hyphenation when indexing

2011-04-01 Thread Wulf Berschin
Hi, for indexing PDF files we have to undo word hyphenation. The basic idea is simply to remove the hyphen when a newline and a lowercase letter follow. Of course this approach isn't 100% foolproof, but checking against a dictionary wouldn't be either... Since we face this problem too when hig
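The rule described above (drop the hyphen when a line break and a lowercase letter follow) can be sketched with a plain regular expression, independent of Lucene. This is only an illustration: the class and method names are hypothetical, and skipping spaces or tabs at the start of the next line is an added assumption that the message does not state.

```java
import java.util.regex.Pattern;

class Dehyphenator {
    // A hyphen, a line break (LF or CRLF), optional indentation (assumption),
    // followed by a lowercase letter. The letter is matched by a lookahead,
    // so it is kept while the hyphen and the line break are removed.
    private static final Pattern HYPHEN_BREAK =
            Pattern.compile("-\\r?\\n[ \\t]*(?=\\p{Ll})");

    static String undoHyphenation(String text) {
        return HYPHEN_BREAK.matcher(text).replaceAll("");
    }

    public static void main(String[] args) {
        // prints "highlighting of wildcards"
        System.out.println(undoHyphenation("high-\nlighting of wild-\n  cards"));
    }
}
```

As the message says, the heuristic is not foolproof: a genuinely hyphenated compound that happens to break at its hyphen, such as "well-" followed by "known" on the next line, is joined to "wellknown".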

Re: Undo hyphenation when indexing

2011-04-04 Thread Wulf Berschin
ucene/Solr User Conference, May 25-26, San Francisco On Fri, Apr 1, 2011 at 11:50 AM, Wulf Berschin wrote: Hi, for indexing PDF files we have to undo word hyphenation. The basic idea is simply to remove the hyphen when a newline and a lowercase letter follow. Of course this approach isn't 100%-foolp

PDF Highlighting using PDF Highlight File

2011-05-10 Thread Wulf Berschin
Hi all, in our Lucene 3.0.3-based web application, when a user clicks on a hit link, the targeted PDF should be opened in the browser with the hits highlighted. For this purpose using the Acrobat Highlight File (Parameter xml, see http://www.pdfbox.org/userguide/highlighting.html and http://partne

Re: PDF Highlighting using PDF Highlight File

2011-05-12 Thread Wulf Berschin
try { tokenStream.close(); } catch (Exception e) { } } } } } On 10.05.2011 12:32, Wulf Berschin wrote: Hi all, in our Lucene 3.0.3-based web application when a user clicks on a hit link the targeted PDF should be opened in the browser with hig