Re: Highligher Example

2006-09-12 Thread Tom Emerson
Autonomy's KeyView is an alternative to Stellent. It does not cover all of the file formats that Stellent does, though many of them are probably not interesting for most applications. When I last looked at it it did not handle mail archives, though there was a plan to add it. I found it more stabl

Re: Highligher Example

2006-09-11 Thread Till Kinstler
Mark Miller schrieb: > Highlighting a PDF document, last time I looked (quite a while ago), > involves supplying an xml file that describes offsets for highlighting. > You can specify the file in the URL. PDFBox (http://www.pdfbox.org/), which is also convenient for parsing PDFs, can generate tho

Re: Highligher Example

2006-09-11 Thread Shane Perry
Not sure if this is something of interest, but there is an open source project called File2XLIFF4j on Sourceforge.net (http://file2xliff4j.sourceforge.net/). The project converts many common file formats to XLIFF. It may be useful for getting a common format, highlighting, and the recreating

Re: Highligher Example

2006-09-10 Thread Daniel Noll
Dejan Nenov wrote: Second that - I was a client of Stellent - the libs work great but are expensive. To see Stellent in action - get a copy of the free X1 desktop search or the X1 server (Lucene based). I would say that the libs work great but are slow. One problem is that they don't provide a

RE: Highligher Example

2006-09-08 Thread Dejan Nenov
:[EMAIL PROTECTED] Sent: Friday, September 08, 2006 1:27 AM To: java-user@lucene.apache.org Subject: Re: Highligher Example If you have a budget for this stuff then Stellent provide tools for parsing multiple document types and also have a viewer that can display documents with their original

Re: Highligher Example

2006-09-08 Thread mark harwood
riginal Message From: Mark Miller <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Friday, 8 September, 2006 2:02:47 AM Subject: Re: Highligher Example Highlighting a PDF document, last time I looked (quite a while ago), involves supplying an xml file that describes offsets for h

Re: Highligher Example

2006-09-07 Thread Mark Miller
Highlighting a PDF document, last time I looked (quite a while ago), involves supplying an xml file that describes offsets for highlighting. You can specify the file in the URL. You can also do simple highlighting by passing in a list of words to be highlighted, but this does not even catch min

Re: Highligher Example

2006-09-07 Thread Mag Gam
Thanks for the quick response Erik. I will be getting my LIA book back very soon, I forgot it at a destination :-( Lets assume, there is a document called "hello.pdf" and it has the content "this is hello.pdf. It uses Acrobat" When I perform a search for "Acrobat", i want hello.pdf to show up, a

Re: Highligher Example

2006-09-07 Thread Erik Hatcher
There are test cases in the Highlighter codebase that exercise it and show its use, as well as a few examples of it in the "Lucene in Action" codebase. These examples output plain text with some prefix and suffix surrounding the highlighted terms. Highlighting text in a PDF is possible,

Highligher Example

2006-09-07 Thread Mag Gam
Hey Anyone have a search result highlighter example? I have various doc, PDFs, DOC, TXT, PPT, and I would like to show a highlight, similar to how google does it... tia