Autonomy's KeyView is an alternative to Stellent. It does not cover all of
the file formats that Stellent does, though many of them are probably not
interesting for most applications. When I last looked at it it did not
handle mail archives, though there was a plan to add it. I found it more
stabl
Mark Miller schrieb:
> Highlighting a PDF document, last time I looked (quite a while ago),
> involves supplying an xml file that describes offsets for highlighting.
> You can specify the file in the URL.
PDFBox (http://www.pdfbox.org/), which is also convenient for parsing
PDFs, can generate tho
Not sure if this is something of interest, but there is an open source
project called File2XLIFF4j on Sourceforge.net
(http://file2xliff4j.sourceforge.net/). The project converts many
common file formats to XLIFF. It may be useful for getting a common
format, highlighting, and the recreating
Dejan Nenov wrote:
Second that - I was a client of Stellent - the libs work great but are
expensive. To see Stellent in action - get a copy of the free X1 desktop
search or the X1 server (Lucene based).
I would say that the libs work great but are slow.
One problem is that they don't provide a
:[EMAIL PROTECTED]
Sent: Friday, September 08, 2006 1:27 AM
To: java-user@lucene.apache.org
Subject: Re: Highligher Example
If you have a budget for this stuff then Stellent provide tools for parsing
multiple document types and also have a viewer that can display documents
with their original
riginal Message
From: Mark Miller <[EMAIL PROTECTED]>
To: java-user@lucene.apache.org
Sent: Friday, 8 September, 2006 2:02:47 AM
Subject: Re: Highligher Example
Highlighting a PDF document, last time I looked (quite a while ago),
involves supplying an xml file that describes offsets for h
Highlighting a PDF document, last time I looked (quite a while ago),
involves supplying an xml file that describes offsets for highlighting.
You can specify the file in the URL. You can also do simple highlighting
by passing in a list of words to be highlighted, but this does not even
catch min
Thanks for the quick response Erik. I will be getting my LIA book back very
soon, I forgot it at a destination :-(
Lets assume, there is a document called "hello.pdf" and it has the content
"this is hello.pdf. It uses Acrobat"
When I perform a search for "Acrobat", i want hello.pdf to show up, a
There are test cases in the Highlighter codebase that exercise it and
show its use, as well as a few examples of it in the "Lucene in
Action" codebase.
These examples output plain text with some prefix and suffix
surrounding the highlighted terms. Highlighting text in a PDF is
possible,
Hey
Anyone have a search result highlighter example?
I have various doc, PDFs, DOC, TXT, PPT, and I would like to show a
highlight, similar to how google does it...
tia
10 matches
Mail list logo