Hi Bill,

Bill Taylor wrote:
> On Oct 16, 2006, at 5:44 AM, Christoph Pächter wrote:
>> I know that I can index pdf-files (using a third-party library).
> 
> Could you please tell me where to find this library? 

There are several PDF extraction packages listed here (look under the
"Lucene Document Converters" heading):

<http://lucene.apache.org/java/docs/contributions.html>

I haven't used it myself, but the documentation for PDFBox (one of the
packages listed on that page) describes how to integrate it with
Lucene:

<http://www.pdfbox.org/userguide/text_extraction.html#Lucene+Integration>

Steve

