Hi Bill, Bill Taylor wrote: > On Oct 16, 2006, at 5:44 AM, Christoph Pächter wrote: >> I know that I can index pdf-files (using a third-party library). > > Could you please tell me where to find this library?
There are several PDF extraction packages listed here (look under the "Lucene Document Converters" heading): <http://lucene.apache.org/java/docs/contributions.html> I haven't personally used it, but the documentation for PDF Box (one of the packages listed on the above-linked page) describes integration with Lucene: <http://www.pdfbox.org/userguide/text_extraction.html#Lucene+Integration> Steve --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]