since this question seems to be entirely about extracting text with PDFBox and not at all baout indexing the text once it's been extracted, perhaps it would be better suited for the PDFBox forums...
http://www.pdfbox.org/ http://sourceforge.net/forum/?group_id=78314 ...i suspect you would find a much larger PDFBox user base there who can be of more assistance. -Hoss --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]