Have you measured to see how much of your time is spent indexing and how much is just parsing the file? You need to do this before having a clue what you need to make faster....
Erick On 11/10/06, Daniel Naber <[EMAIL PROTECTED]> wrote:
On Friday 10 November 2006 12:18, spinergywmy wrote: > I having this indexing the pdf file performance issue. It took me more > than 10 sec to index a pdf file about 200kb. Is it because I only have a > segment file? How can I make the indexing performance better? PDFBox (which I assume you are using) can be quite slow converting large PDF files to text. This has nothing to do with Lucene. Regards Daniel -- http://www.danielnaber.de --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]