On Nov 30, 2006, at 10:54 AM, spinergywmy wrote:
Hi Grant,
Thanks for the tips. I will take ur adviced and look into the
link that u
send to me.
For my scenario will be every time the users upload the single
file, I
need to index that particular file. Previously was because the
previous
version of pdfbox integrate with log4j.jar file and I believe is the
log4j.jar cause the indexing performance and takes up a lot of memory
resources. However, the latest version of pdfbox doesn't need to
integrate
with log4j.jar, and I thought that will actually speed up the indexing
performance but the result was no.
I would isolate PDFBox and do some performance testing on it, then
submit your questions on the PDFBox forums, as they will know better
about PDFBox performance.
Good luck,
Grant
--------------------------
Grant Ingersoll
Center for Natural Language Processing
http://www.cnlp.org
Read the Lucene Java FAQ at http://wiki.apache.org/jakarta-lucene/
LuceneFAQ
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]