Hi,
     I am using Lucene2.0. I am doing full text index of pdf file. To
extract the text from pdf I am using pdfbox library. When I start indexing
of pdf files I get Out of memory exception. This is becuase files are about
10 mb in size. I tried different value for mergefactor,  maxmergefactor and
maxbuffereddocs i.e. 100, 100; 100; 10, 100, 100; 100,100 ,1000; etc...   I
am storing the field value.

    I am not getting resolve this exception. Any suggestion to resolve the
issue.

Thanks

Reply via email to