Hi,

We have 500k xml documents in a file. We ran the digest/lucene and get the
following error messages. We had ran a smaller size file with 20k xml
documents without any problems. Can anyone help us to resolve this problem?
Thank you very much.

Regards,

Mark


Jan 3, 2007 10:17:37 AM org.apache.commons.digester.Digester endElement
SEVERE: End event threw exception
java.lang.reflect.InvocationTargetException
       at sun.reflect.GeneratedMethodAccessor23.invoke(Unknown Source)
       at sun.reflect.DelegatingMethodAccessorImpl.invoke(
DelegatingMethodAccessorImpl.java:25)
       at java.lang.reflect.Method.invoke(Method.java:585)
       at org.apache.commons.beanutils.MethodUtils.invokeMethod(
MethodUtils.java:216)
       at org.apache.commons.digester.SetNextRule.end(SetNextRule.java:217)
       at org.apache.commons.digester.Rule.end(Rule.java:253)
       at org.apache.commons.digester.Digester.endElement(Digester.java
:1222)
       at org.apache.xerces.parsers.AbstractSAXParser.endElement(Unknown
Source)
       at
org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanEndElement(Unknown
Source)
       at
org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown
Source)
       at
org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown
Source)
       at org.apache.xerces.parsers.XML11Configuration.parse(Unknown
Source)
       at org.apache.xerces.parsers.XML11Configuration.parse(Unknown
Source)
       at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
       at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
       at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown
Source)
       at org.apache.commons.digester.Digester.parse(Digester.java:1728)
       at DOCUMENTParserIndexer.main(DOCUMENTParserIndexer.java:127)
Caused by: java.lang.NullPointerException
       at org.apache.lucene.index.FieldsReader.doc(FieldsReader.java:82)
       at org.apache.lucene.index.SegmentReader.document(SegmentReader.java
:284)
       at org.apache.lucene.index.SegmentMerger.mergeFields(
SegmentMerger.java:186)
       at org.apache.lucene.index.SegmentMerger.merge(SegmentMerger.java
:88)
       at org.apache.lucene.index.IndexWriter.mergeSegments(
IndexWriter.java:681)
       at org.apache.lucene.index.IndexWriter.mergeSegments(
IndexWriter.java:658)
       at org.apache.lucene.index.IndexWriter.maybeMergeSegments(
IndexWriter.java:646)
       at org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java
:453)
       at org.apache.lucene.index.IndexWriter.addDocument(IndexWriter.java
:436)
       at DOCUMENTParserIndexer.addDOCUMENT(DOCUMENTParserIndexer.java:59)
       ... 18 more
Exception in thread "main" java.lang.NullPointerException
       at org.apache.commons.digester.Digester.createSAXException(
Digester.java:3181)
       at org.apache.commons.digester.Digester.createSAXException(
Digester.java:3207)
       at org.apache.commons.digester.Digester.endElement(Digester.java
:1225)
       at org.apache.xerces.parsers.AbstractSAXParser.endElement(Unknown
Source)
       at
org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanEndElement(Unknown
Source)
       at
org.apache.xerces.impl.XMLDocumentFragmentScannerImpl$FragmentContentDispatcher.dispatch(Unknown
Source)
       at
org.apache.xerces.impl.XMLDocumentFragmentScannerImpl.scanDocument(Unknown
Source)
       at org.apache.xerces.parsers.XML11Configuration.parse(Unknown
Source)
       at org.apache.xerces.parsers.XML11Configuration.parse(Unknown
Source)
       at org.apache.xerces.parsers.XMLParser.parse(Unknown Source)
       at org.apache.xerces.parsers.AbstractSAXParser.parse(Unknown Source)
       at org.apache.xerces.jaxp.SAXParserImpl$JAXPSAXParser.parse(Unknown
Source)
       at org.apache.commons.digester.Digester.parse(Digester.java:1728)
       at DOCUMENTParserIndexer.main(DOCUMENTParserIndexer.java:127)

Reply via email to