similar ArrayIndexOutOfBoundsException on searching and optimizing

Adam Constabaris Fri, 21 Apr 2006 06:49:02 -0700

This is a puzzler, I'm not sure if I'm doing something wrong or whetherI have a poisoned document, a corrupted index (failing to close myIndexModifier properly?) or what. The setup is this: I have twoprocesses (the backend and frontend of a CMS) that run in two differentVMs -- both use Lucene 1.9.1 with the PorterStemmerAnalyzer wrapper overthe StandardAnalyzer (from lucene-memory AnalyzerUtils).

The backend is responsible for index creation, updates, etc., while thefrontend process uses the created index. What's puzzling is that somequeries will die with an ArrayIndexOutOfBoundsException being thrown outof the BitVector class:


Caused by: java.lang.ArrayIndexOutOfBoundsException: 240
        at org.apache.lucene.util.BitVector.get(BitVector.java:63)

atorg.apache.lucene.index.SegmentTermDocs.read(SegmentTermDocs.java:133)

        at org.apache.lucene.search.TermScorer.next(TermScorer.java:105)

atorg.apache.lucene.search.DisjunctionSumScorer.advanceAfterCurrent(DisjunctionSumScorer.java:151)atorg.apache.lucene.search.DisjunctionSumScorer.next(DisjunctionSumScorer.java:125)atorg.apache.lucene.search.BooleanScorer2.score(BooleanScorer2.java:290)atorg.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:132)atorg.apache.lucene.search.IndexSearcher.search(IndexSearcher.java:99)

        at org.apache.lucene.search.Hits.getMoreDocs(Hits.java:65)
        at org.apache.lucene.search.Hits.<init>(Hits.java:44)
        at org.apache.lucene.search.Searcher.search(Searcher.java:44)
        at org.apache.lucene.search.Searcher.search(Searcher.java:36)

The only pattern I've been able to discern in queries that cause thisproblem is that (a) they search the "contents" field (tokenized,unstored, TermVector.YES), and (b) it *seems* that it mostly happenswith longer terms in the query. Although the frontend defaults to amultifield query, the same happens when I use "contents:<<term>>" anddoes not happen if I specify <<term>> and any other of the defaultfields used by the MultiFieldQueryParser.

Here's where it gets interesting: I've noticed that calling optimize()on the index as it's created by the server process is also throwing ahissy fit, with an *eerily similar* index:


java.lang.ArrayIndexOutOfBoundsException: 239
        at org.apache.lucene.util.BitVector.get(BitVector.java:63)

atorg.apache.lucene.index.SegmentReader.isDeleted(SegmentReader.java:288)atorg.apache.lucene.index.SegmentMerger.mergeFields(SegmentMerger.java:185)atorg.apache.lucene.index.SegmentMerger.merge(SegmentMerger.java:88)atorg.apache.lucene.index.IndexWriter.mergeSegments(IndexWriter.java:681)atorg.apache.lucene.index.IndexWriter.mergeSegments(IndexWriter.java:658)atorg.apache.lucene.index.IndexWriter.optimize(IndexWriter.java:517)atorg.apache.lucene.index.IndexWriter.addIndexes(IndexWriter.java:553)

Does anybody have any ideas about what I might be doing wrong, or ifI've possibly uncovered a bug? I'm too new to the scene to know where Iought to start with this.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

similar ArrayIndexOutOfBoundsException on searching and optimizing

Reply via email to