An update, I have managed to get it to not fail by debugging and changing the value of org.apache.lucene.store.InputIndex.preUTF8Strings = true. The value is always false when it fails.
Mike -----Original Message----- From: Mike Streeton [mailto:mike.stree...@connexica.com] Sent: 28 April 2009 12:53 To: java-user@lucene.apache.org Subject: Read past EOF I have an index that works fine on Lucene 2.3.2 but fails to open in 2.4.1, it always fails with an Read past EOF. The index does contain some field names with german umlaut characters in Any ideas? Many Thanks Mike CheckIndex v2.3.2 NOTE: testing will be more thorough if you run java with '-ea:org.apache.lucene', so assertions are enabled Opening index @ C:/index/german Segments file=segments_9 numSegments=1 version=FORMAT_SHARED_DOC_STORE [Lucene 2.3] 1 of 1: name=_3 docCount=235535 compound=true numFiles=1 size (MB)=301.684 no deletions test: open reader.........OK test: fields, norms.......OK [70 fields] test: terms, freq, prox...OK [1475862 terms; 25448796 terms/docs pairs; 28642994 tokens] test: stored fields.......OK [13560464 total field count; avg 57.573 fields per doc] test: term vectors........OK [0 total vector count; avg 0 term/freq vector fields per doc] No problems were detected with this index. CheckIndex v2.4.1 NOTE: testing will be more thorough if you run java with '-ea:org.apache.lucene...', so assertions are enabled Opening index @ C:/index/german Segments file=segments_9 numSegments=1 version=FORMAT_SHARED_DOC_STORE [Lucene 2.3] 1 of 1: name=_3 docCount=235535 compound=true hasProx=true numFiles=1 size (MB)=301.684 no deletions test: open reader.........FAILED WARNING: fixIndex() would remove reference to this segment; full exception: java.io.IOException: read past EOF at org.apache.lucene.store.BufferedIndexInput.refill(Unknown Source) at org.apache.lucene.store.BufferedIndexInput.readBytes(Unknown Source) at org.apache.lucene.store.BufferedIndexInput.readBytes(Unknown Source) at org.apache.lucene.store.IndexInput.readString(Unknown Source) at org.apache.lucene.index.FieldInfos.read(Unknown Source) at org.apache.lucene.index.FieldInfos.<init>(Unknown Source) at org.apache.lucene.index.SegmentReader.initialize(Unknown Source) at org.apache.lucene.index.SegmentReader.get(Unknown Source) at org.apache.lucene.index.SegmentReader.get(Unknown Source) at org.apache.lucene.index.CheckIndex.checkIndex(Unknown Source) at org.apache.lucene.index.CheckIndex.main(Unknown Source) WARNING: 1 broken segments (containing 235535 documents) detected WARNING: would write new segments file, and 235535 documents would be lost, if -fix were specified --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org