: We would also entertain alternative indexing approaches. We even
: considered concatenating all the text of the contained docs into a doc
: indexed as the zipfile, but lucene only indexes part of a large file and
: even if that were resolved, proximity searches can return false
: positives.

Problem #1 can be resolved using IndexWriter.setMaxFieldLength(int)...

http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightly/javadoc/org/apache/lucene/index/IndexWriter.html#setMaxFieldLength(int)

Problem #2 can be resolved by adding the contents of each file in the zip
as a separate Field object (all with the same field name) and using an
Analyzer where this method...

http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightly/javadoc/org/apache/lucene/analysis/Analyzer.html#getPositionIncrementGap(java.lang.String)

...returns a number significantly larger than the biggest slop value you
intend to use when doing proximity queries. That gap is inserted between
the term positions of successive Field instances with the same name, so
terms from two different files end up too far apart to satisfy the query.

-Hoss
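
To make the position gap idea concrete, here is a rough sketch in plain Java. It does not call Lucene at all: the class name, the index() and near() helpers, and the sample file contents are all made up for illustration, tokenizing is a plain whitespace split, and the slop check is a simplified stand-in for what a real proximity query does. It only shows why a gap larger than the slop stops a match from spanning two files.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Plain-Java simulation (no Lucene calls) of term positions across several
// values of one field. All names here are hypothetical.
class PositionGapSketch {

    // Give every token of every value a position in one shared position
    // space. `gap` stands in for the value an Analyzer would return from
    // getPositionIncrementGap; it is added between successive values.
    static Map<String, List<Integer>> index(String[] values, int gap) {
        Map<String, List<Integer>> positions = new HashMap<>();
        int next = 0;
        for (int i = 0; i < values.length; i++) {
            if (i > 0) {
                next += gap; // push the next value's tokens further away
            }
            for (String token : values[i].toLowerCase().split("\\s+")) {
                if (token.isEmpty()) {
                    continue;
                }
                positions.computeIfAbsent(token, k -> new ArrayList<>()).add(next++);
            }
        }
        return positions;
    }

    // Simplified proximity test: true if some occurrence of `a` and some
    // occurrence of `b` have at most `slop` positions between them.
    static boolean near(Map<String, List<Integer>> positions, String a, String b, int slop) {
        List<Integer> none = Collections.<Integer>emptyList();
        for (int pa : positions.getOrDefault(a, none)) {
            for (int pb : positions.getOrDefault(b, none)) {
                if (Math.abs(pa - pb) - 1 <= slop) {
                    return true;
                }
            }
        }
        return false;
    }

    public static void main(String[] args) {
        // Two "files" from one zip, indexed as two values of the same field.
        String[] files = { "quarterly sales report", "budget forecast draft" };

        Map<String, List<Integer>> noGap = index(files, 0);
        Map<String, List<Integer>> withGap = index(files, 100);

        // "report" ends file 1 and "budget" starts file 2.
        System.out.println(near(noGap, "report", "budget", 2));      // true (false positive)
        System.out.println(near(withGap, "report", "budget", 2));    // false
        // Terms inside the same file still match with the gap in place.
        System.out.println(near(withGap, "quarterly", "report", 2)); // true
    }
}
```

With a gap of 0 the last term of one file sits right next to the first term of the next one, which is the false positive described in the question. With a gap of 100 and a slop of 2 the cross-file pair no longer matches, while pairs inside a single file are unaffected.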