Re: Question about querying for files in a zip file

Chris Hostetter Fri, 08 Jun 2007 15:05:23 -0700

: We would also entertain alternative indexing approaches.  We even
: considered concatenating all the text of the contained docs into a doc
: indexed as the zipfile, but lucene only indexes part of a large file and
: even if that were resolved, proximity searches can return false
: positives.


problem #1 can be resolved using this...
http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightly/javadoc/org/apache/lucene/index/IndexWriter.html#setMaxFieldLength(int)

problem #2 can be resolved by making sure you add the contents of each
file in the zip as a separate Field object (all with the same name) and
making sure you use an Analyzer where this method...
http://lucene.zones.apache.org:8080/hudson/job/Lucene-Nightly/javadoc/org/apache/lucene/analysis/Analyzer.html#getPositionIncrementGap(java.lang.String)
...returns a number significantly larger than the biggest slop
value you intend to use when doing proximity queries.
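
To see why the gap helps, here is a rough plain-Java sketch of the idea. It
does not use Lucene; it only models how token positions might be assigned
across several values of one field, with a gap added between values. The
class and method names (PositionGapSketch, positionsOf, withinSlop) are made
up for illustration, and the slop check is a simplified distance test, not
Lucene's actual sloppy-phrase scoring.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical model, not Lucene code: shows the effect of a position gap
// between multiple values of the same field.
class PositionGapSketch {

    // Returns the positions at which 'term' occurs when every value is
    // tokenized on whitespace and 'gap' extra positions are skipped
    // between consecutive values.
    static List<Integer> positionsOf(String term, List<String> values, int gap) {
        List<Integer> result = new ArrayList<Integer>();
        int position = 0;
        for (int v = 0; v < values.size(); v++) {
            if (v > 0) {
                position += gap; // the gap between one file's text and the next
            }
            for (String token : values.get(v).trim().split("\\s+")) {
                if (token.isEmpty()) {
                    continue; // ignore empty values
                }
                if (token.equalsIgnoreCase(term)) {
                    result.add(position);
                }
                position++;
            }
        }
        return result;
    }

    // Simplified proximity test: true if some occurrence of 'a' and some
    // occurrence of 'b' have at most 'slop' positions between them.
    static boolean withinSlop(String a, String b, List<String> values, int gap, int slop) {
        for (int pa : positionsOf(a, values, gap)) {
            for (int pb : positionsOf(b, values, gap)) {
                if (Math.abs(pa - pb) - 1 <= slop) {
                    return true;
                }
            }
        }
        return false;
    }

    public static void main(String[] args) {
        // Two "files" from the same zip, added as two values of one field.
        List<String> files = Arrays.asList("quarterly budget report", "meeting notes draft");
        System.out.println(withinSlop("report", "meeting", files, 0, 2));
        System.out.println(withinSlop("report", "meeting", files, 100, 2));
        System.out.println(withinSlop("budget", "report", files, 100, 2));
    }
}
```

With a gap of 0 the last word of one file and the first word of the next sit
on adjacent positions, so a proximity query can match across the file
boundary. With a gap well above the slop, only terms inside the same file
can match.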



-Hoss

