Am 15.04.2013 10:04, schrieb Uwe Schindler: > The limit also applies for filters. If you have a list of terms ORed > together, the fastest way is not to use a BooleanQuery at all, but instead a > TermsFilter (which has no limits).
Hi Uwe, thanks for the pointer, this looks promising! The only missing piece for me is now how to use that filter in SpanQuery#getSpans(). I have generated a DocIdSet from the filter with getDocIdSet(AtomicReaderContext context, Bits.MatchAllBits).bits(), but for some reason this just doesn't filter anything. I am not sure what getSpans() expects the acceptDocs to be (I suppose the bits that correspond to docs that should be returned are to be set). This uncertainty roots in the getDocIdSet method because I am not sure what to use as an argument for acceptDocs there either. Best, Carsten -- Institut für Deutsche Sprache | http://www.ids-mannheim.de Projekt KorAP | http://korap.ids-mannheim.de Tel. +49-(0)621-43740789 | schno...@ids-mannheim.de Korpusanalyseplattform der nächsten Generation Next Generation Corpus Analysis Platform --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org