SpanMultiTermQueryWrapper with PrefixQuery hitting num clause limit

2024-03-28 Thread Yixun Xu
Hello, We are trying to search for phrases where the last term is a prefix match. For example, find all documents that contain "foo bar.*", with a configurable slop between "foo" and "bar". We were able to do this using `SpanNearQuery` where the last clause is a `SpanMultiTermQueryWrapper` that wr

Re: SpanMultiTermQueryWrapper with PrefixQuery hitting num clause limit

2024-03-28 Thread Robert Muir
Using spans and wildcards together is asking for trouble: you will hit limits, and it is inefficient by definition. I'd recommend changing your indexing so that your queries are fast and you aren't using wildcards that enumerate many terms at search time. Don't index words such as "bar_294e50e1-fc3
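To illustrate the point about wildcards enumerating terms at search time, here is a minimal plain-Java sketch, not Lucene code. It assumes a sorted set as a stand-in for the term dictionary and a clause limit of 1024, which matches Lucene's default maximum clause count. The class name, method name, and sample terms are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.SortedSet;
import java.util.TreeSet;

class PrefixExpansionSketch {
    // Assumed limit for this sketch; 1024 is Lucene's default max clause count.
    static final int MAX_CLAUSES = 1024;

    // Collects every term that starts with the prefix, failing once the
    // expansion would exceed maxClauses (loosely how a multi-term rewrite
    // ends up with one clause per matching term).
    static List<String> expandPrefix(SortedSet<String> termDictionary,
                                     String prefix, int maxClauses) {
        List<String> matches = new ArrayList<>();
        // In a sorted dictionary, terms sharing a prefix are contiguous,
        // starting at the first term >= prefix.
        for (String term : termDictionary.tailSet(prefix)) {
            if (!term.startsWith(prefix)) {
                break;
            }
            if (matches.size() == maxClauses) {
                throw new IllegalStateException(
                    "prefix \"" + prefix + "\" expands past " + maxClauses + " terms");
            }
            matches.add(term);
        }
        return matches;
    }

    public static void main(String[] args) {
        SortedSet<String> terms = new TreeSet<>();
        terms.add("foo");
        // Hypothetical high-cardinality tokens: a word fused with a unique suffix.
        for (int i = 0; i < 2000; i++) {
            terms.add("bar_" + i);
        }
        try {
            expandPrefix(terms, "bar", MAX_CLAUSES);
        } catch (IllegalStateException e) {
            System.out.println("Limit hit: " + e.getMessage());
        }
        // An exact term needs no enumeration at all.
        System.out.println("Exact lookup for foo: " + terms.contains("foo"));
    }
}
```

The cost of the prefix grows with the number of distinct terms that share it, which is why tokens with unique suffixes make the expansion blow up while an exact term lookup stays cheap.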

Re: SpanMultiTermQueryWrapper with PrefixQuery hitting num clause limit

2024-03-28 Thread Yixun Xu
That makes sense. Thank you!

On Thu, Mar 28, 2024 at 12:58 PM Robert Muir wrote:
> using spans and wildcards together is asking for trouble, you will hit
> limits, it is not efficient by definition.
>
> I'd recommend to change your indexing so that your queries are fast
> and you aren't using wi

Re: Does Lucene Vector Search support int8 and / or even binary?

2024-03-28 Thread Shubham Chaudhary
> btw, what about native binary embedding quantization support by Lucene?

This sounds like a good idea to have in Lucene. Would this require another VectorField / VectorsFormat? Based on the current implementation, one way would be to use another KNN format or alternatively maybe a better approa
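For readers unfamiliar with the term, binary embedding quantization can be sketched in plain Java as follows. This is an illustration of the general idea only, assuming sign-based binarization and Hamming distance. It is not Lucene's implementation or API, and the class and method names are hypothetical.

```java
class BinaryQuantizationSketch {
    // Packs one bit per dimension into longs: bit i is set when v[i] > 0.
    // Sign-based thresholding is an assumption made for this sketch.
    static long[] binarize(float[] v) {
        long[] bits = new long[(v.length + 63) / 64];
        for (int i = 0; i < v.length; i++) {
            if (v[i] > 0f) {
                bits[i >>> 6] |= 1L << (i & 63);
            }
        }
        return bits;
    }

    // Hamming distance: the number of bit positions where the codes differ.
    static int hamming(long[] a, long[] b) {
        if (a.length != b.length) {
            throw new IllegalArgumentException("codes must have the same length");
        }
        int distance = 0;
        for (int i = 0; i < a.length; i++) {
            distance += Long.bitCount(a[i] ^ b[i]);
        }
        return distance;
    }

    public static void main(String[] args) {
        long[] a = binarize(new float[] {1f, -1f, 0.5f, -0.2f});
        long[] b = binarize(new float[] {1f, 1f, -1f, -1f});
        System.out.println("Hamming distance: " + hamming(a, b));
    }
}
```

Each dimension shrinks from a 32-bit float to a single bit, and the distance computation reduces to XOR plus a population count.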