Re: Optimizing term-occurrence counting (code included)

2020-07-23 Thread Ali Akhtar
I'm new to lucene so I'm not sure what the best way of speeding this up in Lucene is, but I've previously used https://github.com/npgall/cqengine for similar stuff. It provided really good performance, especially if you're just counting things. On Fri, Jul 24, 2020 at 6:55 AM Alex K wrote: > Hi

Optimizing term-occurrence counting (code included)

2020-07-23 Thread Alex K
Hi all, I am working on a query that takes a set of terms, finds all documents containing at least one of those terms, computes a subset of candidate docs with the most matching terms, and applies a user-provided scoring function to each of the candidate docs Simple example of the query: - query

Re: ANN search current state

2020-07-23 Thread Tomoko Uchida
Hi Julie, thank you for working on LUCENE-9322 (I also love the issue). I think it would be great if we can try some preliminary aknn implementations (both of clustering-based and graph-based approach) on LUCENE-9322, to explore working unified API and Codec/Format for vectors; for now, I still hav