Question about high-performance methods to drive TFIDF queries.

2018-01-07 Thread 李文海
Hi, all. Recently, we have been running experiments on Lucene using TF-IDF. We want to retrieve the similar documents from the corpus, that is, every document (d) whose similarity to the given query (q) is no less than a threshold. We use the following scoring function: sum(tf(t,d) * i
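
The scoring function in the snippet above is cut off, so the poster's exact formula is not known. As a rough illustration of the thresholded retrieval being described, here is a minimal Python sketch that assumes a plain sum of tf(t,d) * idf(t) over the query terms, with idf(t) = ln(N / df(t)) + 1. The idf form, the function names, and the toy corpus are all assumptions made for illustration; this is neither the poster's function nor Lucene's scoring.

```python
import math
from collections import Counter


def tfidf_scores(query_terms, docs):
    """Score each tokenized document as sum over query terms of tf(t, d) * idf(t).

    Assumption: idf(t) = ln(N / df(t)) + 1, chosen only for illustration.
    """
    n_docs = len(docs)
    # Document frequency: how many documents contain each term at least once.
    df = Counter()
    for tokens in docs:
        df.update(set(tokens))

    scores = []
    for tokens in docs:
        tf = Counter(tokens)  # raw term counts in this document
        score = 0.0
        for term in query_terms:
            if df[term] == 0:
                continue  # term absent from the corpus contributes nothing
            idf = math.log(n_docs / df[term]) + 1.0
            score += tf[term] * idf
        scores.append(score)
    return scores


def docs_above_threshold(query_terms, docs, threshold):
    """Return (doc_index, score) pairs whose score is no less than threshold."""
    scores = tfidf_scores(query_terms, docs)
    return [(i, s) for i, s in enumerate(scores) if s >= threshold]


# Hypothetical toy corpus, already tokenized.
docs = [
    "lucene index search".split(),
    "lucene lucene scoring".split(),
    "javafx desktop tool".split(),
]
hits = docs_above_threshold(["lucene", "scoring"], docs, threshold=2.0)
```

This brute-force version scores every document, which is exactly the cost a high-performance approach would try to avoid; it is only meant to pin down what "similarity no less than a threshold" means under the assumed formula.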

[ANNOUNCE] Luke 7.2.0 released

2018-01-07 Thread Tomoko Uchida
Download the release zip here: https://github.com/DmitryKey/luke/releases/tag/luke-7.2.0 This release upgrades to Lucene 7.2.0. A revised Luke atop JavaFX can also be downloaded here: https://github.com/DmitryKey/luke/releases/tag/luke-javafx-7.2.0 Luke atop JavaFX is distributed under the Apache License v2. We pla