optimizer for n-gram PhraseQuery
--------------------------------
Key: LUCENE-3426
URL: https://issues.apache.org/jira/browse/LUCENE-3426
Project: Lucene - Java
Issue Type: Improvement
Components: core/search
Reporter: Koji Sekiguchi
Priority: Trivial
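If a 2-gram tokenizer is used and the query string has length 4, for example
q="ABCD", QueryParser generates (when autoGeneratePhraseQueries is true)
PhraseQuery("AB BC CD") with slop 0. This can be optimized to
PhraseQuery("AB CD") with appropriate positions (AB at position 0, CD at
position 2): once those two terms match at that distance, the middle term BC
is implied and does not need to be checked.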
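As a rough illustration of the term selection, here is a minimal sketch in plain Java. The class NGramPhraseSketch and the method selectGrams are hypothetical names made up for this example and are not part of Lucene. It assumes the index emits one n-gram per character offset with a position increment of 1, and that each character is a single Java char. It keeps every n-th gram plus the final gram, so that every character of the query is still covered by some selected term.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper for illustration only; not part of Lucene.
class NGramPhraseSketch {

    // Returns a map of position -> n-gram for the reduced phrase.
    // Assumes the index stores one n-gram per character offset.
    static Map<Integer, String> selectGrams(String query, int n) {
        if (n <= 0 || query.length() < n) {
            throw new IllegalArgumentException("query must contain at least one n-gram");
        }
        Map<Integer, String> grams = new LinkedHashMap<>();
        int last = query.length() - n; // position of the final n-gram

        // Take every n-th gram, so consecutive picks do not overlap.
        for (int pos = 0; pos <= last; pos += n) {
            grams.put(pos, query.substring(pos, pos + n));
        }
        // Always keep the final gram so the tail of the query is covered.
        grams.put(last, query.substring(last, last + n));
        return grams;
    }

    public static void main(String[] args) {
        System.out.println(selectGrams("ABCD", 2));  // {0=AB, 2=CD}
        System.out.println(selectGrams("ABCDE", 2)); // {0=AB, 2=CD, 3=DE}
    }
}
```

For q="ABCDE" the last two selected grams overlap (CD at 2, DE at 3), which is needed because the length is not a multiple of n. In Lucene, the selected terms and positions could presumably be passed to PhraseQuery.add(Term, int position), though where the optimization should be hooked in is left open here.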
The idea came from the Japanese paper "N.M-gram: Implementation of Inverted
Index Using N-gram with Hash Values" by Mikio Hirabayashi, et al. (The main
theme of the paper is different from the idea that I'm using here, though.)