Here it is:
http://lucene.apache.org/java/2_4_1/api/org/apache/lucene/analysis/shingle/ShingleMatrixFilter.html
Otis
--
Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch
- Original Message
> From: Neha Gupta
> To: java-user@lucene.apache.org
> Sent: Thursday, June 18, 2009 1
The contrib/analyzers has several n-gram based tokenization and token
filter options.
On Jun 18, 2009, at 10:15 PM, Neha Gupta wrote:
Hey,
I was wondering if there is a way to read the index and generate n-
grams of
words for a document in lucene? I am quite new to it and am using
pylucen
Yeah, look at the spellcheck component in Solr. They are doing something
similar.
Sameer.
On Thu, Jun 18, 2009 at 7:15 PM, Neha Gupta wrote:
> Hey,
>
> I was wondering if there is a way to read the index and generate n-grams of
> words for a document in lucene? I am quite new to it and am using