Re: n-gram word support

2009-06-19 Thread Otis Gospodnetic
Here it is: http://lucene.apache.org/java/2_4_1/api/org/apache/lucene/analysis/shingle/ShingleMatrixFilter.html Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Neha Gupta > To: java-user@lucene.apache.org > Sent: Thursday, June 18, 2009 1

Re: n-gram word support

2009-06-19 Thread Grant Ingersoll
The contrib/analyzers has several n-gram based tokenization and token filter options. On Jun 18, 2009, at 10:15 PM, Neha Gupta wrote: Hey, I was wondering if there is a way to read the index and generate n- grams of words for a document in lucene? I am quite new to it and am using pylucen

Re: n-gram word support

2009-06-18 Thread Sameer Maggon
Yeah, look at the spellcheck component in Solr. They are doing something similar. Sameer. On Thu, Jun 18, 2009 at 7:15 PM, Neha Gupta wrote: > Hey, > > I was wondering if there is a way to read the index and generate n-grams of > words for a document in lucene? I am quite new to it and am using