Re: n-gram word support

2009-06-22 Thread Neha Gupta
ShingleFilter did the trick. - Neha On Thu, Jun 18, 2009 at 10:23 PM, Neha Gupta wrote: > > There is an n-gram tokenizer, EdgeNGramTokenizer, that may be what you're >> looking for. > > > Looks like this is a n-gram character tokenizer. > > - Neha > >> >> >> - Brian >> >> >

Re: n-gram word support

2009-06-18 Thread Neha Gupta
> There is an n-gram tokenizer, EdgeNGramTokenizer, that may be what you're > looking for. Looks like this is a n-gram character tokenizer. - Neha > > > - Brian > >

Re: n-gram word support

2009-06-18 Thread Brian Golbère
Andi Vajda wrote: On Thu, 18 Jun 2009, Neha Gupta wrote: I was wondering if there is a way to read the index and generate n-grams of words for a document using pylucene? PyLucene just wraps Java Lucene. If there is a way to do this in Java Lucene, then use the same way with PyLucene. To fin

Re: n-gram word support

2009-06-18 Thread Andi Vajda
On Thu, 18 Jun 2009, Neha Gupta wrote: I was wondering if there is a way to read the index and generate n-grams of words for a document using pylucene? PyLucene just wraps Java Lucene. If there is a way to do this in Java Lucene, then use the same way with PyLucene. To find out how to do thi

n-gram word support

2009-06-18 Thread Neha Gupta
Hey, I was wondering if there is a way to read the index and generate n-grams of words for a document using pylucene? Thanks, Neha