ShingleFilter did the trick.
- Neha
On Thu, Jun 18, 2009 at 10:23 PM, Neha Gupta wrote:
>
> There is an n-gram tokenizer, EdgeNGramTokenizer, that may be what you're
>> looking for.
>
>
> Looks like this is a n-gram character tokenizer.
>
> - Neha
>
>>
>>
>> - Brian
>>
>>
>
> There is an n-gram tokenizer, EdgeNGramTokenizer, that may be what you're
> looking for.
Looks like this is a n-gram character tokenizer.
- Neha
>
>
> - Brian
>
>
Andi Vajda wrote:
On Thu, 18 Jun 2009, Neha Gupta wrote:
I was wondering if there is a way to read the index and generate
n-grams of
words for a document using pylucene?
PyLucene just wraps Java Lucene. If there is a way to do this in Java
Lucene, then use the same way with PyLucene.
To fin
On Thu, 18 Jun 2009, Neha Gupta wrote:
I was wondering if there is a way to read the index and generate n-grams of
words for a document using pylucene?
PyLucene just wraps Java Lucene. If there is a way to do this in Java
Lucene, then use the same way with PyLucene.
To find out how to do thi
Hey,
I was wondering if there is a way to read the index and generate n-grams of
words for a document using pylucene?
Thanks,
Neha