Re: PhraseQuery and non-letter characters

2008-12-02 Thread Ng Vinny
Hi Ian, thanks for the suggestion. I was able to write a custom analyzer that returns non-letters as tokens and keeps numeric characters instead of skipping them. This is probably not the best solution, but at least I can have a demo without bugs :-) To save time for others who may ha
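
For readers who want to see the idea in isolation, here is a minimal sketch of the approach described above: punctuation is kept as tokens of its own and digits are kept inside words. It is plain Java and deliberately does not use Lucene's Analyzer or Tokenizer classes; the class name PunctuationTokenSketch and its methods are made up for illustration and are not the analyzer from this post.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;

// Hypothetical stand-alone sketch; not a Lucene Analyzer.
class PunctuationTokenSketch {

    // Runs of letters or digits become one lowercase token. Every other
    // non-whitespace character becomes a single-character token.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder word = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetterOrDigit(c)) {
                word.append(Character.toLowerCase(c));
                continue;
            }
            if (word.length() > 0) {
                tokens.add(word.toString());
                word.setLength(0);
            }
            if (!Character.isWhitespace(c)) {
                tokens.add(String.valueOf(c));
            }
        }
        if (word.length() > 0) {
            tokens.add(word.toString());
        }
        return tokens;
    }

    // True if the phrase tokens appear as a contiguous run in the text tokens.
    static boolean containsPhrase(String text, String phrase) {
        List<String> haystack = tokenize(text);
        List<String> needle = tokenize(phrase);
        return !needle.isEmpty()
                && Collections.indexOfSubList(haystack, needle) >= 0;
    }

    public static void main(String[] args) {
        System.out.println(containsPhrase("information technology 2008", "information technology"));
        System.out.println(containsPhrase("information, technology", "information technology"));
    }
}
```

Because the comma or full stop sits between the two words as a token, the phrase no longer matches across it. The cost is that punctuation ends up in the token stream, so queries have to be tokenized the same way.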

Re: PhraseQuery and non-letter characters

2008-11-28 Thread Ian Lea
I suggest you write your own analyzer that doesn't remove non-letter characters at index time. There might be one out there already, but not that I can think of offhand. Instead of leaving the non-letters in place, you might consider doing something with position increments. I think that would pr
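
To illustrate the position-increment idea, here is a rough sketch in plain Java. It still drops the punctuation, but a token that follows punctuation is placed two positions after the previous one instead of one, so an exact phrase match cannot span the gap. It does not use Lucene's token classes; PositionGapSketch, Token, and the gap size of 2 are assumptions chosen for the example.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-alone sketch; not Lucene's Token or TokenStream.
class PositionGapSketch {

    static final class Token {
        final String term;
        final int position;

        Token(String term, int position) {
            this.term = term;
            this.position = position;
        }
    }

    // Drops punctuation but records it as a gap: a token that follows
    // punctuation advances the position by 2 instead of 1.
    static List<Token> tokenize(String text) {
        List<Token> tokens = new ArrayList<>();
        StringBuilder word = new StringBuilder();
        int position = -1;
        int increment = 1;
        for (int i = 0; i <= text.length(); i++) {
            // A trailing space acts as a sentinel that flushes the last word.
            char c = i < text.length() ? text.charAt(i) : ' ';
            if (Character.isLetterOrDigit(c)) {
                word.append(Character.toLowerCase(c));
                continue;
            }
            if (word.length() > 0) {
                position += increment;
                tokens.add(new Token(word.toString(), position));
                word.setLength(0);
                increment = 1;
            }
            if (!Character.isWhitespace(c)) {
                increment = 2; // punctuation seen: leave a hole before the next token
            }
        }
        return tokens;
    }

    // Exact phrase match: terms must be equal and relative positions must agree.
    static boolean containsPhrase(String text, String phrase) {
        List<Token> doc = tokenize(text);
        List<Token> query = tokenize(phrase);
        if (query.isEmpty()) {
            return false;
        }
        for (int start = 0; start + query.size() <= doc.size(); start++) {
            boolean match = true;
            for (int k = 0; k < query.size() && match; k++) {
                Token d = doc.get(start + k);
                Token q = query.get(k);
                int docOffset = d.position - doc.get(start).position;
                int queryOffset = q.position - query.get(0).position;
                match = d.term.equals(q.term) && docOffset == queryOffset;
            }
            if (match) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        System.out.println(containsPhrase("information technology", "information technology"));
        System.out.println(containsPhrase("information. Technology", "information technology"));
    }
}
```

Compared with indexing the punctuation itself, this keeps the index free of punctuation terms. A phrase query that allows slop would still be able to bridge the gap.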

PhraseQuery and non-letter characters

2008-11-28 Thread Ng Vinny
Hi, I'm having an issue with PhraseQuery in which a query for the phrase "information technology" has among its matches the strings "information, technology" and "information. Technology", which should not be considered matches. Both StopAnalyzer and StandardAnalyzer remove non-letter charact