I am not sure about a language-specific analyzer. French has
words separated by an apostrophe. The standard analyzer does not tokenize
words on an apostrophe. Then there is the case of accented letters. It
depends on how you want to handle these. Are you going to store them as is
or c
Thank you :)
It seems to work as I wanted. Now, if I want to index content for
full-text search, would it be better to choose StandardAnalyzer or a
more specific one like EnglishAnalyzer or FrenchAnalyzer (assuming the
content I want to index is only in English or only in French)?
Reg
On Tue, 2011-05-31 at 08:52 +0200, Maciej Klimczuk wrote:
> I did some testing with the 3.1.0 demo on Windows and encountered some strange
> behaviour. I tried to index ~6 small text documents using the demo.
> - First trial took about 18 minutes.
> - Second and third trial took about 2 minutes.