Re: StandardAnalyzer Problem with Apostrophes

2006-11-14 Thread Sarah Hunter
That was my first thought as well, but it looks like APOSTROPHE is already the one that I want. As you can see, from StandardAnalyzer.jj --- TOKEN : { // token patterns // basic word: a sequence of digits & letters ||)+ > // internal ap

StandardAnalyzer Problem with Apostrophes

2006-11-13 Thread Sarah Hunter
Hi there, Any ideas you have about the following would be greatly appreciated. I'd like apostropes to break up a word into two for indexing - ie, the french l'observatoire would be indexed as two separate tokens, l observatoire. My understanding from reading documentation and list archives is tha