On 21 Nov 2005, at 18:54, [EMAIL PROTECTED] wrote:
I'm using a StandardFilter and seeing some strange tokenization.
Here's
the input:
apache.org hosts lucene at apache.org.
Here's the tokens it
outputs:
apache.org
hosts
lucene
at
apacheorg
Is this a bug
that apache.org and apache.org. don't convert to the same token?
Didn't you just report this same issue?
The behavior certainly is not sensible in this case. So I'd call it
a bug, yes. Again, the trailing '.' is the culprit.
Erik
---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]