Robert Muir created LUCENE-5601:
-----------------------------------
Summary: ThaiTokenizer ignores sentenceStart
Key: LUCENE-5601
URL: https://issues.apache.org/jira/browse/LUCENE-5601
Project: Lucene - Core
Issue Type: Bug
Components: modules/analysis
Reporter: Robert Muir
Fix For: 4.8, 5.0
Attachments: LUCENE-5601.patch
This tokenizer segments sentences into words, but doesn't have a test for
multiple sentences.
Since it's not yet released, it would be good to fix this for 4.8 so no user
sees the bug.
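
For illustration only, here is a minimal sketch of the two-level segmentation
described above, written against the JDK's java.text.BreakIterator rather than
Lucene's own classes. It is not the attached patch. It assumes the bug is that
word offsets are reported relative to the current sentence instead of the whole
text, which only shows up once the input has more than one sentence. The class
and method names (SentenceOffsetSketch, wordOffsets, hasLetterOrDigit) are
hypothetical.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

class SentenceOffsetSketch {

    /** Returns {start, end} offsets, relative to the whole text, for every word. */
    static List<int[]> wordOffsets(String text, Locale locale) {
        List<int[]> offsets = new ArrayList<>();
        BreakIterator sentences = BreakIterator.getSentenceInstance(locale);
        BreakIterator words = BreakIterator.getWordInstance(locale);
        sentences.setText(text);

        int sentenceStart = sentences.first();
        for (int sentenceEnd = sentences.next();
             sentenceEnd != BreakIterator.DONE;
             sentenceStart = sentenceEnd, sentenceEnd = sentences.next()) {

            // The word iterator only sees this sentence, so its boundaries
            // are relative to sentenceStart, not to the start of the text.
            String sentence = text.substring(sentenceStart, sentenceEnd);
            words.setText(sentence);

            int wordStart = words.first();
            for (int wordEnd = words.next();
                 wordEnd != BreakIterator.DONE;
                 wordStart = wordEnd, wordEnd = words.next()) {

                // Skip whitespace and punctuation segments.
                if (!hasLetterOrDigit(sentence, wordStart, wordEnd)) {
                    continue;
                }
                // Dropping "sentenceStart +" here would still be correct for
                // the first sentence and wrong for every later one.
                offsets.add(new int[] {sentenceStart + wordStart, sentenceStart + wordEnd});
            }
        }
        return offsets;
    }

    static boolean hasLetterOrDigit(String s, int start, int end) {
        for (int i = start; i < end; i++) {
            if (Character.isLetterOrDigit(s.charAt(i))) {
                return true;
            }
        }
        return false;
    }

    public static void main(String[] args) {
        String text = "One two. Three four.";
        for (int[] o : wordOffsets(text, Locale.ENGLISH)) {
            System.out.println(o[0] + "-" + o[1] + " " + text.substring(o[0], o[1]));
        }
    }
}
```

A single-sentence test cannot catch a missing sentenceStart, because the
sentence then begins at offset 0 and both calculations agree. That is why a
multi-sentence test is the useful regression check here.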