https://bugs.documentfoundation.org/show_bug.cgi?id=95159

Samphan Raruenrom <[email protected]> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Severity|minor                       |trivial

--- Comment #8 from Samphan Raruenrom <[email protected]> ---
LibreOffice relies on ICU to break Thai words. ICU uses a greedy
dictionary-based longest-matching Thai word-segmentation algorithm, e.g. it
stops when found the first possible segmentation "การก|ระ|ทำ" (การก happens to
really be a valid Thai word).

To fix this issue, one needs to implement a slightly better maximal-matching
algorithm in ICU. It will still fail in some other rarer cases anyway.

BTW, this kind of problem happens very rarely in real-world Thai because the
ambiguous sequences are rare and must be at the end of a line. I never hear a
Thai complain about this.

-- 
You are receiving this mail because:
You are the assignee for the bug.
_______________________________________________
Libreoffice-bugs mailing list
[email protected]
https://lists.freedesktop.org/mailman/listinfo/libreoffice-bugs

Reply via email to