alexeymin added a comment.
Regarding this - `I don't know if it is really chinese look foreign enough to me anyway.` Some lines of text in your test script surely look like Japanese Hiragana to me, especially this one (and tests related to this) echo "otto东到宛平路anna"> "終末なにしてますか?忙しいですか?救ってもらっていいですか? EP01 太阳の倾いたこの世界で -broken chronograph-.txt" Google translate confirms :) But do your ranges include that characters? This answer on stackoverflow <https://stackoverflow.com/a/30200250/2323699> says that there are also other ranges for Hiragana, Katakana, etc... as @cfeck already said. Does it pass the test for you? REPOSITORY R293 Baloo REVISION DETAIL https://phabricator.kde.org/D11552 To: michaelh, #baloo, #frameworks, lbeltrame, bruns Cc: alexeymin, cfeck, ashaposhnikov, michaelh, astippich, spoorun, nicolasfella, ngraham