https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=243229
Bug ID: 243229
Summary: awk in base system does not work with UTF-8 strings correctly
Product: Base System
Version: 12.1-RELEASE
Hardware: Any
OS: Any
Status: New
Severity: Affects Some People
Priority: ---
Component: misc
Assignee: b...@freebsd.org
Reporter: s...@ulstu.ru

I tried using the length() function with UTF-8 strings, and it produces an
incorrect result. The function treats the string as a sequence of bytes
rather than characters, so the reported length is twice the number of
characters in the string.

Steps to reproduce (with LANG=ru_RU.UTF-8):

    echo 'Привет' | awk '{print length($1)}'

With lang/gawk, the length of the same UTF-8 string is calculated correctly.

Are you planning to update awk in the base system to support UTF-8 strings
in the near future?
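
The doubling is consistent with length() counting bytes: each Cyrillic letter in 'Привет' takes two bytes in UTF-8. A minimal sketch of that arithmetic follows. It is written in Python rather than awk, purely as an illustration, and says nothing about how either awk implementation is coded; the variable names are made up for the example.

```python
# Illustration only: compare the number of characters in a string with the
# number of bytes in its UTF-8 encoding.
text = "Привет"

char_count = len(text)                  # Unicode code points in the string
byte_count = len(text.encode("utf-8"))  # bytes after UTF-8 encoding

print(char_count)  # 6
print(byte_count)  # 12
```

A byte-oriented length() would report the second number, a character-aware one the first.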