2023年11月29日(水) 20:42 Hans Henrik Bergan <divinit...@gmail.com>:
>
> i think Shift_JIS can also be automatically converted to UTF-8, does
> this seem right?
> https://github.com/divinity76/php2utf8/commit/6e08c4c16312961170cce821195816a8d24e23f6
>

Sorry if it's harsh, not right.
Shift_JIS is very ambiguous, What will we do if SJIS-2004 or SJIS-win comes?
How do we guess(detect) SJIS-2004, SJIS-win and SJIS-mac?

<?php
// Comparison table from https://uic.io/en/charset/compare/shiftjis2004/cp932/
var_dump("\xfc\x40"); // What is 0xFC40, 騱(SJIS-2004) or 髜(SJIS-win)?
?>

In the first place, We **should not** change PHP script character encoding.
In addition to this, We have to think about various things.
This is not just a Japanese problem.

-- 
---------------------------
Yuya Hamada (tekimen)
- https://tekitoh-memdhoi.info
- https://github.com/youkidearitai
-----------------------------

--
PHP Internals - PHP Runtime Development Mailing List
To unsubscribe, visit: https://www.php.net/unsub.php

Reply via email to