|
php.net | support | documentation | report a bug | advanced search | search howto | statistics | random bug | login |
[2003-06-10 02:00 UTC] richard at enfour dot co dot jp
It maybe elsewhere but I found a case where UTF-8 to SJIS mb_convert_encoding mashes a Japanese text string. The string is the kanji for "souseki" Unicode: U8e2a+8de1 In SJIS it should be: E748+90D5 but gets mashed. EUC works... PatchesPull RequestsHistoryAllCommentsChangesGit/SVN commits
|
|||||||||||||||||||||||||||
Copyright © 2001-2025 The PHP GroupAll rights reserved. |
Last updated: Thu Dec 04 21:00:02 2025 UTC |
I tested by a tiny script using PHP 4.3.3RC1 on Windows2000, <?php $str_ucs2 = sprintf("%c%c%c%c",0x8e,0x2a,0x8d,0xe1); echo mb_convert_encoding($str_ucs2, "SJIS", "UCS2"); ?> the output byte code is E748+90D5, as you are expecting. I think it works fine.I tested also on Linux using PHP 4.3.3RC1. <?php $str_ucs2 = sprintf("%c%c%c%c",0x8e,0x2a,0x8d,0xe1); echo mb_convert_encoding($str_ucs2, "SJIS", "UCS2"); ?> the output byte code is E748+90D5, as you are expecting. I think it works fine.