|
php.net | support | documentation | report a bug | advanced search | search howto | statistics | random bug | login |
[2007-07-25 12:10 UTC] mcorne at yahoo dot com
Description:
------------
mb_substr("\x44\xCC\x87", 0, PHP_INT_MAX, 'UTF-8') only captures the first character on linux 64-bit instead of returning the whole string.
Note that this works fine on Windows XP and Linux 32-bit.
Reproduce code:
---------------
function substring($string, $length)
{
$substr = mb_substr($string, 0, $length , 'UTF-8');
$length = strlen($substr);
$chars = $length? unpack("C{$length}chars", $substr) : array();
$decs = array_map('dechex', $chars);
return array($substr, $decs);
}
$test['string'] = "\x44\xCC\x87";
$test['utf8'] = '\x44\xCC\x87';
$test['unicode'] = '\u0044\u0307';
$test['PHP_INT_MAX'] = PHP_INT_MAX;
$test['php_int_max'] = substring($test['string'], PHP_INT_MAX);
$test['9999'] = substring($test['string'], 9999);
print_r($test);
Expected result:
----------------
Array
(
[string] => Ḋ
[utf8] => \x44\xCC\x87
[unicode] => \u0044\u0307
[PHP_INT_MAX] => 2147483647
[php_int_max] => Array
(
[0] => Ḋ
[1] => Array
(
[chars1] => 44
[chars2] => cc
[chars3] => 87
)
)
[9999] => Array
(
[0] => Ḋ
[1] => Array
(
[chars1] => 44
[chars2] => cc
[chars3] => 87
)
)
)
Actual result:
--------------
Array
(
[string] => Ḋ
[utf8] => \x44\xCC\x87
[unicode] => \u0044\u0307
[PHP_INT_MAX] => 2147483647
[php_int_max] => Array
(
[0] => D
[1] => Array
(
[chars1] => 44
)
)
[9999] => Array
(
[0] => Ḋ
[1] => Array
(
[chars1] => 44
[chars2] => cc
[chars3] => 87
)
)
)
PatchesPull RequestsHistoryAllCommentsChangesGit/SVN commits
|
|||||||||||||||||||||||||||||||||||||
Copyright © 2001-2025 The PHP GroupAll rights reserved. |
Last updated: Tue Oct 28 14:00:01 2025 UTC |
I reproduced the same issue with mb_substr() on my Athlon 64/x2 machine. I believe that substr() is also has the same 64bit issue. It is a sample script (tested on my x86/64 Ubuntu Linux, Athlon 64x2) <?php echo substr("\x44\xCC\x87", 0, 1024); // output: 0x44,0xcc,0x87 echo substr("\x44\xCC\x87", 0, PHP_INT_MAX); // output: 0x44,0xcc echo substr("\x44\xCC\x87", 0, PHP_INT_MAX-1); // output: 0x44 echo substr("\x44\xCC\x87", 0, PHP_INT_MAX-2); // output: ?> I think PHP itself is not 64bit compatible. Why didn't you submit a bug report for substr() ?