| 
        php.net | support | documentation | report a bug | advanced search | search howto | statistics | random bug | login | 
  [2008-11-02 22:46 UTC] hostmaster at uuism dot net
 Description:
------------
When I run test scripts ext/standard/tests/strings/htmlentitiesXX.phpt where XX is 10, 11, and 13, the tests fail when mbstring.so is loaded and pass when mbstring.so is not loaded.
If I change the test script INI value for mbstring.internal_encoding to the default_charset, the test will pass.  Also, they will pass of the $charset is included in the command htmlentities.
I am using the php.ini-recommended with extension_dir = "/usr/local/src/php-5.2.6/modules"  I placed the php.ini in directory /usr/local/src/php-5.2.6/sapi/cli so sapi/cli/php would use it.
All the php.ini values for mbstring are commented out:
# grep mbstring /usr/local/src/php-5.2.6/sapi/cli/php.ini                                               ;extension=php_mbstring.dll
[mbstring]
;mbstring.language = Japanese
;mbstring.internal_encoding = EUC-JP
;mbstring.http_input = auto
;mbstring.http_output = SJIS
; mbstring.internal_encoding setting. Input chars are
;mbstring.encoding_translation = Off
;mbstring.detect_order = auto
;mbstring.substitute_character = none;
; overload(replace) single byte functions by mbstring functions.
;mbstring.func_overload = 0
;mbstring.strict_encoding = Off
; With mbstring support this will automatically be converted into the encoding
; given by corresponding encode setting. When empty mbstring.internal_encoding
Here is the run without mbstring.so loaded:
]# TEST_PHP_EXECUTABLE=sapi/cli/php sapi/cli/php ./run-tests.php ext/standard/tests/strings/htmlentities*phpt
=====================================================================
PHP         : sapi/cli/php
PHP_SAPI    : cli
PHP_VERSION : 5.2.6
ZEND_VERSION: 2.2.0
PHP_OS      : Linux - Linux host.uuserver.net 2.6.20.1 #16 SMP Thu Nov 8 14:19:44 EST 2007 i686
INI actual  : /usr/local/src/php-5.2.6/sapi/cli/php.ini
More .INIs  : /etc/php.d/mysql.ini,/etc/php.d/mysqli.ini
CWD         : /usr/local/src/php-5.2.6
Extra dirs  :
=====================================================================
Running selected tests.
[snip]
PASS htmlentities() test 10 (default_charset / cp1252) [ext/standard/tests/strings/htmlentities10.phpt]
PASS htmlentities() test 11 (default_charset / ISO-8859-15) [ext/standard/tests/strings/htmlentities11.phpt]
PASS htmlentities() test 12 (default_charset / ISO-8859-1) [ext/standard/tests/strings/htmlentities12.phpt]
PASS htmlentities() test 13 (default_charset / EUC-JP) [ext/standard/tests/strings/htmlentities13.phpt]
[snip]
=====================================================================
Number of tests :   20                14
Tests skipped   :    6 ( 30.0%) --------
Tests warned    :    0 (  0.0%) (  0.0%)
Tests failed    :    0 (  0.0%) (  0.0%)
Tests passed    :   14 ( 70.0%) (100.0%)
---------------------------------------------------------------------
Time taken      :    1 seconds
=====================================================================
Here is the run WITH mbstring.so loaded:
# TEST_PHP_EXECUTABLE=sapi/cli/php sapi/cli/php ./run-tests.php ext/standard/tests/strings/htmlentities*phpt
=====================================================================
PHP         : sapi/cli/php
PHP_SAPI    : cli
PHP_VERSION : 5.2.6
ZEND_VERSION: 2.2.0
PHP_OS      : Linux - Linux host.uuserver.net 2.6.20.1 #16 SMP Thu Nov 8 14:19:44 EST 2007 i686
INI actual  : /usr/local/src/php-5.2.6/sapi/cli/php.ini
More .INIs  : /etc/php.d/mbstring.ini,/etc/php.d/mysql.ini,/etc/php.d/mysqli.ini
CWD         : /usr/local/src/php-5.2.6
Extra dirs  :
=====================================================================
Running selected tests.
[snip]
FAIL htmlentities() test 10 (default_charset / cp1252) [ext/standard/tests/strings/htmlentities10.phpt]
FAIL htmlentities() test 11 (default_charset / ISO-8859-15) [ext/standard/tests/strings/htmlentities11.phpt]
PASS htmlentities() test 12 (default_charset / ISO-8859-1) [ext/standard/tests/strings/htmlentities12.phpt]
FAIL htmlentities() test 13 (default_charset / EUC-JP) [ext/standard/tests/strings/htmlentities13.phpt]
[snip]
=====================================================================
Number of tests :   20                20
Tests skipped   :    0 (  0.0%) --------
Tests warned    :    3 ( 15.0%) ( 15.0%)
Tests failed    :    3 ( 15.0%) ( 15.0%)
Tests passed    :   14 ( 70.0%) ( 70.0%)
---------------------------------------------------------------------
Time taken      :    2 seconds
=====================================================================
=====================================================================
FAILED TEST SUMMARY
---------------------------------------------------------------------
[snip]
htmlentities() test 10 (default_charset / cp1252) [ext/standard/tests/strings/htmlentities10.phpt]
htmlentities() test 11 (default_charset / ISO-8859-15) [ext/standard/tests/strings/htmlentities11.phpt]
htmlentities() test 13 (default_charset / EUC-JP) [ext/standard/tests/strings/htmlentities13.phpt]
[snip]
=====================================================================
Thanks for looking into this problem.
Jim
Reproduce code:
---------------
<?php
        ini_set('mbstring.internal_encoding','cp1252');
        ini_set('default_charset','cp1252');
        print ini_get('default_charset')."\n";
        var_dump(htmlentities("\x82\x86\x99\x9f", ENT_QUOTES, ''));
        var_dump(htmlentities("\x82\x86\x99\x9f", ENT_QUOTES, 'cp1252'));
        ini_set('mbstring.internal_encoding','pass');
        ini_set('default_charset','cp1252');
        print ini_get('default_charset')."\n";
        var_dump(htmlentities("\x82\x86\x99\x9f", ENT_QUOTES, ''));
        var_dump(htmlentities("\x82\x86\x99\x9f", ENT_QUOTES, 'cp1252'));
?>
Expected result:
----------------
cp1252
string(28) "‚†™Ÿ"
string(28) "‚†™Ÿ"
cp1252
string(28) "‚†™Ÿ"
string(28) "‚†™Ÿ"
Actual result:
--------------
cp1252
string(28) "‚†™Ÿ"
string(28) "‚†™Ÿ"
cp1252
string(4) ""
string(28) "‚†™Ÿ"
PatchesPull RequestsHistoryAllCommentsChangesGit/SVN commits             
             | 
    |||||||||||||||||||||||||||
            
                 
                Copyright © 2001-2025 The PHP GroupAll rights reserved.  | 
        Last updated: Tue Nov 04 15:00:01 2025 UTC | 
jani, if they fail for everyone, should they have a --XFAIL-- section frankly, based on pure logic, i would not expect this statement to use the default_charset htmlentities("\x82\x86\x99\x9f", ENT_QUOTES, ''); since '' has no meaning as a value for charset, htmlentites should use the same default as when default_charset='' which is ISO-8859-1 on the other hand, this statement should use the default_charset htmlentities("\x82\x86\x99\x9f", ENT_QUOTES); since the third parameter is optional. jimmoriyoshi, Okay. I understand. Isn't it also an over-assumption to assume that the third argument charset equals default_charset in this statement? htmlentities("\x82\x86\x99\x9f", ENT_QUOTES, ''); Jim