Bug #49861 htmlspecialchars() returns incorrect results for Unicode characters outside BMP
Submitted: 2009-10-13 09:00 UTC Modified: 2009-10-13 09:07 UTC
From: moriyoshi@php.net Assigned:
Status: Closed Package: Strings related
PHP Version: 5.3.0 OS: *
Private report: No CVE-ID: None

 [2009-10-13 09:00 UTC] moriyoshi@php.net
Description:
------------
htmlspecialchars() / htmlentities() return incorrect results when the input
contains a Unicode character whose code point lies outside the BMP.

Confirmed with 5.3.0, 5.3SVN.


Reproduce code:
---------------
<?php
// U+1003C
var_dump(bin2hex(htmlspecialchars("\xf0\x90\x80\xbc", ENT_QUOTES, 'UTF-8')));
?>

Expected result:
----------------
string(8) "f09080bc"


Actual result:
--------------
string(8) "266c743b"
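
Note on the actual result: "266c743b" is the hex encoding of "&lt;", so the
4-byte character was escaped as if it were "<" (U+003C). The report does not
state the cause. One reading, offered only as an assumption, is that the
decoded code point was cut down to its low 16 bits before the entity lookup.
The sketch below shows the arithmetic; decode_4byte_utf8() is a hypothetical
helper written for this illustration and is not part of PHP.

```php
<?php
// Hypothetical helper (not a PHP built-in): decode one 4-byte UTF-8
// sequence into its Unicode code point.
function decode_4byte_utf8($s)
{
    // unpack() returns 1-based keys; array_values() makes them 0-based.
    $b = array_values(unpack('C4', $s));
    return (($b[0] & 0x07) << 18)
         | (($b[1] & 0x3f) << 12)
         | (($b[2] & 0x3f) << 6)
         |  ($b[3] & 0x3f);
}

$cp = decode_4byte_utf8("\xf0\x90\x80\xbc");

// Assumption: the value is squeezed into 16 bits, losing the upper bits.
$truncated = $cp & 0xffff;

printf("U+%X -> 0x%04X (%s)\n", $cp, $truncated, chr($truncated));
// prints "U+1003C -> 0x003C (<)"
```

Under that assumption the truncated value is 0x3C, the code for "<", which
would be escaped to "&lt;" and matches the output shown above.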


 [2009-10-13 09:07 UTC] moriyoshi@php.net
Just a reminder: this was already fixed along with bug #49785 in SVN.