php.net |  support |  documentation |  report a bug |  advanced search |  search howto |  statistics |  random bug |  login
Bug #49861 htmlspecialchars() returns incorrect results for Unicode characters outside BMP
Submitted: 2009-10-13 09:00 UTC Modified: 2009-10-13 09:07 UTC
From: moriyoshi@php.net Assigned:
Status: Closed Package: Strings related
PHP Version: 5.3.0 OS: *
Private report: No CVE-ID: None
View Add Comment Developer Edit
Welcome! If you don't have a Git account, you can't do anything here.
You can add a comment by following this link or if you reported this bug, you can edit this bug over here.
(description)
Block user comment
Status: Assign to:
Package:
Bug Type:
Summary:
From: moriyoshi@php.net
New email:
PHP Version: OS:

 

 [2009-10-13 09:00 UTC] moriyoshi@php.net
Description:
------------
htmlspecialchars() / htmlentities() returns incorrect results when an 
Unicode character whose codepoint is outside the range of BMP.

Confirmed with 5.3.0, 5.3SVN.


Reproduce code:
---------------
<?php
// U+1003C
var_dump(bin2hex(htmlspecialchars("\xf0\x90\x80\xbc", ENT_QUOTES, 'UTF-8'));
?>

Expected result:
----------------
string(8) "f09080bc"


Actual result:
--------------
string(8) "266c743b"


Patches

Add a Patch

Pull Requests

Add a Pull Request

History

AllCommentsChangesGit/SVN commitsRelated reports
 [2009-10-13 09:07 UTC] moriyoshi@php.net
Just a reminder: this was already fixed along with bug #49785 in SVN.
 
PHP Copyright © 2001-2024 The PHP Group
All rights reserved.
Last updated: Fri Apr 19 11:01:28 2024 UTC