|  support |  documentation |  report a bug |  advanced search |  search howto |  statistics |  random bug |  login
Bug #68846 False detection of CJK Unified Ideographs Extension E
Submitted: 2015-01-16 14:26 UTC Modified: 2015-03-09 06:48 UTC
From: masakielastic at gmail dot com Assigned: stas (profile)
Status: Closed Package: mbstring related
PHP Version: 5.6.4 OS: Mac OS X
Private report: No CVE-ID: None
View Add Comment Developer Edit
Welcome! If you don't have a Git account, you can't do anything here.
You can add a comment by following this link or if you reported this bug, you can edit this bug over here.
Block user comment
Status: Assign to:
Bug Type:
From: masakielastic at gmail dot com
New email:
PHP Version: OS:


 [2015-01-16 14:26 UTC] masakielastic at gmail dot com
mb_check_encoding return false if the encoding is GB18030 and the string contains CJK Unified Ideographs Extension E (U+2B820 .. U+2CEA1) which Unicode Version 8.0 include.

Test script:
// U+2B864
$str = mb_convert_encoding("\xF0\xAB\xA1\xA4", 'GB18030', 'UTF-8');
    mb_check_encoding($str, 'GB18030')

Expected result:

Actual result:


Add a Patch

Pull Requests

Pull requests:

Add a Pull Request


AllCommentsChangesGit/SVN commitsRelated reports
 [2015-02-18 15:26 UTC] masakielastic at gmail dot com
I created pull request.
 [2015-03-09 06:48 UTC]
-Status: Open +Status: Closed -Assigned To: +Assigned To: stas
 [2015-03-09 06:48 UTC]
The fix for this bug has been committed.

Snapshots of the sources are packaged every three hours; this change
will be in the next snapshot. You can grab the snapshot at

 For Windows:
Thank you for the report, and for helping us make PHP better.

PHP Copyright © 2001-2024 The PHP Group
All rights reserved.
Last updated: Mon Apr 15 03:01:28 2024 UTC