|  support |  documentation |  report a bug |  advanced search |  search howto |  statistics |  random bug |  login
Bug #42528 Out of "char"(8-bit) range value doesn't roll back, with uni-code ON.
Submitted: 2007-09-03 11:53 UTC Modified: 2009-08-25 15:56 UTC
From: mahesh dot vemula at in dot ibm dot com Assigned:
Status: Not a bug Package: *Unicode Issues
PHP Version: 6CVS-2007-09-03 (CVS) OS: RHEL4
Private report: No CVE-ID: None
View Add Comment Developer Edit
Welcome! If you don't have a Git account, you can't do anything here.
You can add a comment by following this link or if you reported this bug, you can edit this bug over here.
Block user comment
Status: Assign to:
Bug Type:
From: mahesh dot vemula at in dot ibm dot com
New email:
PHP Version: OS:


 [2007-09-03 11:53 UTC] mahesh dot vemula at in dot ibm dot com
The character(8-bit) range of value does not roll back to the corresponding minimum value, once the value reaches beyond the range of a char(8-bit) that it can hold, when Unicode is ON.

Reproduce code:
var_dump( decbin( ord(chr(255)) ) );
var_dump( decbin( ord(chr(256)) ) );
var_dump( decbin( ord(chr(257)) ) );

Expected result:
unicode(8) "11111111"
unicode(1) "0"
unicode(1) "1"

Actual result:
unicode(8) "11111111"
unicode(9) "100000000"
unicode(9) "100000001"


Add a Patch

Pull Requests

Add a Pull Request


AllCommentsChangesGit/SVN commitsRelated reports
 [2007-09-07 12:30 UTC] mahesh dot vemula at in dot ibm dot com
Samething is happening for strncasecmp() function also.
 [2009-08-25 15:56 UTC]
Thank you for your bug report.

In unicode, characters and bytes do not map one-on-one. That means that the codepoint for a character may be bigger than 256. The ord() function returns the numerical value for the codepoint, which may be bigger than 256. It should not wrap around as you describe.
PHP Copyright © 2001-2023 The PHP Group
All rights reserved.
Last updated: Mon Dec 04 01:01:27 2023 UTC