|  support |  documentation |  report a bug |  advanced search |  search howto |  statistics |  random bug |  login
Bug #42528 Out of "char"(8-bit) range value doesn't roll back, with uni-code ON.
Submitted: 2007-09-03 11:53 UTC Modified: 2009-08-25 15:56 UTC
From: mahesh dot vemula at in dot ibm dot com Assigned:
Status: Not a bug Package: *Unicode Issues
PHP Version: 6CVS-2007-09-03 (CVS) OS: RHEL4
Private report: No CVE-ID: None
 [2007-09-03 11:53 UTC] mahesh dot vemula at in dot ibm dot com
The character(8-bit) range of value does not roll back to the corresponding minimum value, once the value reaches beyond the range of a char(8-bit) that it can hold, when Unicode is ON.

Reproduce code:
var_dump( decbin( ord(chr(255)) ) );
var_dump( decbin( ord(chr(256)) ) );
var_dump( decbin( ord(chr(257)) ) );

Expected result:
unicode(8) "11111111"
unicode(1) "0"
unicode(1) "1"

Actual result:
unicode(8) "11111111"
unicode(9) "100000000"
unicode(9) "100000001"


Add a Patch

Pull Requests

Add a Pull Request


AllCommentsChangesGit/SVN commitsRelated reports
 [2007-09-07 12:30 UTC] mahesh dot vemula at in dot ibm dot com
Samething is happening for strncasecmp() function also.
 [2009-08-25 15:56 UTC]
Thank you for your bug report.

In unicode, characters and bytes do not map one-on-one. That means that the codepoint for a character may be bigger than 256. The ord() function returns the numerical value for the codepoint, which may be bigger than 256. It should not wrap around as you describe.
PHP Copyright © 2001-2024 The PHP Group
All rights reserved.
Last updated: Fri Apr 12 22:01:29 2024 UTC