php.net |  support |  documentation |  report a bug |  advanced search |  search howto |  statistics |  random bug |  login
Bug #55837 Glob skips utf8 file names after setlocale en_US.UTF-8
Submitted: 2011-10-03 15:34 UTC Modified: 2011-10-04 10:39 UTC
Votes:1
Avg. Score:5.0 ± 0.0
Reproduced:1 of 1 (100.0%)
Same Version:1 (100.0%)
Same OS:0 (0.0%)
From: php at maisqi dot com Assigned:
Status: Open Package: Filesystem function related
PHP Version: 5.3.8 OS: CentOS release 5.7 (Final)
Private report: No CVE-ID: None
View Add Comment Developer Edit
Welcome! If you don't have a Git account, you can't do anything here.
You can add a comment by following this link or if you reported this bug, you can edit this bug over here.
(description)
Block user comment
Status: Assign to:
Package:
Bug Type:
Summary:
From: php at maisqi dot com
New email:
PHP Version: OS:

 

 [2011-10-03 15:34 UTC] php at maisqi dot com
Description:
------------
The title says it all. I think it will be faster if you run the test I attached.
Gunzip the archive on a Linux Server and run glob-locale-test.php.

You can see it directly on http://www.maisqi.com/glob-locale-test.php

Test script:
---------------
See the script on the attached archive (glob-locale-test.php).
It tries to list the files on the associated folder (that is also on the archive) three times.

1. The first one uses glob with the default locale.
2. The second, also uses glob but after the locale has been set to en_US.UTF-8.
3. The third, uses DirectoryIterator.

Expected result:
----------------
All tries should list 5 files.

Actual result:
--------------
The second try only prints the file with a ascii file name.

Patches

Add a Patch

Pull Requests

Add a Pull Request

History

AllCommentsChangesGit/SVN commitsRelated reports
 [2011-10-03 15:38 UTC] php at maisqi dot com
Please download the archive from:
   http://www.maisqi.com/glob-locale-test.tgz
 [2011-10-04 10:39 UTC] cataphract@php.net
This behavior is common to several filesystem functions (see basename()). If the filenames were indeed encoded in UTF-8, there would be no problems.
 
PHP Copyright © 2001-2019 The PHP Group
All rights reserved.
Last updated: Wed Dec 11 09:01:23 2019 UTC