Doc Bug #65981 Documentation on using IntlBreakIterator for Word Breaks
Submitted: 2013-10-28 05:48 UTC
Modified: 2021-11-11 11:16 UTC
From: robert dot rcampbell at gmail dot com
Assigned:
Status: Open
Package: I18N and L10N related
PHP Version: 5.5.5
OS: Ubuntu 13.10 amd64
Private report: No
CVE-ID: None

 [2013-10-28 05:48 UTC] robert dot rcampbell at gmail dot com
Description:
------------
The IntlBreakIterator documentation doesn't shed much light on how to use the word break iterator. A usage example would be great: I'd like to use it with the latest ICU to process Lao text, inserting zero-width spaces (ZWSP, U+200B) between words so that browsers can word-wrap it correctly.

Ideally, an example would take some Lao text, such as some of the long lines of text on http://laosabbathschool.com/development/ssd/site/, and loop through each word, adding a zero-width-space at each possible break point. 
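
A minimal sketch of what such an example might look like, assuming PHP 7.0+ with the intl extension and an ICU build that ships a Lao dictionary. The helper name `insertZwsp`, the `'lo'` locale default, and the sample string are illustrative assumptions, not part of ext/intl or of the page linked above. It walks the segments returned by `IntlBreakIterator::createWordInstance()` and puts a ZWSP only between two adjacent segments that both contain a letter, so existing spaces and punctuation are left alone.

```php
<?php
// Hypothetical helper (not part of ext/intl): insert U+200B at word
// boundaries that fall between two letter-bearing segments.
function insertZwsp(string $text, string $locale = 'lo'): string
{
    $zwsp = "\xE2\x80\x8B"; // UTF-8 encoding of U+200B ZERO WIDTH SPACE

    $it = IntlBreakIterator::createWordInstance($locale);
    if ($it === null) {
        throw new RuntimeException('Could not create a word break iterator');
    }
    $it->setText($text);

    $out = '';
    $prevHadLetter = false;
    // getPartsIterator() yields the substrings between consecutive boundaries.
    foreach ($it->getPartsIterator() as $part) {
        $hasLetter = preg_match('/\p{L}/u', $part) === 1;
        if ($prevHadLetter && $hasLetter) {
            // Two words touch with nothing in between: add a break opportunity.
            $out .= $zwsp;
        }
        $out .= $part;
        $prevHadLetter = $hasLetter;
    }
    return $out;
}

// Arbitrary Lao sample text; where the breaks land depends on the ICU version.
$lao = "ພາສາລາວສະບາຍດີ";
echo insertZwsp($lao), PHP_EOL;
```

Where the boundaries fall is decided by ICU's dictionary, so results may differ between ICU versions. Stripping the ZWSP characters from the result should always give back the original string.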

This documentation would greatly help all of us building web products for Southeast Asia (Thai, Khmer, Burmese, and Lao all share the same problem) to better support the languages we target.

Thanks!


Test script:
---------------
n/a

Expected result:
----------------
Lao text with a ZWSP inserted at each word boundary

Actual result:
--------------
Not sure how to use the word break iterator


History

 [2021-11-11 11:16 UTC] nikic@php.net
-Package: intl
+Package: I18N and L10N related