|  support |  documentation |  report a bug |  advanced search |  search howto |  statistics |  random bug |  login
Doc Bug #65981 Documentation on using IntlBreakIterator for Word Breaks
Submitted: 2013-10-28 05:48 UTC Modified: 2021-11-11 11:16 UTC
From: robert dot rcampbell at gmail dot com Assigned:
Status: Open Package: I18N and L10N related
PHP Version: 5.5.5 OS: Ubuntu 13.10 amd64
Private report: No CVE-ID: None
View Add Comment Developer Edit
Anyone can comment on a bug. Have a simpler test case? Does it work for you on a different platform? Let us know!
Just going to say 'Me too!'? Don't clutter the database with that please — but make sure to vote on the bug!
Your email address:
Solve the problem:
35 + 20 = ?
Subscribe to this entry?

 [2013-10-28 05:48 UTC] robert dot rcampbell at gmail dot com
IntlBreakIterator documentation doesn't shed much light on how to use the Word Break Iterator. Having an example of usage would be great, as I'd like to use it in combination with the latest ICU to process Lao text inserting zero-width-spaces between words, in order that browsers will word-wrap correctly.

Ideally, an example would take some Lao text, such as some of the long lines of text on, and loop through each word, adding a zero-width-space at each possible break point. 

This documentation will greatly help all of us doing web products in Southeast Asia (Thai, Khmer, Burmese, & Lao all deal with the same problem) better support the languages we target.


Test script:

Expected result:
ZWSP inserted Lao text

Actual result:
Not sure how to use


Add a Patch

Pull Requests

Add a Pull Request


AllCommentsChangesGit/SVN commitsRelated reports
 [2021-11-11 11:16 UTC]
-Package: intl +Package: I18N and L10N related
PHP Copyright © 2001-2022 The PHP Group
All rights reserved.
Last updated: Tue Dec 06 05:05:55 2022 UTC