php.net |  support |  documentation |  report a bug |  advanced search |  search howto |  statistics |  random bug |  login
Doc Bug #65981 Documentation on using IntlBreakIterator for Word Breaks
Submitted: 2013-10-28 05:48 UTC Modified: 2021-11-11 11:16 UTC
From: robert dot rcampbell at gmail dot com Assigned:
Status: Open Package: I18N and L10N related
PHP Version: 5.5.5 OS: Ubuntu 13.10 amd64
Private report: No CVE-ID: None
View Add Comment Developer Edit
Welcome! If you don't have a Git account, you can't do anything here.
You can add a comment by following this link or if you reported this bug, you can edit this bug over here.
(description)
Block user comment
Status: Assign to:
Package:
Bug Type:
Summary:
From: robert dot rcampbell at gmail dot com
New email:
PHP Version: OS:

 

 [2013-10-28 05:48 UTC] robert dot rcampbell at gmail dot com
Description:
------------
IntlBreakIterator documentation doesn't shed much light on how to use the Word Break Iterator. Having an example of usage would be great, as I'd like to use it in combination with the latest ICU to process Lao text inserting zero-width-spaces between words, in order that browsers will word-wrap correctly.

Ideally, an example would take some Lao text, such as some of the long lines of text on http://laosabbathschool.com/development/ssd/site/, and loop through each word, adding a zero-width-space at each possible break point. 

This documentation will greatly help all of us doing web products in Southeast Asia (Thai, Khmer, Burmese, & Lao all deal with the same problem) better support the languages we target.

Thanks!


Test script:
---------------
n/a

Expected result:
----------------
ZWSP inserted Lao text

Actual result:
--------------
Not sure how to use

Patches

Add a Patch

Pull Requests

Add a Pull Request

History

AllCommentsChangesGit/SVN commitsRelated reports
 [2021-11-11 11:16 UTC] nikic@php.net
-Package: intl +Package: I18N and L10N related
 
PHP Copyright © 2001-2024 The PHP Group
All rights reserved.
Last updated: Sat Apr 27 23:01:30 2024 UTC