| » Summary | » License |
|---|---|
| Unicode Normalizer | The BSD License |
| » Current Release | » Bug Summary |
| 1.0.0 (stable) was released on 2007-08-04 by mcorne (Changelog) | No open bugs. Report a new bug to I18N_UnicodeNormalizer |
| » Description | |
| "...Unicode's normalization is the concept of character composition and decomposition. Character composition is the process of combining simpler characters into fewer precomposed characters, such as the n character and the combining ~ character into the single ñ character. Decomposition is the opposite process, breaking precomposed characters back into their component pieces... ...Normalization is important when comparing text strings for searching and sorting (collation)..." [Wikipedia] Performs the 4 normalizations: NFD (Canonical Decomposition); NFC (Canonical Decomposition, followed by Canonical Composition); NFKD (Compatibility Decomposition); NFKC (Compatibility Decomposition, followed by Canonical Composition). Complies with the official Unicode.org regression test. Uses UTF-8 binary strings natively but can normalize a string in any UTF format. Fully tested with PHPUnit. Code coverage is close to 100%. |
|
| » Maintainers | » More Information |
|
|
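
The four normalization forms named in the description can be illustrated with a short sketch. This is not the I18N_UnicodeNormalizer API, which is not shown on this page; it assumes Python's standard `unicodedata` module as a stand-in. The helper names `normalize_all` and `normalize_bytes` are hypothetical and exist only for this example. The second helper mirrors the idea of normalizing a string supplied in another UTF encoding by decoding it, normalizing it, and encoding it again.

```python
import unicodedata

FORMS = ("NFD", "NFC", "NFKD", "NFKC")


def normalize_all(text):
    """Return a dict mapping each normalization form to the normalized text."""
    return {form: unicodedata.normalize(form, text) for form in FORMS}


def normalize_bytes(data, encoding="utf-8", form="NFC"):
    """Decode bytes from the given encoding, normalize, and re-encode.

    Hypothetical helper: it only illustrates handling input in other
    UTF encodings (for example "utf-16-le"), not the package's own approach.
    """
    text = data.decode(encoding)
    return unicodedata.normalize(form, text).encode(encoding)


decomposed = "n\u0303"  # "n" followed by U+0303 COMBINING TILDE
composed = "\u00f1"     # precomposed U+00F1 LATIN SMALL LETTER N WITH TILDE
ligature = "\ufb01"     # U+FB01 LATIN SMALL LIGATURE FI (compatibility character)

canonical = normalize_all(decomposed)
compat = normalize_all(ligature)

# Canonical forms: NFC composes to one code point, NFD keeps two.
print(len(canonical["NFC"]), len(canonical["NFD"]))

# Compatibility forms replace the ligature with "f" + "i";
# canonical forms leave it untouched.
print(compat["NFKC"] == "fi", compat["NFC"] == ligature)

# The two spellings of "ñ" compare equal only after normalization,
# which is why normalization matters for searching and sorting.
print(decomposed == composed)
print(unicodedata.normalize("NFC", decomposed) == composed)
```

Comparing strings only after bringing both to the same form (commonly NFC) is the usual way to make visually identical text match in searches and collation.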