| 12345678910111213141516171819202122232425262728293031323334353637383940414243 |
- The accompanying C source code file "ConvertUTF.c" and the associated header
- file "ConvertUTF.h" provide for conversion between various transformation
- formats of Unicode characters. The following conversions are supported:
- UTF-32 to UTF-16
- UTF-32 to UTF-8
- UTF-16 to UTF-32
- UTF-16 to UTF-8
- UTF-8 to UTF-16
- UTF-8 to UTF-32
- In addition, there is a test harness which runs various tests.
- The files "CVTUTF7.C" and "CVTUTF7.H" are for archival and historical purposes
- only. They have not been updated to Unicode 3.0 or later and should be
- considered obsolescent. "CVTUTF7.C" contains two functions that can convert
- between UCS2 (i.e., the BMP characters only) and UTF-7. Surrogates are
- not supported, the code has not been tested, and should be considered
- unsuitable for general purpose use.
- Please submit any bug reports about these programs here:
- http://www.unicode.org/unicode/reporting.html
- Version 1.0: initial version.
- Version 1.1: corrected some minor problems; added stricter checks.
- Version 1.2: corrected switch statements associated with "extraBytesToRead"
- in 4 & 5 byte cases, in functions for conversion from UTF8.
- Note: formally, the 4 & 5 byte cases are illegal in the latest
- UTF8, but the table and this code has always catered for those,
- cases since at one time they were legal.
- Version 1.3: Updated UTF-8 legality check;
- updated to use UNI_MAX_LEGAL_UTF32 in UTF-32 conversions
- Updated UTF-8 legality tests in harness.c
-
- Last update: October 19, 2004
|