readme.txt 1.5 KB

12345678910111213141516171819202122232425262728293031323334353637383940414243
  1. The accompanying C source code file "ConvertUTF.c" and the associated header
  2. file "ConvertUTF.h" provide for conversion between various transformation
  3. formats of Unicode characters. The following conversions are supported:
  4. UTF-32 to UTF-16
  5. UTF-32 to UTF-8
  6. UTF-16 to UTF-32
  7. UTF-16 to UTF-8
  8. UTF-8 to UTF-16
  9. UTF-8 to UTF-32
  10. In addition, there is a test harness which runs various tests.
  11. The files "CVTUTF7.C" and "CVTUTF7.H" are for archival and historical purposes
  12. only. They have not been updated to Unicode 3.0 or later and should be
  13. considered obsolescent. "CVTUTF7.C" contains two functions that can convert
  14. between UCS2 (i.e., the BMP characters only) and UTF-7. Surrogates are
  15. not supported, the code has not been tested, and should be considered
  16. unsuitable for general purpose use.
  17. Please submit any bug reports about these programs here:
  18. http://www.unicode.org/unicode/reporting.html
  19. Version 1.0: initial version.
  20. Version 1.1: corrected some minor problems; added stricter checks.
  21. Version 1.2: corrected switch statements associated with "extraBytesToRead"
  22. in 4 & 5 byte cases, in functions for conversion from UTF8.
  23. Note: formally, the 4 & 5 byte cases are illegal in the latest
  24. UTF8, but the table and this code has always catered for those,
  25. cases since at one time they were legal.
  26. Version 1.3: Updated UTF-8 legality check;
  27. updated to use UNI_MAX_LEGAL_UTF32 in UTF-32 conversions
  28. Updated UTF-8 legality tests in harness.c
  29. Last update: October 19, 2004