Commit graph

4 commits

Author SHA1 Message Date
Brion Vibber
9857a47c3f Correction to the \r stripping 2004-09-03 06:44:57 +00:00
Brion Vibber
ed46bd50fe Add UtfNormal::cleanUp() function: strips XML-unsafe characters and illegal UTF-8 sequences, then normalizes to form C. 2004-09-03 05:39:30 +00:00
Brion Vibber
53e71c1702 Split the data arrays for form KC, KD to a separate include file and load it on demand.
These are less likely to be used, so save the memory and parse time...
2004-09-02 07:39:06 +00:00
Brion Vibber
a5cfdf0360 Unicode normalization routines.
See: http://www.unicode.org/reports/tr15/
2004-08-29 10:30:23 +00:00