Commit graph

118 commits

Author SHA1 Message Date
Brion Vibber
dafeb1fe3b Work through the NFC substeps with the actual data to make the substep times more meaningful 2004-10-30 10:20:19 +00:00
Brion Vibber
711899c70d Benchmark was pulling the wrong Tokyo article (shorter than the others) 2004-10-30 06:47:36 +00:00
Brion Vibber
959f097c2d Add some sub-functions back to the benchmark 2004-10-30 06:42:39 +00:00
Brion Vibber
de3549d9e9 Optimize inner loops a bit. 2004-10-30 06:02:30 +00:00
Brion Vibber
5cf94de93f Subject UtfNormal::cleanUp() to the same tests as UtfNormal::toNFC() 2004-10-30 05:24:24 +00:00
Brion Vibber
d2e152e6de Munge doc comments. Mark as its own package for docs. 2004-10-28 02:56:13 +00:00
Brion Vibber
6377e82b76 Load form C data on demand; if we are dealing in all-ASCII text we can save some memory and time by not loading it. 2004-10-09 08:08:26 +00:00
Brion Vibber
0824182956 Add support for using ICU to perform normalization, which is much much faster than the PHP code!
Still need to add support for cleanup/verification.
2004-10-07 05:59:10 +00:00
Brion Vibber
bcd1e9e844 Fetch test data for the benchmark 2004-10-07 03:40:06 +00:00
Brion Vibber
f0610d0f67 Doc comments 2004-09-27 02:59:24 +00:00
Brion Vibber
106d11a197 Add remotely fetched files to .cvsignore to reduce screen pollution 2004-09-23 07:29:25 +00:00
Brion Vibber
dd195aa594 Some more phpdoc bits 2004-09-04 09:35:01 +00:00
Antoine Musso
ba2afcd9fa Split files and classes in different packages for phpdocumentor. I probably changed some double quotes to single and used function foo () { shema 2004-09-03 23:00:01 +00:00
Antoine Musso
705bb88da0 Change the way comment are generated so they are compatible with phpdocumentor. Changes already existing files as well. 2004-09-03 22:52:28 +00:00
Brion Vibber
9857a47c3f Correction to the \r stripping 2004-09-03 06:44:57 +00:00
Brion Vibber
ed46bd50fe Add UtfNormal::cleanUp() function: strips XML-unsafe characters and illegal UTF-8 sequences, then normalizes to form C. 2004-09-03 05:39:30 +00:00
Brion Vibber
53e71c1702 Split the data arrays for form KC, KD to a separate include file and load it on demand.
These are less likely to be used, so save the memory and parse time...
2004-09-02 07:39:06 +00:00
Brion Vibber
a5cfdf0360 Unicode normalization routines.
See: http://www.unicode.org/reports/tr15/
2004-08-29 10:30:23 +00:00