> > And most (but not all) Unicode string implementations use UTF-16.
>
> A disadvantage of UTF-16 is that character predicates like
> char-alphabetic? break for characters above U+FFFF.
>
> Since this breakage is rare in practice, programs usually don't care.
> They are only subtly wrong.
I'm not sure what you mean. char-alphabetic? is well defined; the encoding
doesn't matter. There's a good overview of Unicode concepts and
terminology at
http://www.unicode.org/versions/Unicode4.0.0/ch03.pdf.
Received on Sat Mar 24 2007 - 13:08:00 UTC