[r6rs-discuss] Strings

From: MichaelL_at_frogware.com <MichaelL>
Date: Sat Mar 24 13:08:40 2007

> > And most (but not all) Unicode string implementations use UTF-16.
>
> A disadvantage of UTF-16 is that character predicates like
> char-alphabetic? break for characters above U+FFFF.
>
> Since this breakage is rare in practice, programs usually don't care.
> They are only subtly wrong.

I'm not sure what you mean. char-alphabetic? is well defined; the encoding
doesn't matter. There's a good overview of Unicode concepts and
terminology at http://www.unicode.org/versions/Unicode4.0.0/ch03.pdf.
Received on Sat Mar 24 2007 - 13:08:00 UTC

This archive was generated by hypermail 2.3.0 : Wed Oct 23 2024 - 09:15:01 UTC