[r6rs-discuss] Lexical syntax: unicode category "Co"

From: John Cowan <cowan>
Date: Thu, 13 Sep 2007 09:44:35 -0400

Abdulaziz Ghuloum scripsit:

> Section 4.2 Lexical syntax (page 12) defines what "constituents" are
> by enumerating their Unicode categories. The inclusion of the
> category "Co" is interesting since it's a Unicode "private use"
> category containing only the following markers:
>
> E000;<Private Use, First>;Co;0;L;;;;;N;;;;;
> F8FF;<Private Use, Last>;Co;0;L;;;;;N;;;;;
> F0000;<Plane 15 Private Use, First>;Co;0;L;;;;;N;;;;;
> FFFFD;<Plane 15 Private Use, Last>;Co;0;L;;;;;N;;;;;
> 100000;<Plane 16 Private Use, First>;Co;0;L;;;;;N;;;;;
> 10FFFD;<Plane 16 Private Use, Last>;Co;0;L;;;;;N;;;;;

Note that this is a convention to keep UnicodeData.txt a bit smaller:
each First/Last pair marks the two ends of a range, and the properties
apply to *all* the characters in the ranges E000-F8FF, F0000-FFFFD,
and 100000-10FFFD.
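
As a rough illustration of the range convention, here is a minimal Python
sketch that pairs the First/Last marker lines quoted above and expands them
into code point ranges. The names UNICODE_DATA_LINES, expand_ranges, and
is_private_use are hypothetical helpers made up for this example, and the
sketch assumes each First line is followed by its matching Last line.

```python
import unicodedata

# Marker lines copied from the quoted message (UnicodeData.txt format).
UNICODE_DATA_LINES = [
    "E000;<Private Use, First>;Co;0;L;;;;;N;;;;;",
    "F8FF;<Private Use, Last>;Co;0;L;;;;;N;;;;;",
    "F0000;<Plane 15 Private Use, First>;Co;0;L;;;;;N;;;;;",
    "FFFFD;<Plane 15 Private Use, Last>;Co;0;L;;;;;N;;;;;",
    "100000;<Plane 16 Private Use, First>;Co;0;L;;;;;N;;;;;",
    "10FFFD;<Plane 16 Private Use, Last>;Co;0;L;;;;;N;;;;;",
]


def expand_ranges(lines):
    """Pair each '<..., First>' line with the next '<..., Last>' line.

    Returns a list of (first_code_point, last_code_point, category) tuples.
    Assumes (hypothetically) that First and Last lines strictly alternate.
    """
    ranges = []
    start = None
    for line in lines:
        fields = line.split(";")
        code = int(fields[0], 16)
        name = fields[1]
        category = fields[2]
        if name.endswith(", First>"):
            start = code
        elif name.endswith(", Last>") and start is not None:
            ranges.append((start, code, category))
            start = None
    return ranges


PRIVATE_USE_RANGES = expand_ranges(UNICODE_DATA_LINES)


def is_private_use(code_point):
    """True if code_point falls inside any of the expanded ranges."""
    return any(first <= code_point <= last
               for first, last, _ in PRIVATE_USE_RANGES)


if __name__ == "__main__":
    for first, last, category in PRIVATE_USE_RANGES:
        print(f"{first:X}-{last:X} {category} ({last - first + 1} code points)")
    # Cross-check one interior code point against Python's own Unicode tables.
    print(unicodedata.category(chr(0xE123)))
```

Under these assumptions, a code point in the middle of a range, such as
E123, is reported as private use even though it has no line of its own in
the data file.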

-- 
John Cowan  cowan at ccil.org   http://ccil.org/~cowan
Assent may be registered by a signature, a handshake, or a click of a computer
mouse transmitted across the invisible ether of the Internet. Formality
is not a requisite; any sign, symbol or action, or even willful inaction,
as long as it is unequivocally referable to the promise, may create a contract.
       --Specht v. Netscape
Received on Thu Sep 13 2007 - 09:44:35 UTC
