Abdulaziz Ghuloum wrote:
> Section 4.2 Lexical syntax (page 12) defines what "constituents" are
> by enumerating their unicode categories. The inclusion of the
> category "Co" is interesting since it's a unicode "private use"
> category containing only the following markers:
>
> E000;<Private Use, First>;Co;0;L;;;;;N;;;;;
> F8FF;<Private Use, Last>;Co;0;L;;;;;N;;;;;
> F0000;<Plane 15 Private Use, First>;Co;0;L;;;;;N;;;;;
> FFFFD;<Plane 15 Private Use, Last>;Co;0;L;;;;;N;;;;;
> 100000;<Plane 16 Private Use, First>;Co;0;L;;;;;N;;;;;
> 10FFFD;<Plane 16 Private Use, Last>;Co;0;L;;;;;N;;;;;

Note that this is a convention to keep UnicodeData.txt a bit smaller:
each First/Last pair of lines marks the two ends of a range, and the
properties apply to *all* the characters in the ranges
E000-F8FF, F0000-FFFFD, and 100000-10FFFD.
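
To make the First/Last convention concrete, here is a minimal Python sketch that expands the six marker lines quoted above into inclusive ranges. The names MARKER_LINES, expand_ranges, and category_of are made up for this illustration and are not part of any library; the parser assumes each First line is followed by its matching Last line, as in the quoted excerpt.

```python
import unicodedata

# The six range-marker lines quoted above, in UnicodeData.txt field format.
MARKER_LINES = """\
E000;<Private Use, First>;Co;0;L;;;;;N;;;;;
F8FF;<Private Use, Last>;Co;0;L;;;;;N;;;;;
F0000;<Plane 15 Private Use, First>;Co;0;L;;;;;N;;;;;
FFFFD;<Plane 15 Private Use, Last>;Co;0;L;;;;;N;;;;;
100000;<Plane 16 Private Use, First>;Co;0;L;;;;;N;;;;;
10FFFD;<Plane 16 Private Use, Last>;Co;0;L;;;;;N;;;;;
"""


def expand_ranges(text):
    """Pair each '<..., First>' line with the next '<..., Last>' line.

    Returns a list of (start, end, category) tuples with inclusive bounds.
    """
    ranges = []
    start = None
    for line in text.splitlines():
        fields = line.split(";")
        code = int(fields[0], 16)   # field 0: code point in hex
        name = fields[1]            # field 1: character name or range marker
        category = fields[2]        # field 2: general category
        if name.endswith(", First>"):
            start = code
        elif name.endswith(", Last>") and start is not None:
            ranges.append((start, code, category))
            start = None
    return ranges


def category_of(code_point, ranges):
    """Return the category of the range containing code_point, or None."""
    for start, end, category in ranges:
        if start <= code_point <= end:
            return category
    return None


RANGES = expand_ranges(MARKER_LINES)

# Total number of code points covered by the three ranges.
TOTAL = sum(end - start + 1 for start, end, _ in RANGES)

# Cross-check one interior code point against Python's own Unicode tables.
INTERIOR_MATCHES = unicodedata.category(chr(0xE123)) == category_of(0xE123, RANGES)
```

With this, category_of(0xE123, RANGES) gives "Co" even though E123 has no line of its own in the file, while a code point just outside a range, such as F900, gives None.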
--
John Cowan cowan at ccil.org http://ccil.org/~cowan
Assent may be registered by a signature, a handshake, or a click of a computer
mouse transmitted across the invisible ether of the Internet. Formality
is not a requisite; any sign, symbol or action, or even willful inaction,
as long as it is unequivocally referable to the promise, may create a contract.
--Specht v. Netscape
Received on Thu Sep 13 2007 - 09:44:35 UTC