[R6RS] BOM-based codecs
Michael Sperber
sperber at informatik.uni-tuebingen.de
Mon Aug 14 13:23:09 EDT 2006
I suggest adding a codec returned by a nullary procedure
utf-bom-codec
that will return a codec for a meta-encoding based on the Unicode
byte-order mark. This codec will only work for input ports, and raise
an exception if used for an output port.
For an input port with a transcoder with such a codec, the first
attempt to read from the port will read 2, 3 or 4 bytes from the port
that determine the actual encoding according to the following table:
EF BB BF       UTF-8
FE FF          UTF-16be
FF FE          UTF-16le
00 00 FE FF    UTF-32be
FF FE 00 00    UTF-32le
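
As a rough illustration of the table above, the detection step might be sketched as follows. This is Python rather than Scheme, and it is not part of the proposal. The names BOM_TABLE, detect_bom and decode_with_bom are hypothetical, and raising an error when no byte-order mark is present is an assumption, since the proposal does not say what should happen in that case. The longer marks are checked first because FF FE is a prefix of FF FE 00 00.

```python
# Hypothetical sketch of BOM sniffing; not an R6RS API.
# Longer marks come first so that FF FE 00 00 is not taken for UTF-16le.
BOM_TABLE = [
    (b"\x00\x00\xfe\xff", "utf-32-be"),
    (b"\xff\xfe\x00\x00", "utf-32-le"),
    (b"\xef\xbb\xbf", "utf-8"),
    (b"\xfe\xff", "utf-16-be"),
    (b"\xff\xfe", "utf-16-le"),
]


def detect_bom(data):
    """Return (encoding, bom_length), or (None, 0) if no BOM matches."""
    for bom, encoding in BOM_TABLE:
        if data.startswith(bom):
            return encoding, len(bom)
    return None, 0


def decode_with_bom(data):
    """Decode bytes using the encoding named by their leading BOM."""
    encoding, skip = detect_bom(data)
    if encoding is None:
        # Assumption: the proposal does not define the no-BOM case.
        raise ValueError("no byte-order mark found")
    # Drop the BOM itself before decoding the remaining bytes.
    return data[skip:].decode(encoding)
```

One consequence of checking the longest mark first: a UTF-16le stream whose first character after the BOM is U+0000 begins with FF FE 00 00 and would be read as UTF-32le. The table cannot tell these two cases apart.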
Will, is that what you had in mind as far as the BOM is concerned?
--
Cheers =8-} Mike
Peace, international understanding, and all that blah blah