[BACK]Return to input.in CVS log [TXT][DIR] Up to [cvsweb.bsd.lv] / mandoc / regress / char / unicode

File: [cvsweb.bsd.lv] / mandoc / regress / char / unicode / input.in (download)

Revision 1.2, Tue Jul 4 15:24:44 2017 UTC (6 years, 11 months ago) by schwarze
Branch: MAIN
CVS Tags: VERSION_1_14_5, VERSION_1_14_4, VERSION_1_14_3, VERSION_1_14_2
Changes since 1.1: +2 -1 lines

Messages of the -Wbase level now print STYLE:.  Since this
causes horrible churn anyway, profit of the opportunity to stop
excessive testing, such that this is hopefully the last instance
of such churn.  Consistently use OpenBSD RCS tags, blank .Os,
blank fourth .TH argument, and Mdocdate like everywhere else.
Use -Ios=OpenBSD for platform-independent predictable output.

.\" $OpenBSD: input.in,v 1.2 2017/07/04 14:53:23 schwarze Exp $
.TH CHAR-UNICODE-INPUT 1 "December 19, 2014"
.SH NAME
char-unicode-input \- Unicode characters in the input file
.SH DESCRIPTION
lowest valid: ย€
.SS One-byte range
.TS
tab(:);
l l l l.
U+0000:0x00:\[u0000]:lowest ASCII
U+001f:0x1f:\[u001F]:highest ASCII control character
U+007f:0x7f:\[u007F]:highest ASCII
:0x80:€:leading lowest continuation
:0xbf:ฟ:leading highest continuation
.TE
.SS Two-byte range
.TS
tab(:);
l l l l.
U+0000:0xc080:ภ€:lowest obfuscated ASCII
U+007f:0xc1bf:มฟ:highest obfuscated ASCII
:0xc278:ยx:ASCII continuation
U+0080:0xc280:\[u0080]ย€:lowest two-byte
:0xc2c380:ยร€:high continuation
U+07FF:0xdfbf:\[u07FF]฿ฟ:highest two-byte
.TE
.SS Three-byte range
.TS
tab(:);
l l l l.
U+0000:0xe08080:ภ€€:lowest obfuscated ASCII
U+007f:0xe081bf:เฟ:highest obfuscated ASCII
U+0080:0xe08280:เ‚€:lowest obfuscated two-byte
U+07FF:0xe09fbf:เŸฟ:highest obfuscated two-byte
U+0800:0xe0a080:\[u0800]เ €:lowest three-byte
U+0FFF:0xe0bfbf:\[u0FFF]เฟฟ:end of first middle byte
U+1000:0xe18080:\[u1000]แ€€:begin of second middle byte
U+CFFF:0xecbfbf:\[uCFFF]์ฟฟ:end of last normal middle byte
U+D000:0xed8080:\[uD000]ํ€€:begin of strange middle byte
U+D7FF:0xed9fbf:\[uD7FF]ํŸฟ:highest public three-byte
U+D800:0xeda080:\[uD800]ํ €:lowest surrogate
U+DFFF:0xedbfbf:\[uDFFF]ํฟฟ:highest surrogate
U+E000:0xee8080:\[uE000]๎€€:lowest private use
U+FFFF:0xefbfbf:\[uFFFF]๏ฟฟ:highest three-byte
.TE
.SS Four-byte range
.TS
tab(:);
l l l l.
U+0000:0xf0808080:๐€€€:lowest obfuscated ASCII
U+007f:0xf08081bf:๐€ฟ:highest obfuscated ASCII
U+0080:0xf0808280:๐€‚€:lowest obfuscated two-byte
U+07FF:0xf0809fbf:๐€Ÿฟ:highest obfuscated two-byte
U+0800:0xf080a080:๐€ €:lowest obfuscated three-byte
U+FFFF:0xf08fbfbf:๐ฟฟ:highest obfuscated three-byte
U+10000:0xf0908080:\[u10000]๐€€:lowest four-byte
U+3FFFF:0xf0bfbfbf:\[u3FFFF]๐ฟฟฟ:end of first middle byte
U+40000:0xf1808080:\[u40000]๑€€€:second middle byte
U+FFFFF:0xf3bfbfbf:\[uFFFFF]๓ฟฟฟ:last normal middle byte
U+100000:0xf4808080:\[u100000]๔€€€:strange middle byte
U+10FFFF:0xf48fbfbf:\[u10FFFF]๔ฟฟ:last valid four-byte
U+110000:0xf4908080:\[u110000]๔€€:lowest beyond Unicode
U+13FFFF:0xf4bfbfbf:\[u13FFFF]๔ฟฟฟ:end of strange middle byte
U+140000:0xf5808080:\[u140000]๕€€€:lowest invalid middle byte
U+1FFFFF:0xf7bfbfbf:\[u1FFFFF]๗ฟฟฟ:highest four-byte
U+200000:0xf888808080:\[u200000]๘ˆ€€€:lowest five-byte
.TE