[BACK]Return to input.in CVS log [TXT][DIR] Up to [cvsweb.bsd.lv] / mandoc / regress / char / unicode

File: [cvsweb.bsd.lv] / mandoc / regress / char / unicode / input.in (download)

Revision 1.4, Wed Jun 2 17:39:00 2021 UTC (3 years, 1 month ago) by schwarze
Branch: MAIN
CVS Tags: VERSION_1_14_6
Changes since 1.3: +3 -1 lines

test private use areas some more as they have proven fragile

.\" $OpenBSD: input.in,v 1.4 2021/06/02 17:36:59 schwarze Exp $
.TH CHAR-UNICODE-INPUT 1 "June 2, 2021"
.SH NAME
char-unicode-input \- Unicode characters in the input file
.SH DESCRIPTION
lowest valid: ย€
.SS One-byte range
.TS
tab(:);
l l l l.
U+0000:0x00:\[u0000]:lowest ASCII
U+001f:0x1f:\[u001F]:highest ASCII control character
U+007f:0x7f:\[u007F]:highest ASCII
:0x80:€:leading lowest continuation
:0xbf:ฟ:leading highest continuation
.TE
.SS Two-byte range
.TS
tab(:);
l l l l.
U+0000:0xc080:ภ€:lowest obfuscated ASCII
U+007f:0xc1bf:มฟ:highest obfuscated ASCII
U+0080:0xc280:\[u0080]ย€:lowest two-byte
U+07FF:0xdfbf:\[u07FF]฿ฟ:highest two-byte
:0xc278:ยx:ASCII instead of continuation
:0xc2c380:ยร€:start byte instead of continuation
.TE
.SS Three-byte range
.TS
tab(:);
l l l l.
U+0000:0xe08080:ภ€€:lowest obfuscated ASCII
U+007f:0xe081bf:เฟ:highest obfuscated ASCII
U+0080:0xe08280:เ‚€:lowest obfuscated two-byte
U+07FF:0xe09fbf:เŸฟ:highest obfuscated two-byte
U+0800:0xe0a080:\[u0800]เ €:lowest three-byte
U+0FFF:0xe0bfbf:\[u0FFF]เฟฟ:end of first start byte
U+1000:0xe18080:\[u1000]แ€€:begin of second start byte
U+CFFF:0xecbfbf:\[uCFFF]์ฟฟ:end of last normal start byte
U+D000:0xed8080:\[uD000]ํ€€:begin of last start byte
U+D7FF:0xed9fbf:\[uD7FF]ํŸฟ:highest public three-byte
U+D800:0xeda080:\[uD800]ํ €:lowest surrogate
U+DFFF:0xedbfbf:\[uDFFF]ํฟฟ:highest surrogate
U+E000:0xee8080:\[uE000]๎€€:lowest private use
U+F8FF:0xefa3bf:\[uF8FF]๏ฃฟ:highest private use
U+F900:0xefa480:\[uF900]๏ค€:lowest post-private
U+FFFF:0xefbfbf:\[uFFFF]๏ฟฟ:highest three-byte
.TE
.SS Four-byte range
.TS
tab(:);
l l l l.
U+0000:0xf0808080:๐€€€:lowest obfuscated ASCII
U+007f:0xf08081bf:๐€ฟ:highest obfuscated ASCII
U+0080:0xf0808280:๐€‚€:lowest obfuscated two-byte
U+07FF:0xf0809fbf:๐€Ÿฟ:highest obfuscated two-byte
U+0800:0xf080a080:๐€ €:lowest obfuscated three-byte
U+FFFF:0xf08fbfbf:๐ฟฟ:highest obfuscated three-byte
U+10000:0xf0908080:\[u10000]๐€€:lowest four-byte
U+3FFFF:0xf0bfbfbf:\[u3FFFF]๐ฟฟฟ:end of first start byte
U+40000:0xf1808080:\[u40000]๑€€€:begin of second start byte
U+EFFFF:0xf2bfbfbf:\[uEFFFF]๒ฟฟฟ:highest public character
U+F0000:0xf3808080:\[uF0000]๓€€€:lowest plane 15 private use
U+FFFFF:0xf3bfbfbf:\[uFFFFF]๓ฟฟฟ:highest plane 15 private use
U+100000:0xf4808080:\[u100000]๔€€€:lowest plane 16 private use
U+10FFFF:0xf48fbfbf:\[u10FFFF]๔ฟฟ:highest valid four-byte
U+110000:0xf4908080:\[u110000]๔€€:lowest beyond Unicode
U+13FFFF:0xf4bfbfbf:\[u13FFFF]๔ฟฟฟ:end of last start byte
U+140000:0xf5808080:\[u140000]๕€€€:lowest invalid start byte
U+1FFFFF:0xf7bfbfbf:\[u1FFFFF]๗ฟฟฟ:highest invalid four-byte
U+200000:0xf888808080:\[u200000]๘ˆ€€€:lowest five-byte
.TE