Annotation of mandoc/regress/char/unicode/input.in, Revision 1.3
1.3 ! schwarze 1: .\" $OpenBSD: input.in,v 1.3 2021/06/02 16:35:25 schwarze Exp $
! 2: .TH CHAR-UNICODE-INPUT 1 "June 2, 2021"
1.1 schwarze 3: .SH NAME
4: char-unicode-input \- Unicode characters in the input file
5: .SH DESCRIPTION
6: lowest valid: ย
7: .SS One-byte range
8: .TS
9: tab(:);
10: l l l l.
11: U+0000:0x00:\[u0000] :lowest ASCII
12: U+001f:0x1f:\[u001F]:highest ASCII control character
13: U+007f:0x7f:\[u007F]:highest ASCII
14: :0x80::leading lowest continuation
15: :0xbf:ฟ:leading highest continuation
16: .TE
17: .SS Two-byte range
18: .TS
19: tab(:);
20: l l l l.
21: U+0000:0xc080:ภ:lowest obfuscated ASCII
22: U+007f:0xc1bf:มฟ:highest obfuscated ASCII
23: U+0080:0xc280:\[u0080]ย:lowest two-byte
24: U+07FF:0xdfbf:\[u07FF]฿ฟ:highest two-byte
1.3 ! schwarze 25: :0xc278:ยx:ASCII instead of continuation
! 26: :0xc2c380:ยร:start byte instead of continuation
1.1 schwarze 27: .TE
28: .SS Three-byte range
29: .TS
30: tab(:);
31: l l l l.
32: U+0000:0xe08080:ภ:lowest obfuscated ASCII
33: U+007f:0xe081bf:เฟ:highest obfuscated ASCII
34: U+0080:0xe08280:เ:lowest obfuscated two-byte
35: U+07FF:0xe09fbf:เฟ:highest obfuscated two-byte
36: U+0800:0xe0a080:\[u0800]เ :lowest three-byte
1.3 ! schwarze 37: U+0FFF:0xe0bfbf:\[u0FFF]เฟฟ:end of first start byte
! 38: U+1000:0xe18080:\[u1000]แ:begin of second start byte
! 39: U+CFFF:0xecbfbf:\[uCFFF]์ฟฟ:end of last normal start byte
! 40: U+D000:0xed8080:\[uD000]ํ:begin of last start byte
1.1 schwarze 41: U+D7FF:0xed9fbf:\[uD7FF]ํฟ:highest public three-byte
42: U+D800:0xeda080:\[uD800]ํ :lowest surrogate
43: U+DFFF:0xedbfbf:\[uDFFF]ํฟฟ:highest surrogate
44: U+E000:0xee8080:\[uE000]๎:lowest private use
45: U+FFFF:0xefbfbf:\[uFFFF]๏ฟฟ:highest three-byte
46: .TE
47: .SS Four-byte range
48: .TS
49: tab(:);
50: l l l l.
51: U+0000:0xf0808080:๐:lowest obfuscated ASCII
52: U+007f:0xf08081bf:๐ฟ:highest obfuscated ASCII
53: U+0080:0xf0808280:๐:lowest obfuscated two-byte
54: U+07FF:0xf0809fbf:๐ฟ:highest obfuscated two-byte
55: U+0800:0xf080a080:๐ :lowest obfuscated three-byte
56: U+FFFF:0xf08fbfbf:๐ฟฟ:highest obfuscated three-byte
57: U+10000:0xf0908080:\[u10000]๐:lowest four-byte
1.3 ! schwarze 58: U+3FFFF:0xf0bfbfbf:\[u3FFFF]๐ฟฟฟ:end of first start byte
! 59: U+40000:0xf1808080:\[u40000]๑:begin of second start byte
! 60: U+EFFFF:0xf2bfbfbf:\[uEFFFF]๒ฟฟฟ:highest public character
! 61: U+F0000:0xf3808080:\[uF0000]๓:lowest plane 15 private use
! 62: U+FFFFF:0xf3bfbfbf:\[uFFFFF]๓ฟฟฟ:highest plane 15 private use
! 63: U+100000:0xf4808080:\[u100000]๔:lowest plane 16 private use
! 64: U+10FFFF:0xf48fbfbf:\[u10FFFF]๔ฟฟ:highest valid four-byte
1.1 schwarze 65: U+110000:0xf4908080:\[u110000]๔:lowest beyond Unicode
1.3 ! schwarze 66: U+13FFFF:0xf4bfbfbf:\[u13FFFF]๔ฟฟฟ:end of last start byte
! 67: U+140000:0xf5808080:\[u140000]๕:lowest invalid start byte
! 68: U+1FFFFF:0xf7bfbfbf:\[u1FFFFF]๗ฟฟฟ:highest invalid four-byte
1.1 schwarze 69: U+200000:0xf888808080:\[u200000]๘:lowest five-byte
70: .TE
CVSweb