Annotation of mandoc/regress/char/unicode/input.in, Revision 1.4
1.4 ! schwarze 1: .\" $OpenBSD: input.in,v 1.4 2021/06/02 17:36:59 schwarze Exp $
1.3 schwarze 2: .TH CHAR-UNICODE-INPUT 1 "June 2, 2021"
1.1 schwarze 3: .SH NAME
4: char-unicode-input \- Unicode characters in the input file
5: .SH DESCRIPTION
6: lowest valid: ย
7: .SS One-byte range
8: .TS
9: tab(:);
10: l l l l.
11: U+0000:0x00:\[u0000] :lowest ASCII
12: U+001f:0x1f:\[u001F]:highest ASCII control character
13: U+007f:0x7f:\[u007F]:highest ASCII
14: :0x80::leading lowest continuation
15: :0xbf:ฟ:leading highest continuation
16: .TE
17: .SS Two-byte range
18: .TS
19: tab(:);
20: l l l l.
21: U+0000:0xc080:ภ:lowest obfuscated ASCII
22: U+007f:0xc1bf:มฟ:highest obfuscated ASCII
23: U+0080:0xc280:\[u0080]ย:lowest two-byte
24: U+07FF:0xdfbf:\[u07FF]฿ฟ:highest two-byte
1.3 schwarze 25: :0xc278:ยx:ASCII instead of continuation
26: :0xc2c380:ยร:start byte instead of continuation
1.1 schwarze 27: .TE
28: .SS Three-byte range
29: .TS
30: tab(:);
31: l l l l.
32: U+0000:0xe08080:ภ:lowest obfuscated ASCII
33: U+007f:0xe081bf:เฟ:highest obfuscated ASCII
34: U+0080:0xe08280:เ:lowest obfuscated two-byte
35: U+07FF:0xe09fbf:เฟ:highest obfuscated two-byte
36: U+0800:0xe0a080:\[u0800]เ :lowest three-byte
1.3 schwarze 37: U+0FFF:0xe0bfbf:\[u0FFF]เฟฟ:end of first start byte
38: U+1000:0xe18080:\[u1000]แ:begin of second start byte
39: U+CFFF:0xecbfbf:\[uCFFF]์ฟฟ:end of last normal start byte
40: U+D000:0xed8080:\[uD000]ํ:begin of last start byte
1.1 schwarze 41: U+D7FF:0xed9fbf:\[uD7FF]ํฟ:highest public three-byte
42: U+D800:0xeda080:\[uD800]ํ :lowest surrogate
43: U+DFFF:0xedbfbf:\[uDFFF]ํฟฟ:highest surrogate
44: U+E000:0xee8080:\[uE000]๎:lowest private use
1.4 ! schwarze 45: U+F8FF:0xefa3bf:\[uF8FF]๏ฃฟ:highest private use
! 46: U+F900:0xefa480:\[uF900]๏ค:lowest post-private
1.1 schwarze 47: U+FFFF:0xefbfbf:\[uFFFF]๏ฟฟ:highest three-byte
48: .TE
49: .SS Four-byte range
50: .TS
51: tab(:);
52: l l l l.
53: U+0000:0xf0808080:๐:lowest obfuscated ASCII
54: U+007f:0xf08081bf:๐ฟ:highest obfuscated ASCII
55: U+0080:0xf0808280:๐:lowest obfuscated two-byte
56: U+07FF:0xf0809fbf:๐ฟ:highest obfuscated two-byte
57: U+0800:0xf080a080:๐ :lowest obfuscated three-byte
58: U+FFFF:0xf08fbfbf:๐ฟฟ:highest obfuscated three-byte
59: U+10000:0xf0908080:\[u10000]๐:lowest four-byte
1.3 schwarze 60: U+3FFFF:0xf0bfbfbf:\[u3FFFF]๐ฟฟฟ:end of first start byte
61: U+40000:0xf1808080:\[u40000]๑:begin of second start byte
62: U+EFFFF:0xf2bfbfbf:\[uEFFFF]๒ฟฟฟ:highest public character
63: U+F0000:0xf3808080:\[uF0000]๓:lowest plane 15 private use
64: U+FFFFF:0xf3bfbfbf:\[uFFFFF]๓ฟฟฟ:highest plane 15 private use
65: U+100000:0xf4808080:\[u100000]๔:lowest plane 16 private use
66: U+10FFFF:0xf48fbfbf:\[u10FFFF]๔ฟฟ:highest valid four-byte
1.1 schwarze 67: U+110000:0xf4908080:\[u110000]๔:lowest beyond Unicode
1.3 schwarze 68: U+13FFFF:0xf4bfbfbf:\[u13FFFF]๔ฟฟฟ:end of last start byte
69: U+140000:0xf5808080:\[u140000]๕:lowest invalid start byte
70: U+1FFFFF:0xf7bfbfbf:\[u1FFFFF]๗ฟฟฟ:highest invalid four-byte
1.1 schwarze 71: U+200000:0xf888808080:\[u200000]๘:lowest five-byte
72: .TE
CVSweb