Annotation of mandoc/regress/char/unicode/input.in, Revision 1.5
1.5 ! schwarze 1: .\" $OpenBSD: input.in,v 1.5 2024/05/16 20:32:24 schwarze Exp $
! 2: .TH CHAR-UNICODE-INPUT 1 "May 16, 2024"
1.1 schwarze 3: .SH NAME
4: char-unicode-input \- Unicode characters in the input file
5: .SH DESCRIPTION
6: lowest valid:
7: .SS One-byte range
8: .TS
9: tab(:);
10: l l l l.
11: U+0000:0x00:\[u0000] :lowest ASCII
12: U+001f:0x1f:\[u001F]:highest ASCII control character
13: U+007f:0x7f:\[u007F]:highest ASCII
14: :0x80::leading lowest continuation
15: :0xbf::leading highest continuation
16: .TE
17: .SS Two-byte range
18: .TS
19: tab(:);
20: l l l l.
21: U+0000:0xc080::lowest obfuscated ASCII
22: U+007f:0xc1bf::highest obfuscated ASCII
23: U+0080:0xc280:\[u0080]:lowest two-byte
24: U+07FF:0xdfbf:\[u07FF]߿:highest two-byte
1.3 schwarze 25: :0xc278:x:ASCII instead of continuation
26: :0xc2c380:À:start byte instead of continuation
1.1 schwarze 27: .TE
28: .SS Three-byte range
29: .TS
30: tab(:);
31: l l l l.
32: U+0000:0xe08080::lowest obfuscated ASCII
33: U+007f:0xe081bf::highest obfuscated ASCII
34: U+0080:0xe08280::lowest obfuscated two-byte
35: U+07FF:0xe09fbf::highest obfuscated two-byte
36: U+0800:0xe0a080:\[u0800]ࠀ:lowest three-byte
1.3 schwarze 37: U+0FFF:0xe0bfbf:\[u0FFF]:end of first start byte
38: U+1000:0xe18080:\[u1000]က:begin of second start byte
39: U+CFFF:0xecbfbf:\[uCFFF]쿿:end of last normal start byte
40: U+D000:0xed8080:\[uD000]퀀:begin of last start byte
1.5 ! schwarze 41: U+D7FB:0xed9fbb:\[uD7FB]ퟻ:highest valid public three-byte
1.1 schwarze 42: U+D7FF:0xed9fbf:\[uD7FF]:highest public three-byte
43: U+D800:0xeda080:\[uD800]:lowest surrogate
44: U+DFFF:0xedbfbf:\[uDFFF]:highest surrogate
45: U+E000:0xee8080:\[uE000]:lowest private use
1.4 schwarze 46: U+F8FF:0xefa3bf:\[uF8FF]:highest private use
47: U+F900:0xefa480:\[uF900]豈:lowest post-private
1.5 ! schwarze 48: U+FEFF:0xefbbbf:\[uFEFF]:byte-order mark
! 49: U+FFFC:0xefbfbc:\[uFFFC]:object replacement character
! 50: U+FFFD:0xefbfbd:\[uFFFD]�:replacement character
! 51: U+FFFE:0xefbfbe:\[uFFFE]:reversed byte-order mark
1.1 schwarze 52: U+FFFF:0xefbfbf:\[uFFFF]:highest three-byte
53: .TE
54: .SS Four-byte range
55: .TS
56: tab(:);
57: l l l l.
58: U+0000:0xf0808080::lowest obfuscated ASCII
59: U+007f:0xf08081bf::highest obfuscated ASCII
60: U+0080:0xf0808280::lowest obfuscated two-byte
61: U+07FF:0xf0809fbf::highest obfuscated two-byte
62: U+0800:0xf080a080::lowest obfuscated three-byte
63: U+FFFF:0xf08fbfbf::highest obfuscated three-byte
64: U+10000:0xf0908080:\[u10000]𐀀:lowest four-byte
1.3 schwarze 65: U+3FFFF:0xf0bfbfbf:\[u3FFFF]:end of first start byte
66: U+40000:0xf1808080:\[u40000]:begin of second start byte
67: U+EFFFF:0xf3afbfbf:\[uEFFFF]:highest public character
68: U+F0000:0xf3b08080:\[uF0000]:lowest plane 15 private use
69: U+FFFFF:0xf3bfbfbf:\[uFFFFF]:highest plane 15 private use
70: U+100000:0xf4808080:\[u100000]:lowest plane 16 private use
71: U+10FFFF:0xf48fbfbf:\[u10FFFF]:highest valid four-byte
1.1 schwarze 72: U+110000:0xf4908080:\[u110000]:lowest beyond Unicode
1.3 schwarze 73: U+13FFFF:0xf4bfbfbf:\[u13FFFF]:end of last start byte
74: U+140000:0xf5808080:\[u140000]:lowest invalid start byte
75: U+1FFFFF:0xf7bfbfbf:\[u1FFFFF]:highest invalid four-byte
1.1 schwarze 76: U+200000:0xf888808080:\[u200000]:lowest five-byte
77: .TE
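The hex column in the tables above follows the usual UTF-8 bit layout: a start byte whose high bits give the sequence length, followed by continuation bytes carrying six payload bits each. The following is a minimal Python sketch of that layout, not taken from mandoc. The names encode_utf8_raw and classify are hypothetical. The encoder deliberately skips the shortest-form and range checks so that it can reproduce the "obfuscated" (overlong), surrogate, and beyond-Unicode rows. The classifier only mirrors the row labels and makes no claim about how mandoc itself treats such input.

```python
# Hypothetical helpers for illustration; not part of mandoc.

# High marker bits of the start byte for each sequence length.
LEAD_MARKER = {2: 0xC0, 3: 0xE0, 4: 0xF0, 5: 0xF8}

# Smallest code point that genuinely needs each length.
MIN_FOR_LEN = {1: 0x0000, 2: 0x0080, 3: 0x0800, 4: 0x10000, 5: 0x200000}


def encode_utf8_raw(cp: int, nbytes: int) -> bytes:
    """Encode cp in exactly nbytes (1 to 5), without validity checks."""
    if nbytes == 1:
        if cp > 0x7F:
            raise ValueError("code point does not fit in one byte")
        return bytes([cp])
    # The start byte holds 7 - nbytes payload bits, each continuation 6.
    payload_bits = (7 - nbytes) + 6 * (nbytes - 1)
    if cp >= 1 << payload_bits:
        raise ValueError("code point does not fit in this length")
    out = []
    for _ in range(nbytes - 1):
        out.append(0x80 | (cp & 0x3F))  # continuation byte 10xxxxxx
        cp >>= 6
    out.append(LEAD_MARKER[nbytes] | cp)  # start byte
    return bytes(reversed(out))


def classify(cp: int, nbytes: int) -> str:
    """Label a (code point, length) pair the way the table rows do."""
    if nbytes > 4:
        return "too long"
    if cp < MIN_FOR_LEN[nbytes]:
        return "overlong"
    if 0xD800 <= cp <= 0xDFFF:
        return "surrogate"
    if cp > 0x10FFFF:
        return "beyond Unicode"
    return "valid"


if __name__ == "__main__":
    for cp, n in [(0x0000, 2), (0x07FF, 2), (0xD800, 3),
                  (0x10FFFF, 4), (0x110000, 4), (0x200000, 5)]:
        raw = encode_utf8_raw(cp, n)
        print(f"U+{cp:04X} 0x{raw.hex()} {classify(cp, n)}")
```

For sequences that classify as valid, the result should agree with Python's own encoder, for example encode_utf8_raw(0x0800, 3) == chr(0x0800).encode("utf-8"). Python's strict decoder rejects the overlong forms, which makes it a convenient cross-check when adding rows.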