===================================================================
RCS file: /cvs/mandoc/mandoc_char.7,v
retrieving revision 1.43
retrieving revision 1.45
diff -u -p -r1.43 -r1.45
--- mandoc/mandoc_char.7 2011/04/20 22:50:22 1.43
+++ mandoc/mandoc_char.7 2011/05/15 15:30:33 1.45
@@ -1,4 +1,4 @@
-.\" $Id: mandoc_char.7,v 1.43 2011/04/20 22:50:22 kristaps Exp $
+.\" $Id: mandoc_char.7,v 1.45 2011/05/15 15:30:33 kristaps Exp $
 .\"
 .\" Copyright (c) 2009 Kristaps Dzonsons
 .\"
@@ -14,7 +14,7 @@
 .\" ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF
 .\" OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.
 .\"
-.Dd $Mdocdate: April 20 2011 $
+.Dd $Mdocdate: May 15 2011 $
 .Dt MANDOC_CHAR 7
 .Os
 .Sh NAME
@@ -520,6 +520,20 @@ portable.
 .It \e*(Px Ta \*(Px Ta POSIX standard name
 .It \e*(Ai Ta \*(Ai Ta ANSI standard name
 .El
+.Sh UNICODE CHARACTERS
+The escape sequence
+.Pp
+.Dl \e[uXXXX]
+.Pp
+is interpreted as a Unicode codepoint.
+The codepoint must be in the range above U+0080 and less than U+10FFFF.
+For compatibility, points must be zero-padded to four characters; if
+greater than four characters, no zero padding is allowed.
+Unicode surrogates are not allowed.
+.\" .Pp
+.\" Unicode glyphs attenuate to the
+.\" .Sq \&?
+.\" character if invalid or not rendered by current output media.
 .Sh NUMBERED CHARACTERS
 For backward compatibility with existing manuals,
 .Xr mandoc 1
@@ -541,6 +555,9 @@ troff implementations, at this time limited to GNU tro
 .Pq Qq groff .
 .Pp
 .Bl -dash -compact
+.It
+The \eN\(aq\(aq escape sequence is limited to printable characters; in
+groff, it accepts arbitrary character numbers.
 .It
 In
 .Fl T Ns Cm ascii ,
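
The rules that the new UNICODE CHARACTERS section states for \[uXXXX] can be read as a small validation routine. The following is a minimal sketch in C and is not taken from the mandoc sources: the function name unicode_escape_value is hypothetical, both range bounds are assumed to be exclusive (a literal reading of "above U+0080 and less than U+10FFFF"), the six-digit upper limit is inferred from that bound, and hex digits of either case are assumed to be acceptable since the text does not say.

```c
#include <ctype.h>
#include <stddef.h>
#include <stdlib.h>
#include <string.h>

/*
 * Hypothetical helper, not part of mandoc.
 * "name" is the text between the brackets of \[uXXXX], e.g. "u2014".
 * Returns the code point, or -1 if the name breaks one of the rules.
 */
static long
unicode_escape_value(const char *name)
{
	size_t		 i, len;
	unsigned long	 cp;

	if (name[0] != 'u')
		return -1;
	name++;
	len = strlen(name);

	/* At least four digits; more than six cannot stay below U+10FFFF. */
	if (len < 4 || len > 6)
		return -1;

	/* Forms longer than four digits must not carry zero padding. */
	if (len > 4 && name[0] == '0')
		return -1;

	/* Assumption: hex digits of either case are accepted. */
	for (i = 0; i < len; i++)
		if (!isxdigit((unsigned char)name[i]))
			return -1;

	cp = strtoul(name, NULL, 16);

	/* Assumption: both range bounds are read as exclusive. */
	if (cp <= 0x80 || cp >= 0x10FFFF)
		return -1;

	/* The surrogate range U+D800..U+DFFF is rejected. */
	if (cp >= 0xD800 && cp <= 0xDFFF)
		return -1;

	return (long)cp;
}
```

Under these assumptions, "u2014" and "u1F600" are accepted, while "uE9" (too short), "u01F600" (zero-padded beyond four digits) and "uD800" (surrogate) are rejected.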