===================================================================
RCS file: /cvs/mandoc/mandoc_char.7,v
retrieving revision 1.42
retrieving revision 1.45
diff -u -p -r1.42 -r1.45
--- mandoc/mandoc_char.7 2011/02/09 22:53:20 1.42
+++ mandoc/mandoc_char.7 2011/05/15 15:30:33 1.45
@@ -1,4 +1,4 @@
-.\" $Id: mandoc_char.7,v 1.42 2011/02/09 22:53:20 schwarze Exp $
+.\" $Id: mandoc_char.7,v 1.45 2011/05/15 15:30:33 kristaps Exp $
 .\"
 .\" Copyright (c) 2009 Kristaps Dzonsons
 .\"
@@ -14,7 +14,7 @@
 .\" ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF
 .\" OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.
 .\"
-.Dd $Mdocdate: February 9 2011 $
+.Dd $Mdocdate: May 15 2011 $
 .Dt MANDOC_CHAR 7
 .Os
 .Sh NAME
@@ -481,8 +481,13 @@ Greek letters:
 .It \e(ts Ta \(ts Ta sigma terminal
 .El
 .Sh PREDEFINED STRINGS
-These are not recommended for use, as they differ across
-implementations:
+Predefined strings are inherited from the macro packages of historical
+troff implementations.
+They are
+.Em not recommended
+for use, as they differ across implementations.
+Manuals using these predefined strings are almost certainly not
+portable.
 .Pp
 .Bl -column -compact -offset indent "Input" "Rendered" "Description"
 .It Em Input Ta Em Rendered Ta Em Description
@@ -512,7 +517,23 @@ implementations:
 .It \e*(>= Ta \*(>= Ta greater-than-equal
 .It \e*(aa Ta \*(aa Ta acute
 .It \e*(ga Ta \*(ga Ta grave
+.It \e*(Px Ta \*(Px Ta POSIX standard name
+.It \e*(Ai Ta \*(Ai Ta ANSI standard name
 .El
+.Sh UNICODE CHARACTERS
+The escape sequence
+.Pp
+.Dl \e[uXXXX]
+.Pp
+is interpreted as a Unicode codepoint.
+The codepoint must be in the range above U+0080 and less than U+10FFFF.
+For compatibility, points must be zero-padded to four characters; if
+greater than four characters, no zero padding is allowed.
+Unicode surrogates are not allowed.
+.\" .Pp
+.\" Unicode glyphs attenuate to the
+.\" .Sq \&?
+.\" character if invalid or not rendered by current output media.
 .Sh NUMBERED CHARACTERS
 For backward compatibility with existing manuals,
 .Xr mandoc 1
@@ -534,6 +555,9 @@ troff implementations, at this time limited to GNU tro
 .Pq Qq groff .
 .Pp
 .Bl -dash -compact
+.It
+The \eN\(aq\(aq escape sequence is limited to printable characters; in
+groff, it accepts arbitrary character numbers.
 .It
 In
 .Fl T Ns Cm ascii ,