===================================================================
RCS file: /cvs/mandoc/mandoc_char.7,v
retrieving revision 1.70
retrieving revision 1.74
diff -u -p -r1.70 -r1.74
--- mandoc/mandoc_char.7 2018/08/08 14:03:27 1.70
+++ mandoc/mandoc_char.7 2018/08/21 01:59:22 1.74
@@ -1,4 +1,4 @@
-.\" $Id: mandoc_char.7,v 1.70 2018/08/08 14:03:27 schwarze Exp $
+.\" $Id: mandoc_char.7,v 1.74 2018/08/21 01:59:22 schwarze Exp $
 .\"
 .\" Copyright (c) 2003 Jason McIntyre
 .\" Copyright (c) 2009, 2010, 2011 Kristaps Dzonsons
@@ -16,7 +16,7 @@
 .\" ACTION OF CONTRACT, NEGLIGENCE OR OTHER TORTIOUS ACTION, ARISING OUT OF
 .\" OR IN CONNECTION WITH THE USE OR PERFORMANCE OF THIS SOFTWARE.
 .\"
-.Dd $Mdocdate: August 8 2018 $
+.Dd $Mdocdate: August 21 2018 $
 .Dt MANDOC_CHAR 7
 .Os
 .Sh NAME
@@ -62,7 +62,7 @@ escape sequences to represent national language charac
 names; instead, provide ASCII transcriptions of the names.
 .Ss Dashes and Hyphens
 In typography there are different types of dashes of various width:
-the hyphen (-),
+the hyphen (\(hy),
 the en-dash (\(en),
 the em-dash (\(em),
 and the mathematical minus sign (\(mi).
@@ -76,30 +76,6 @@ blue-eyed
 lorry-driver
 .Ed
 .Pp
-If a word on a text input line contains a hyphen, a formatter may decide
-to insert an output line break after the hyphen if that helps filling
-the current output line, but the whole word would overflow the line.
-If it is important that the word is not broken across lines in this
-way, a zero-width space
-.Pq Sq \e&
-can be inserted before or after the hyphen.
-While
-.Xr mandoc 1
-never breaks the output line after hyphens adjacent to a zero-width
-space, after any of the other dash- or hyphen-like characters
-represented by escape sequences, or after hyphens inside words in
-macro arguments, other software may not respect these rules and may
-break the line even in such cases.
-.Pp
-Some
-.Xr roff 7
-implementations contains dictionaries allowing to break the line
-at syllable boundaries even inside words that contain no hyphens.
-Such automatic hyphenation is not supported by
-.Xr mandoc 1 ,
-which only breaks the line at whitespace, and inside words only
-after existing hyphens.
-.Pp
 The en-dash is used to separate the two elements of a range,
 or can be used the same way as an em-dash.
 It should be written as
@@ -135,6 +111,30 @@ But currently, no practically relevant manual page for
 requires that subtlety, so in manual pages just write plain
 .Sq -
 to represent hyphen, minus, and hyphen-minus.
+.Pp
+If a word on a text input line contains a hyphen, a formatter may decide
+to insert an output line break after the hyphen if that helps filling
+the current output line, but the whole word would overflow the line.
+If it is important that the word is not broken across lines in this
+way, a zero-width space
+.Pq Sq \e&
+can be inserted before or after the hyphen.
+While
+.Xr mandoc 1
+never breaks the output line after hyphens adjacent to a zero-width
+space, after any of the other dash- or hyphen-like characters
+represented by escape sequences, or after hyphens inside words in
+macro arguments, other software may not respect these rules and may
+break the line even in such cases.
+.Pp
+Some
+.Xr roff 7
+implementations contains dictionaries allowing to break the line
+at syllable boundaries even inside words that contain no hyphens.
+Such automatic hyphenation is not supported by
+.Xr mandoc 1 ,
+which only breaks the line at whitespace, and inside words only
+after existing hyphens.
 .Ss Spaces
 To separate words in normal text, for indenting and alignment
 in literal context, and when none of the following special cases apply,
@@ -543,11 +543,13 @@ Accented letters:
 .It \e(\(aqI Ta \('I Ta acute I
 .It \e(\(aqO Ta \('O Ta acute O
 .It \e(\(aqU Ta \('U Ta acute U
+.It \e(\(aqY Ta \('Y Ta acute Y
 .It \e(\(aqa Ta \('a Ta acute a
 .It \e(\(aqe Ta \('e Ta acute e
 .It \e(\(aqi Ta \('i Ta acute i
 .It \e(\(aqo Ta \('o Ta acute o
 .It \e(\(aqu Ta \('u Ta acute u
+.It \e(\(aqy Ta \('y Ta acute y
 .It \e(\(gaA Ta \(`A Ta grave A
 .It \e(\(gaE Ta \(`E Ta grave E
 .It \e(\(gaI Ta \(`I Ta grave I
@@ -761,14 +763,16 @@ For backward compatibility with existing manuals,
 .Xr mandoc 1
 also supports the
 .Pp
-.Dl \eN\(aq Ns Ar number Ns \(aq
+.Dl \eN\(aq Ns Ar number Ns \(aq and \e[ Ns Cm char Ns Ar number ]
 .Pp
-escape sequence, inserting the character
+escape sequences, inserting the character
 .Ar number
 from the current character set into the output.
 Of course, this is inherently non-portable and is already marked
-as deprecated in the Heirloom roff manual.
-For example, do not use \eN\(aq34\(aq, use \e(dq, or even the plain
+as deprecated in the Heirloom roff manual;
+on top of that, the second form is a GNU extension.
+For example, do not use \eN\(aq34\(aq or \e[char34], use \e(dq,
+or even the plain
 .Sq \(dq
 character where possible.
 .Sh COMPATIBILITY