mandoc(1)

General Commands Manual

Name

mandoc — format manual pages

Synopsis

mandoc [-ac] [-I os=name] [-K encoding] [-mdoc | -man] [-O options] [-T output] [-W level] [file ]
mandoc [-v | --version]
mandoc [-h | --help]

The last two lines above are fictional additions to mandoc's usual synopsis, included to demonstrate handling of more complex synopsis content.

Description

The mandoc utility formats UNIX manual pages for display.

By default, mandoc reads mdoc(7) or man(7) text from stdin and produces -T locale output.

The options are as follows:

-a

If the standard output is a terminal device and -c is not specified, use more(1) to paginate the output, just like man(1) would.

-c

Copy the formatted manual pages to the standard output without using more(1) to paginate them. This is the default. It can be specified to override -a.

-I os=name

Override the default operating system name for the mdoc(7) Os and for the man(7) TH macro.

-K encoding

Specify the input encoding. The supported encoding arguments are us-ascii, iso-8859-1, and utf-8. If not specified, autodetection uses the first match in the following list:

  1. If the first three bytes of the input file are the UTF-8 byte-order mark (BOM, 0xefbbbf), input is interpreted as utf-8.
  2. If the first or second line of the input file matches the Emacs mode line format
    .\" -*- […;] coding: encoding; -*-
    then input is interpreted according to encoding.
  3. If the first non-ASCII byte in the file introduces a valid UTF-8 sequence, input is interpreted as utf-8.
  4. Otherwise, input is interpreted as iso-8859-1.
-mdoc | -man

With -mdoc, all input files are interpreted as mdoc(7). With -man, all input files are interpreted as man(7). By default, the input language is automatically detected for each file: if the first macro is Dd or Dt, the mdoc(7) parser is used; otherwise, the man(7) parser is used. With other arguments, -m is silently ignored.

-O options

Comma-separated output options.

-T output

Output format. See Output Formats for available formats. Defaults to -T locale.

-W level

Specify the minimum message level to be reported on the standard error output and to affect the exit status. The level can be base, style, warning, error, or unsupp. The base level automatically derives the operating system from the contents of the Os macro, from the -Ios command line option, or from the uname(3) return value. The levels openbsd and netbsd are variants of base that bypass autodetection and request validation of base system conventions for a particular operating system. The level all is an alias for base. By default, mandoc is silent. See Exit status.

The special option -W stop tells mandoc to exit after parsing a file that causes warnings or errors of at least the requested level. No formatted output will be produced from that file. If both a level and stop are requested, they can be joined with a comma, for example -W error,stop.

file

Read input from zero or more files. If unspecified, reads from stdin. If multiple files are specified, mandoc will halt with the first failed parse.

The options -fhklw are also supported and are documented in man(1). In -f and -k mode, mandoc also supports the options -CMmOSs described in the apropos(1) manual. The options -fkl are mutually exclusive and override each other.

Output formats

The mandoc utility accepts the following -T arguments, which correspond to output modes:

-T ascii

Produce 7-bit ASCII output. See ASCII Output.

-T html

Produce HTML5, CSS1, and MathML output. See HTML Output.

-T lint

Parse only: produce no output. Implies -W all and redirects parser messages, which usually appear on standard error output, to standard output.

-T locale

Encode output using the current locale. This is the default. See Locale Output.

-T man

Produce man(7) format output. See Man Output.

-T markdown

Produce output in markdown format. See Markdown Output.

-T pdf

Produce PDF output. See PDF Output.

-T ps

Produce PostScript output. See PostScript Output.

-T tree

Produce an indented parse tree. See Syntax tree output.

-T utf8

Encode output in the UTF-8 multi-byte format. See UTF-8 Output.

If multiple input files are specified, these will be processed by the corresponding filter in-order.

ASCII output

Output produced by -T ascii is rendered in standard 7-bit ASCII documented in ascii(7).

Font styles are applied by using back-spaced encoding such that an underlined character c is rendered as _\[bs]c, where \[bs] is the back-space character number 8. Emboldened characters are rendered as c\[bs]c.

The special characters documented in mandoc_char(7) are rendered best-effort in an ASCII equivalent.

The following -O arguments are accepted:

indent=indent

The left margin for normal text is set to indent blank characters instead of the default of five for mdoc(7) and seven for man(7). Increasing this is not recommended; it may result in degraded formatting, for example overfull lines or ugly line breaks. When output is to a pager on a terminal that is less than 66 columns wide, the default is reduced to three columns.

mdoc

Format man(7) input files in mdoc(7) output style. Specifically, this suppresses the two additional blank lines near the top and the bottom of each page, and it implies -O indent=5. One useful application is for checking that -T man output formats in the same way as the mdoc(7) source it was generated from.

width=width

The output width is set to width instead of the default of 78. When output is to a pager on a terminal that is less than 79 columns wide, the default is reduced to one less than the terminal width. In any case, lines that are output in literal mode are never wrapped and may exceed the output width.

HTML output

Output produced by -T html conforms to HTML5 using optional self-closing tags. Default styles use only CSS1. Equations rendered from eqn(7) blocks use MathML.

The mandoc.css file documents style-sheet classes available for customising output. If a style-sheet is not specified with -O style, -T html defaults to simple output (via an embedded style-sheet) readable in any graphical or text-based web browser.

Special characters are rendered in decimal-encoded UTF-8.

The following -O arguments are accepted:

fragment

Omit the <!DOCTYPE> declaration and the <html>, <head>, and <body> elements and only emit the subtree below the <body> element. The style argument will be ignored. This is useful when embedding manual content within existing documents.

includes=fmt

The string fmt, for example, ../src/%I.html, is used as a template for linked header files (usually via the In macro). Instances of %I are replaced with the include filename. The default is not to present a hyperlink.

man=fmt

The string fmt, for example, ../html%S/%N.%S.html, is used as a template for linked manuals (usually via the Xr macro). Instances of %N and %S are replaced with the linked manual’s name and section, respectively. If no section is included, section 1 is assumed. The default is not to present a hyperlink.

style=style.css

The file style.css is used for an external style-sheet. This must be a valid absolute or relative URI.

Locale output

Locale-depending output encoding is triggered with -T locale. This is the default.

This option is not available on all systems: systems without locale support, or those whose internal representation is not natively UCS-4, will fall back to -T ascii. See ASCII Output for font style specification and available command-line arguments.

Man output

Translate input format into man(7) output format. This is useful for distributing manual sources to legacy systems lacking mdoc(7) formatters.

If mdoc(7) is passed as input, it is translated into man(7). If the input format is man(7), the input is copied to the output, expanding any roff(7) so requests. The parser is also run, and as usual, the -W level controls which diagnostics are displayed before copying the input to the output.

Markdown output

Translate mdoc(7) input to the markdown format conforming to John Gruber’s 2004 specification. The output also almost conforms to the CommonMark specification.

The character set used for the markdown output is ASCII. Non-ASCII characters are encoded as HTML entities. Since that is not possible in literal font contexts, because these are rendered as code spans and code blocks in the markdown output, non-ASCII characters are transliterated to ASCII approximations in these contexts.

Markdown is a very weak markup language, so all semantic markup is lost, and even part of the presentational markup may be lost. Do not use this as an intermediate step in converting to HTML; instead, use -T html directly.

The man(7), tbl(7), and eqn(7) input languages are not supported by -T markdown output mode.

PDF output

PDF-1.1 output may be generated by -T pdf. See PostScript output for -O arguments and defaults.

PostScript output

PostScript “Adobe-3.0” Level-2 pages may be generated by -T ps. Output pages default to letter sized and are rendered in the Times font family, 11-point. Margins are calculated as 1/9 the page length and width. Line-height is 1.4m.

Special characters are rendered as in ASCII output.

The following -O arguments are accepted:

paper=name

The paper size name may be one of A3, A4, A5, legal, or letter. You may also manually specify dimensions as NNxNN, width by height in millimetres. If an unknown value is encountered, letter is used.

UTF-8 output

Use -T utf8 to force a UTF-8 locale. See Locale output for details and options.

Syntax tree output

Use -T tree to show a human readable representation of the syntax tree. It is useful for debugging the source code of manual pages. The exact format is subject to change, so don't write parsers for it.

The first paragraph shows meta data found in the mdoc(7) prologue, on the man(7) TH line, or the fallbacks used.

In the tree dump, each output line shows one syntax tree node. Child nodes are indented with respect to their parent node. The columns are:

  1. For macro nodes, the macro name; for text and tbl(7) nodes, the content. There is a special format for eqn(7) nodes.
  2. Node type (text, elem, block, head, body, body-end, tail, tbl, eqn).
  3. Flags:
    • An opening parenthesis if the node is an opening delimiter.
    • An asterisk if the node starts a new input line.
    • The input line number (starting at one).
    • A colon.
    • The input column number (starting at one).
    • A closing parenthesis if the node is a closing delimiter.
    • A full stop if the node ends a sentence.
    • BROKEN if the node is a block broken by another block.
    • NOSRC if the node is not in the input file, but automatically generated from macros.
    • NOPRT if the node is not supposed to generate output for any output format.

The following -O argument is accepted:

noval
Skip validation and show the unvalidated syntax tree. This can help to find out whether a given behaviour is caused by the parser or by the validator. Meta data is not available in this case.

Environment

MANPAGER
Any non-empty value of the environment variable MANPAGER is used instead of the standard pagination program, more(1); see man(1) for details. Only used if -a or -l is specified.
PAGER
Specifies the pagination program to use when MANPAGER is not defined. If neither PAGER nor MANPAGER is defined, more(1) -s is used. Only used if -a or -l is specified.

Exit status

The mandoc utility exits with one of the following values, controlled by the message level associated with the -W option:

0
No base system convention violations, style suggestions, warnings, or errors occurred, or those that did were ignored because they were lower than the requested level.
1
At least one base system convention violation or style suggestion occurred, but no warning or error, and -W base or -W style was specified.
2
At least one warning occurred, but no error, and -W warning or a lower level was requested.
3
At least one parsing error occurred, but no unsupported feature was encountered, and -W error or a lower level was requested.
4
At least one unsupported feature was encountered, and -W unsupp or a lower level was requested.
5
Invalid command line arguments were specified. No input files have been read.
6
An operating system error occurred, for example exhaustion of memory, file descriptors, or process table entries. Such errors cause mandoc to exit at once, possibly in the middle of parsing or formatting a file.

Note that selecting -T lint output mode implies -W all.

Examples

To page manuals to the terminal:
$ mandoc -l mandoc.1 man.1 apropos.1 makewhatis.8
To produce HTML manuals with mandoc.css as the style-sheet:
$ mandoc -T html -O style=mandoc.css mdoc.7 > mdoc.7.html
To check over a large set of manuals:
$ mandoc -T lint `find /usr/src -name \*\.[1-9]`
To produce a series of PostScript manuals for A4 paper:
$ mandoc -T ps -O paper=a4 mdoc.7 man.7 > manuals.ps
Convert a modern mdoc(7) manual to the older man(7) format, for use on systems lacking an mdoc(7) parser:
$ mandoc -T man foo.mdoc > foo.man

Diagnostics

Messages displayed by mandoc follow this format:

mandoc: file:line:column: level: message: macro args (os)

Line and column numbers start at 1. Both are omitted for messages referring to an input file as a whole. Macro names and arguments are omitted where meaningless. The os operating system specifier is omitted for messages that are relevant for all operating systems. Fatal messages about invalid command line arguments or operating system errors, for example when memory is exhausted, may also omit the file and level fields.

Message levels have the following meanings:

unsupp
An input file uses unsupported low-level roff(7) features. The output may be incomplete and/or misformatted, so using GNU troff instead of mandoc to process the file may be preferable.
error
Indicates a risk of information loss or severe misformatting, in most cases caused by serious syntax errors.
warning
Indicates a risk that the information shown or its formatting may mismatch the author's intent in minor ways. Additionally, syntax errors are classified at least as warnings, even if they do not usually cause misformatting.
style
An input file uses dubious or discouraged style. This is not a complaint about the syntax, and probably neither formatting nor portability are in danger. While great care is taken to avoid false positives on the higher message levels, the style level tries to reduce the probability that issues go unnoticed, so it may occasionally issue bogus suggestions. Please use your good judgement to decide whether any particular style suggestion really justifies a change to the input file.
base
A convention used in the base system of a specific operating system is not adhered to. These are not markup mistakes, and neither the quality of formatting nor portability are in danger. Messages of the base level are printed with the more intuitive style level tag.

Messages of the base, style, warning, error, and unsupp levels except those about non-existent or unreadable input files are hidden unless their level, or a lower level, is requested using a -W option or -T lint output mode.

As indicated below, all base and some style checks are only performed if a specific operating system name occurs in the arguments of the -W command line option, of the Os macro, of the -Ios command line option, or, if neither are present, in the return value of the uname(3) function.

Conventions for base system manuals

Mdocdate found
(mdoc, NetBSD) The Dd macro uses CVS Mdocdate keyword substitution, which is not supported by the NetBSD base system. Consider using the conventional “Month dd, yyyy” format instead.
Mdocdate missing
(mdoc, OpenBSD) The Dd macro does not use CVS Mdocdate keyword substitution, but using it is conventionally expected in the OpenBSD base system.
unknown architecture
(mdoc, OpenBSD, NetBSD) The third argument of the Dt macro does not match any of the architectures this operating system is running on.
operating system explicitly specified
(mdoc, OpenBSD, NetBSD) The Os macro has an argument. In the base system, it is conventionally left blank.
RCS id missing
(OpenBSD, NetBSD) The manual page lacks the comment line with the RCS identifier generated by CVS OpenBSD or NetBSD keyword substitution as conventionally used in these operating systems.
referenced manual not found
(mdoc) An Xr macro references a manual page that is not found in the base system. The path to look for base system manuals is configurable at compile time and defaults to /usr/share/man: /usr/X11R6/man.

Style suggestions

legacy man(7) date format
(mdoc) The Dd macro uses the legacy man(7) date format “yyyy-dd-mm”. Consider using the conventional mdoc(7) date format “Month dd, yyyy” instead.
lower case character in document title
(mdoc, man) The title is still used as given in the Dt or TH macro.
duplicate RCS id
A single manual page contains two copies of the RCS identifier for the same operating system. Consider deleting the later instance and moving the first one up to the top of the page.
possible typo in section name
(mdoc) Fuzzy string matching revealed that the argument of an Sh macro is similar, but not identical to a standard section name.
unterminated quoted argument
(roff) Macro arguments can be enclosed in double quote characters such that space characters and macro names contained in the quoted argument need not be escaped. The closing quote of the last argument of a macro can be omitted. However, omitting it is not recommended because it makes the code harder to read.
useless macro
(mdoc) A Bt, Tn, or Ud macro was found. Simply delete it: it serves no useful purpose.
consider using OS macro
(mdoc) A string was found in plain text or in a Bx macro that could be represented using Ox, Nx, Fx, or Dx.
errnos out of order
(mdoc, NetBSD) The Er items in a Bl list are not in alphabetical order.
duplicate errno
(mdoc, NetBSD) A Bl list contains two consecutive It entries describing the same Er number.
trailing delimiter
(mdoc) The last argument of an Ex, Fo, Nd, Nm, Os, Sh, Ss, St, or Sx macro ends with a trailing delimiter. This is usually bad style and often indicates typos. Most likely, the delimiter can be removed.
no blank before trailing delimiter
(mdoc) The last argument of a macro that supports trailing delimiter arguments is longer than one byte and ends with a trailing delimiter. Consider inserting a blank such that the delimiter becomes a separate argument, thus moving it out of the scope of the macro.
fill mode already enabled, skipping
(man) A fi request occurs even though the document is still in fill mode, or already switched back to fill mode. It has no effect.
fill mode already disabled, skipping
(man) An nf request occurs even though the document already switched to no-fill mode and did not switch back to fill mode yet. It has no effect.
verbatim "--", maybe consider using \(em
(mdoc) Even though the ASCII output device renders an em-dash as --, that is not a good way to write it in an input file because it renders poorly on all other output devices.
function name without markup
(mdoc) A word followed by an empty pair of parentheses occurs on a text line. Consider using an Fn or Xr macro.
whitespace at end of input line
(mdoc, man, roff) Whitespace at the end of input lines is almost never semantically significant — but in the odd case where it might be, it is extremely confusing when reviewing and maintaining documents.
bad comment style
(roff) Comment lines start with a dot, a backslash, and a double-quote character. The mandoc utility treats the line as a comment line even without the backslash, but leaving out the backslash might not be portable.

Warnings regarding document structure

.so is fragile, better use ln(1)
(roff) Including files only works when the parser program runs with the correct current working directory.
no document body
(mdoc, man) The document body contains neither text nor macros. An empty document is shown, consisting only of a header and a footer line.
content before first section header
(mdoc, man) Some macros or text precede the first Sh or SH section header. The offending macros and text are parsed and added to the top level of the syntax tree, outside any section block.
first section is not NAME
(mdoc) The argument of the first Sh macro is not ‘NAME’. This may confuse makewhatis(8) and apropos(1).
NAME section without Nm before Nd
(mdoc) The NAME section does not contain any Nm child macro before the first Nd macro.
NAME section without description
(mdoc) The NAME section lacks the mandatory Nd child macro.
description not at the end of NAME
(mdoc) The NAME section does contain an Nd child macro, but other content follows it.
bad NAME section content
(mdoc) The NAME section contains plain text or macros other than Nm and Nd.
missing comma before name
(mdoc) The NAME section contains an Nm macro that is neither the first one nor preceded by a comma.
missing description line, using ""
(mdoc) The Nd macro lacks the required argument. The title line of the manual will end after the dash.
description line outside NAME section
(mdoc) An Nd macro appears outside the NAME section. The arguments are printed anyway and the following text is used for apropos(1), but none of that behaviour is portable.
sections out of conventional order
(mdoc) A standard section occurs after another section it usually precedes. All section titles are used as given, and the order of sections is not changed.
duplicate section title
(mdoc) The same standard section title occurs more than once.
unexpected section
(mdoc) A standard section header occurs in a section of the manual where it normally isn't useful.
cross reference to self
(mdoc) An Xr macro refers to a name and section matching the section of the present manual page and a name mentioned in an Nm macro in the NAME or SYNOPSIS section, or in an Fn or Fo macro in the SYNOPSIS. Consider using Nm or Fn instead of Xr.
unusual Xr order
(mdoc) In the SEE ALSO section, an Xr macro with a lower section number follows one with a higher number, or two Xr macros referring to the same section are out of alphabetical order.
unusual Xr punctuation
(mdoc) In the SEE ALSO section, punctuation between two Xr macros differs from a single comma, or there is trailing punctuation after the last Xr macro.
AUTHORS section without An macro
(mdoc) An AUTHORS sections contains no An macros, or only empty ones. Probably, there are author names lacking markup.

Unsupported features

input too large
(mdoc, man) Currently, mandoc cannot handle input files larger than its arbitrary size limit of 2^31 bytes (2 Gigabytes). Since useful manuals are always small, this is not a problem in practice. Parsing is aborted as soon as the condition is detected.
unsupported control character
(roff) An ASCII control character supported by other roff(7) implementations but not by mandoc was found in an input file. It is replaced by a question mark.
unsupported roff request
(roff) An input file contains a roff(7) request supported by GNU troff or Heirloom troff but not by mandoc, and it is likely that this will cause information loss or considerable misformatting.
eqn delim option in tbl
(eqn, tbl) The options line of a table defines equation delimiters. Any equation source code contained in the table will be printed unformatted.
unsupported table layout modifier
(tbl) A table layout specification contains an m modifier. The modifier is discarded.
ignoring macro in table
(tbl, mdoc, man) A table contains an invocation of an mdoc(7) or man(7) macro or of an undefined macro. The macro is ignored, and its arguments are handled as if they were a text line.

See also

apropos(1), man(1), eqn(7), man(7), mandoc_char(7), mdoc(7), roff(7), tbl(7)

History

The mandoc utility first appeared in OpenBSD 4.8. The option -I appeared in OpenBSD 5.2, and -aCcfhKklMSsw in OpenBSD 5.7.

Authors

The mandoc utility was written by and is maintained by .