...

About the -M Operator" in somedoc

name/"section"E> links from Cname/itemE> links (and their targets). These have been merged syntactically and semantically in the current specification, and I

can refer either to a "=headI Heading Content" command or to a "=item Item Content" command. This specification does not specify what behavior should be in the case of a given document having several things all seeming to produce the same I

identifier (e.g., in HTML, several things all producing the same I in ... elements). Where Pod processors can control this behavior, they should use the first such anchor. That is, CFoo/BarE> refers to the I "Bar" section in Foo. But for some processors/formats this cannot be easily controlled; as with the HTML example, the behavior of multiple ambiguous ... is most easily just left up to browsers to decide. =item * Authors wanting to link to a particular (absolute) URL, must do so only with "LEscheme:...>" codes (like LEhttp://www.perl.org>), and must not attempt "LESome Site Name|scheme:...>" codes. This restriction avoids many problems in parsing and rendering LE...> codes. =item * In a Ctext|...E> code, text may contain formatting codes for formatting or for EE...> escapes, as in: Lstuff>|...> For C...E> codes without a "name|" part, only C...E> and CE> codes may occur -- no other formatting codes. That is, authors should not use "CBEFoo::BarEE>". Note, however, that formatting codes and ZE>'s can occur in any and all parts of an LE...> (i.e., in I, I

, I, and I). Authors must not nest LE...> codes. For example, "LEThe LEFoo::Bar> man page>" should be treated as an error. =item * Note that Pod authors may use formatting codes inside the "text" part of "LEtext|name>" (and so on for LEtext|/"sec">). In other words, this is valid: Go read L|perlvar/"$."> Some output formats that do allow rendering "LE...>" codes as hypertext, might not allow the link-text to be formatted; in that case, formatters will have to just ignore that formatting. =item * At time of writing, CnameE> values are of two types: either the name of a Pod page like CFoo::BarE> (which might be a real Perl module or program in an @INC / PATH directory, or a .pod file in those places); or the name of a UNIX man page, like Ccrontab(5)E>. In theory, CchmodE> in ambiguous between a Pod page called "chmod", or the Unix man page "chmod" (in whatever man-section). However, the presence of a string in parens, as in "crontab(5)", is sufficient to signal that what is being discussed is not a Pod page, and so is presumably a UNIX man page. The distinction is of no importance to many Pod processors, but some processors that render to hypertext formats may need to distinguish them in order to know how to render a given CfooE> code. =item * Previous versions of perlpod allowed for a CsectionE> syntax (as in CObject AttributesE>), which was not easily distinguishable from CnameE> syntax and for C"section"E> which was only slightly less ambiguous. This syntax is no longer in the specification, and has been replaced by the C/sectionE> syntax (where the slash was formerly optional). Pod parsers should tolerate the C"section"E> syntax, for a while at least. The suggested heuristic for distinguishing CsectionE> from CnameE> is that if it contains any whitespace, it's a I

. Pod processors should warn about this being deprecated syntax. =back =head1 About =over...=back Regions "=over"..."=back" regions are used for various kinds of list-like structures. (I use the term "region" here simply as a collective term for everything from the "=over" to the matching "=back".) =over =item * The non-zero numeric I in "=over I" ... "=back" is used for giving the formatter a clue as to how many "spaces" (ems, or roughly equivalent units) it should tab over, although many formatters will have to convert this to an absolute measurement that may not exactly match with the size of spaces (or M's) in the document's base font. Other formatters may have to completely ignore the number. The lack of any explicit I parameter is equivalent to an I value of 4. Pod processors may complain if I is present but is not a positive number matching C. =item * Authors of Pod formatters are reminded that "=over" ... "=back" may map to several different constructs in your output format. For example, in converting Pod to (X)HTML, it can map to any of

...,

..., or

...

. Similarly, "=item" can map to

. =item * Each "=over" ... "=back" region should be one of the following: =over =item * An "=over" ... "=back" region containing only "=item *" commands, each followed by some number of ordinary/verbatim paragraphs, other nested "=over" ... "=back" regions, "=for..." paragraphs, and "=begin"..."=end" regions. (Pod processors must tolerate a bare "=item" as if it were "=item *".) Whether "*" is rendered as a literal asterisk, an "o", or as some kind of real bullet character, is left up to the Pod formatter, and may depend on the level of nesting. =item * An "=over" ... "=back" region containing only C paragraphs, each one (or each group of them) followed by some number of ordinary/verbatim paragraphs, other nested "=over" ... "=back" regions, "=for..." paragraphs, and/or "=begin"..."=end" codes. Note that the numbers must start at 1 in each section, and must proceed in order and without skipping numbers. (Pod processors must tolerate lines like "=item 1" as if they were "=item 1.", with the period.) =item * An "=over" ... "=back" region containing only "=item [text]" commands, each one (or each group of them) followed by some number of ordinary/verbatim paragraphs, other nested "=over" ... "=back" regions, or "=for..." paragraphs, and "=begin"..."=end" regions. The "=item [text]" paragraph should not match C or C, nor should it match just C. =item * An "=over" ... "=back" region containing no "=item" paragraphs at all, and containing only some number of ordinary/verbatim paragraphs, and possibly also some nested "=over" ... "=back" regions, "=for..." paragraphs, and "=begin"..."=end" regions. Such an itemless "=over" ... "=back" region in Pod is equivalent in meaning to a "

...

" element in HTML. =back Note that with all the above cases, you can determine which type of "=over" ... "=back" you have, by examining the first (non-"=cut", non-"=pod") Pod paragraph after the "=over" command. =item * Pod formatters I tolerate arbitrarily large amounts of text in the "=item I" paragraph. In practice, most such paragraphs are short, as in: =item For cutting off our trade with all parts of the world But they may be arbitrarily long: =item For transporting us beyond seas to be tried for pretended offenses =item He is at this time transporting large armies of foreign mercenaries to complete the works of death, desolation and tyranny, already begun with circumstances of cruelty and perfidy scarcely paralleled in the most barbarous ages, and totally unworthy the head of a civilized nation. =item * Pod processors should tolerate "=item *" / "=item I" commands with no accompanying paragraph. The middle item is an example: =over =item 1 Pick up dry cleaning. =item 2 =item 3 Stop by the store. Get Abba Zabas, Stoli, and cheap lawn chairs. =back =item * No "=over" ... "=back" region can contain headings. Processors may treat such a heading as an error. =item * Note that an "=over" ... "=back" region should have some content. That is, authors should not have an empty region like this: =over =back Pod processors seeing such a contentless "=over" ... "=back" region, may ignore it, or may report it as an error. =item * Processors must tolerate an "=over" list that goes off the end of the document (i.e., which has no matching "=back"), but they may warn about such a list. =item * Authors of Pod formatters should note that this construct: =item Neque =item Porro =item Quisquam Est Qui dolorem ipsum quia dolor sit amet, consectetur, adipisci velit, sed quia non numquam eius modi tempora incidunt ut labore et dolore magnam aliquam quaerat voluptatem. =item Ut Enim is semantically ambiguous, in a way that makes formatting decisions a bit difficult. On the one hand, it could be mention of an item "Neque", mention of another item "Porro", and mention of another item "Quisquam Est", with just the last one requiring the explanatory paragraph "Qui dolorem ipsum quia dolor..."; and then an item "Ut Enim". In that case, you'd want to format it like so: Neque Porro Quisquam Est Qui dolorem ipsum quia dolor sit amet, consectetur, adipisci velit, sed quia non numquam eius modi tempora incidunt ut labore et dolore magnam aliquam quaerat voluptatem. Ut Enim But it could equally well be a discussion of three (related or equivalent) items, "Neque", "Porro", and "Quisquam Est", followed by a paragraph explaining them all, and then a new item "Ut Enim". In that case, you'd probably want to format it like so: Neque Porro Quisquam Est Qui dolorem ipsum quia dolor sit amet, consectetur, adipisci velit, sed quia non numquam eius modi tempora incidunt ut labore et dolore magnam aliquam quaerat voluptatem. Ut Enim But (for the foreseeable future), Pod does not provide any way for Pod authors to distinguish which grouping is meant by the above "=item"-cluster structure. So formatters should format it like so: Neque Porro Quisquam Est Qui dolorem ipsum quia dolor sit amet, consectetur, adipisci velit, sed quia non numquam eius modi tempora incidunt ut labore et dolore magnam aliquam quaerat voluptatem. Ut Enim That is, there should be (at least roughly) equal spacing between items as between paragraphs (although that spacing may well be less than the full height of a line of text). This leaves it to the reader to use (con)textual cues to figure out whether the "Qui dolorem ipsum..." paragraph applies to the "Quisquam Est" item or to all three items "Neque", "Porro", and "Quisquam Est". While not an ideal situation, this is preferable to providing formatting cues that may be actually contrary to the author's intent. =back =head1 About Data Paragraphs and "=begin/=end" Regions Data paragraphs are typically used for inlining non-Pod data that is to be used (typically passed through) when rendering the document to a specific format: =begin rtf \par{\pard\qr\sa4500{\i Printed\~\chdate\~\chtime}\par} =end rtf The exact same effect could, incidentally, be achieved with a single "=for" paragraph: =for rtf \par{\pard\qr\sa4500{\i Printed\~\chdate\~\chtime}\par} (Although that is not formally a data paragraph, it has the same meaning as one, and Pod parsers may parse it as one.) Another example of a data paragraph: =begin html I like PIE!

Especially pecan pie! =end html If these were ordinary paragraphs, the Pod parser would try to expand the "EE/em>" (in the first paragraph) as a formatting code, just like "EElt>" or "EEeacute>". But since this is in a "=begin I"..."=end I" region I the identifier "html" doesn't begin have a ":" prefix, the contents of this region are stored as data paragraphs, instead of being processed as ordinary paragraphs (or if they began with a spaces and/or tabs, as verbatim paragraphs). As a further example: At time of writing, no "biblio" identifier is supported, but suppose some processor were written to recognize it as a way of (say) denoting a bibliographic reference (necessarily containing formatting codes in ordinary paragraphs). The fact that "biblio" paragraphs were meant for ordinary processing would be indicated by prefacing each "biblio" identifier with a colon: =begin :biblio Wirth, Niklaus. 1976. I Prentice-Hall, Englewood Cliffs, NJ. =end :biblio This would signal to the parser that paragraphs in this begin...end region are subject to normal handling as ordinary/verbatim paragraphs (while still tagged as meant only for processors that understand the "biblio" identifier). The same effect could be had with: =for :biblio Wirth, Niklaus. 1976. I Prentice-Hall, Englewood Cliffs, NJ. The ":" on these identifiers means simply "process this stuff normally, even though the result will be for some special target". I suggest that parser APIs report "biblio" as the target identifier, but also report that it had a ":" prefix. (And similarly, with the above "html", report "html" as the target identifier, and note the I of a ":" prefix.) Note that a "=begin I"..."=end I" region where I begins with a colon, I contain commands. For example: =begin :biblio Wirth's classic is available in several editions, including: =for comment hm, check abebooks.com for how much used copies cost. =over =item Wirth, Niklaus. 1975. I Teubner, Stuttgart. [Yes, it's in German.] =item Wirth, Niklaus. 1976. I Prentice-Hall, Englewood Cliffs, NJ. =back =end :biblio Note, however, a "=begin I"..."=end I" region where I does I begin with a colon, should not directly contain "=head1" ... "=head4" commands, nor "=over", nor "=back", nor "=item". For example, this may be considered invalid: =begin somedata This is a data paragraph. =head1 Don't do this! This is a data paragraph too. =end somedata A Pod processor may signal that the above (specifically the "=head1" paragraph) is an error. Note, however, that the following should I be treated as an error: =begin somedata This is a data paragraph. =cut # Yup, this isn't Pod anymore. sub excl { (rand() > .5) ? "hoo!" : "hah!" } =pod This is a data paragraph too. =end somedata And this too is valid: =begin someformat This is a data paragraph. And this is a data paragraph. =begin someotherformat This is a data paragraph too. And this is a data paragraph too. =begin :yetanotherformat =head2 This is a command paragraph! This is an ordinary paragraph! And this is a verbatim paragraph! =end :yetanotherformat =end someotherformat Another data paragraph! =end someformat The contents of the above "=begin :yetanotherformat" ... "=end :yetanotherformat" region I data paragraphs, because the immediately containing region's identifier (":yetanotherformat") begins with a colon. In practice, most regions that contain data paragraphs will contain I data paragraphs; however, the above nesting is syntactically valid as Pod, even if it is rare. However, the handlers for some formats, like "html", will accept only data paragraphs, not nested regions; and they may complain if they see (targeted for them) nested regions, or commands, other than "=end", "=pod", and "=cut". Also consider this valid structure: =begin :biblio Wirth's classic is available in several editions, including: =over =item Wirth, Niklaus. 1975. I Teubner, Stuttgart. [Yes, it's in German.] =item Wirth, Niklaus. 1976. I Prentice-Hall, Englewood Cliffs, NJ. =back Buy buy buy! =begin html

=end html Now now now! =end :biblio There, the "=begin html"..."=end html" region is nested inside the larger "=begin :biblio"..."=end :biblio" region. Note that the content of the "=begin html"..."=end html" region is data paragraph(s), because the immediately containing region's identifier ("html") I begin with a colon. Pod parsers, when processing a series of data paragraphs one after another (within a single region), should consider them to be one large data paragraph that happens to contain blank lines. So the content of the above "=begin html"..."=end html" I be stored as two data paragraphs (one consisting of "

\n" and another consisting of "

\n"), but I be stored as a single data paragraph (consisting of "

\n\n

\n"). Pod processors should tolerate empty "=begin I"..."=end I" regions, empty "=begin :I"..."=end :I" regions, and contentless "=for I" and "=for :I" paragraphs. I.e., these should be tolerated: =for html =begin html =end html =begin :biblio =end :biblio Incidentally, note that there's no easy way to express a data paragraph starting with something that looks like a command. Consider: =begin stuff =shazbot =end stuff There, "=shazbot" will be parsed as a Pod command "shazbot", not as a data paragraph "=shazbot\n". However, you can express a data paragraph consisting of "=shazbot\n" using this code: =for stuff =shazbot The situation where this is necessary, is presumably quite rare. Note that =end commands must match the currently open =begin command. That is, they must properly nest. For example, this is valid: =begin outer X =begin inner Y =end inner Z =end outer while this is invalid: =begin outer X =begin inner Y =end outer Z =end inner This latter is improper because when the "=end outer" command is seen, the currently open region has the formatname "inner", not "outer". (It just happens that "outer" is the format name of a higher-up region.) This is an error. Processors must by default report this as an error, and may halt processing the document containing that error. A corollary of this is that regions cannot "overlap" -- i.e., the latter block above does not represent a region called "outer" which contains X and Y, overlapping a region called "inner" which contains Y and Z. But because it is invalid (as all apparently overlapping regions would be), it doesn't represent that, or anything at all. Similarly, this is invalid: =begin thing =end hting This is an error because the region is opened by "thing", and the "=end" tries to close "hting" [sic]. This is also invalid: =begin thing =end This is invalid because every "=end" command must have a formatname parameter. =head1 SEE ALSO L, L, L =head1 AUTHOR Sean M. Burke =cut

About the -M Operator

About the `-M` Operator