1
0
mirror of https://github.com/postgres/postgres.git synced 2025-10-25 13:17:41 +03:00
Files
postgres/doc/src/sgml
Andrew Dunstan b292256272 Improve docs syntax checking
Move the checks out of the Makefile into a perl script that can be
called from both the Makefile and meson.build. The set of files checked
is simplified, so it is just all the sgml and xsl files found in
docs/src/sgml directory tree.

Along the way make some adjustments to .cirrus.tasks.yml to support this
better in CI.

Also ensure that the checks are part of the Makefile's html target.

Author: Nazir Bilal Yavuz <byavuz81@gmail.com>
Co-Author: Andrew Dunstan <andrew@dunslane.net>

Discussion: https://postgr.es/m/CAN55FZ3BnM+0twT-ZWL8As9oBEte_b+SBU==cz6Hk8JUCM_5Wg@mail.gmail.com
2025-10-02 10:26:32 -04:00
..
2025-09-29 16:55:17 -04:00
2025-07-01 10:19:52 +02:00
2025-07-01 10:19:52 +02:00
2025-04-04 12:27:04 -04:00
2025-04-07 00:03:18 +02:00
2025-03-25 21:40:00 +13:00
2025-10-02 10:26:32 -04:00
2025-10-02 10:26:32 -04:00
2023-10-25 09:40:55 +09:00
2025-07-30 09:51:45 +02:00
2025-09-29 16:55:17 -04:00
2025-07-17 00:21:18 +02:00
2025-06-29 22:28:10 -04:00
2025-06-29 22:28:10 -04:00
2024-03-17 13:58:59 +00:00
2025-09-06 07:49:51 +09:00
2025-07-01 10:19:52 +02:00
2023-11-06 15:18:04 +01:00
2025-01-01 11:21:55 -05:00
2025-06-16 11:43:52 +02:00
2025-02-26 09:49:59 +09:00

<!-- doc/src/sgml/README.non-ASCII -->

Representation of non-ASCII characters
--------------------------------------

Find non-ASCII characters using:

        grep --recursive --color='auto' -P '[\x80-\xFF]' .

Convert to HTML4 named entity (&) escapes
-----------------------------------------

We support several output formats:

*  html (supports all Unicode characters)
*  man (supports all Unicode characters)
*  pdf (supports only Latin-1 characters)
*  info

While some output formatting tools support all Unicode characters,
others only support Latin-1 characters.  Specifically, the PDF rendering
engine can only display Latin-1 characters;  non-Latin-1 Unicode
characters are displayed as "###".

Therefore, in the SGML files, we can only use Latin-1 characters.  We
can use UTF8 representations of Latin-1 characters, or HTML entities of
Latin-1 characters, e.g., &Aacute;lvaro.

Do not use UTF numeric character escapes (&#nnn;).

When building the PDF docs, problem characters will appear as warnings.

HTML entities
        official:      http://www.w3.org/TR/html4/sgml/entities.html
        one page:      http://www.zipcon.net/~swhite/docs/computers/browsers/entities_page.html
        other lists:   http://www.zipcon.net/~swhite/docs/computers/browsers/entities.html
                       http://www.zipcon.net/~swhite/docs/computers/browsers/entities_page.html
                       https://en.wikipedia.org/wiki/List_of_XML_and_HTML_character_entity_references