[Fontconfig] DTD

Peter Flynn peter at silmaril.ie
Sun Apr 1 14:01:15 UTC 2018


On 30/03/18 23:04, Lawrence D'Oliveiro wrote:
> On Fri, 30 Mar 2018 18:41:34 +0000, Jerry Casiano wrote:
> 
>> I would consider [the lack of a DTD] a bug if I used Debian.
> 
> Does anybody still use DTDs for anything? 

Yes, extensively. See discussions ad perpetuam in the markup field (eg 
Balisage :-)

> They may have made sense with SGML, but HTML, for example, does so
> much more with CSS instead.
CSS isn't connected in any way with using a DTD. A DTD is like a Schema: 
it provides a machine-readable framework for an editor to use to ensure 
that only valid element types are used, and only in valid places. CSS is 
used for styling output (eg HTML), and does not affect document 
construction in any way.

> And as for XML, it seems to me a DTD is neither necessary nor
> sufficient to capture the semantics of most real-world applications.
It depends what your real-world applications are.

If you are using XML to represent rectangular data (eg a config file, 
spreadsheet, or database table) then a W3C Schema with its datatypes is 
probably more useful. This is known as "data"-XML.

In the publishing industry, however, where they deal with continuous 
text, not rectangular data, DTDs are terser and simpler and better 
supported by the toolchain (and data types are not useful or meaningful 
for continuous text). This is known as "document"-XML.

In either case, hardly anyone nowadays actually codes a DTD or Schema by 
hand; most are machine-generated from a better and more comprehensive 
language like Relax-NG or ODD.

> Fontconfig being a case in point: half of fonts.dtd is comments
> explaining what the constructs actually mean. 

DTDs have no comment declaration syntax of their own, so using XML 
comment declarations is the only solution. But comments are good: 
uncommented code is usually regarded as A Bad Thing.

> And much of the rest is
> essentially content-free syntax like
>      <!ELEMENT dir (#PCDATA)>

This reflects the relatively light weight of markup needed to describe 
the application data. It's basically a set of containers, so all it 
needs is the names: the content is just strings. I must admit I would 
have written it differently, but that's just a matter of coding style :-)

I did find a much older version that was even simpler...

> And things like
>      <!ELEMENT plus (%expr;)*>

I was surprised when I saw this. It appears that someone decided to try 
and store code that could be extracted and executed in some way. Not 
that there's anything technically wrong with it; just a little unusual.

> convey that these operators are recognized and can take multiple
> operands, but nothing more.

That's exactly it. A DTD is just a spec. Its principal use is to 
guarantee that the files which claim to conform to it really do 
(validation). The utilities to do this are easily available, eg onsgmls 
(part of OpenSP) or rxp (from the LTxml2 toolkit at Edinburgh University).

///Peter



More information about the Fontconfig mailing list