[Fontconfig] Aliases

Fri Apr 2 18:25:15 EST 2004

Sorry for the tone of my previous mail.  I hope my tone this time
is better and what I write below makes sense.

On Fri, Apr 02, 2004 at 12:52:10AM -0500, John A. Boyd Jr. wrote:
> 
> - "Times", "Timmons" - symbolic, not semantic/descriptive
> - "Serif", "Sans", "Monospace" - semantic/descriptive

How about names like "Baskerville" or "Caslon", etc. which are
in the public domain and somewhat cross the boundary between
symbolic and semantic?  I agree with you that they ultimately
are shorthands for lots of descriptive attributes; however,
I would say that it should not be entirely dismissed that
certain names can legitimately / usefully serve as abstractions
themselves.

> How about "non-Latin" for starters?  Then, say, "Hebrew",
> "Greek", ... (These are character set descriptions, I'm
> imagining, from my limited knowledge of fonts.)

I think fontconfig already knows something about this.

What I was trying to say was, e.g., the Kai (CJK) style being
classed by default as "sans serif" by stock fontconfig and as
"Kai" in Debian. According to limited understanding of how
Latin fonts are categorized, both classifications are wrong; it
should properly be classed as italic even though it would not
match well with Latin italic types (because of different italic
angles).  (And naturally, the panose data are wrong about the
correct classification.)  Continuing with this particular case,
what do we do with existing classification schemes, and how
far do we go in making sure something is correctly classified
using existing criteria (in this case the correct definition
of "italic"); and how do we handle unexpected cases where two
different styles *ought* to be classed identically but are
incompatible?

Or perhaps let us consider the "Kai" style from a more realistic
point of view: In a normal fontconfig installation, when the
user asks for a "sans serif" Chinese font, the system is likely
to deliver a Kai font because of the simple reason that there is
no free Chinese sans serif type in existence.  But in this case
a typical user -- provided that he/she knows what "sans serif"
means -- would be able to tell that this is wrong, if not only
for the reason that Kai types have (round, sometimes subtle)
serifs.

> Let's try another example.  Let's say we want to describe
> font width - I like that example...  How about "condensed,
> expanded," and the like.  I'll bet you can find these in
> existing fonts as well.

Yes. But let us pause for a moment and say the same thing
about "normal", "medium", and "regular".  These certainly sound
identical, mean the same thing to most people, and might even
mean the same thing for lots of fonts.  But there are also lots
of commercial fonts where they represent different weights.

And there are commercial fonts who describe themselves as
"normal" or "regular" (say) even though they are bold (say).
We should be aware of such irregularities.

> I would suggest, Ambrose, that you're thinking less like a
> user than like a system developer.
> 
> In that regsrd, let me just ask: how does (or might) a _user_
> discriminate fonts at the level of panose information?

I'd say there are different kinds of users.

There are users who don't know what sans serif is (or what
a serif is), the same users literally would call all any old
sans serif font "Arial" (and not know what "Helvetica" is) and
all serif fonts "Times New Roman"; and I personally do know a
number of such users.  To these users, even serif and sans serif
are meaningless descriptions.

At the other extreme, there are users who know what "FF Meta"
and "Myriad MM" are, and some who can tell the difference
between Helvetica and Arial.  These users (those who have
studied typography and/or do typographic work day to day) would
at least have a handle on what the panose data mean.

Of course, the two kinds of users would discriminate fonts at
the level of panose information differently.  The "clueless
users" might be able to only distinguish between panose "family"
and weight, the "professional users" will be able to distinguish
most of the panose attributes, and the "academic" might be able
to distinguish all.  The "professionals" and the "academics"
certainly would discriminate fonts based on more than what the
panose data would describe (e.g., by the "feel" of the font, the
"historical period", etc.), especially for some of the panose
attributes which are not helpfully descriptive (e.g., when
Family is Script or Decorative); however, their ability to
distinguish might be subconcious.

Under normal use, I might speculate that perhaps a quarter to a
half of the panose data are useful, and perhaps several
additional, non-panose descriptions will be required to
sufficiently describe a font.  Certainly, to the user, the
panose data are not sufficient to meaningfully describe fonts.

I would assume that the typical user you have in mind is
somewhere between the two extremes.  However, at least one
extreme cases is not negligible; "clueless users" do exist, and
GNU/Linux becomes more and more generally usable as a desktop
system it could attract more and more "professional users". We
can reasonably ignore the "clueless" ones, but what I call the
"professional" users might describe type in a way you would call
"like a system developer".

If what we are going to do is to improve on the way fontconfig
is to do matching, by using more descriptive attributes, I have
no objections; this can only benefit typical and professional
users alike.

> Aliases have the properties of abstractions if they are also
> abstractions, but again, they are then only a special case.
> But in general, aliases are not also abstractions.  In general,
> an abstraction can be used wherever an alias can, but the
> opposite is not true at all.

I would tend to be a bit reserved about this statement. Unicode
(at least the CJK unification part) is supposed to be based on
the same principle, but in practice it does not deliver even
though this principle should be theoretically sound.

I might speculate that this property would break down for script
and decorative types, esp. the latter.

Descriptive attributes may change in time too.  Helvetica at one
time was perceived as "authoritative"; today perhaps "default"
or "plain" might describe it more appropriately.

> To your second point - no; there's a legal principle called
> the doctrine of fair use that applies to certain classes of
> use of intellectual property.  Teaching, journalism, and
> research are among those uses exempted under the fair use
> doctrine, as I understand it.

I am not as optimistic.  Judging from how the recording, movie,
and commercial software industries, and our legal system are
acting, I would say that fair use will disappear in the future,
perhaps the near future, or perhaps it already disappeared
behind our backs.

I certainly hope that I am just being overly pessimistic.