APL Unicode Font – Extended
A New Release
A new version of SimPL.ttf is now available from the Vector website.
[Ed: Phil has sent an even newer version as of 29th January – about 202K download.]
The original article appeared in Vector 16.1.
What’s Changed
The objective remains the same: to provide the full set of APL glyphs, plus enough characters to display text in all modern European languages, and anything else that looks like it might be useful. The principal differences from the earlier versions of the font are those required to bring it in line with Unicode version 3.0, plus some of the elements of the proposed extensions to Unicode version 3.2. First among these is that the APL Quad now has its own codepoint at U+2395, which makes uniformity of appearance much easier to achieve in a large font.The domain names [these are shown here in bold, but are actually “fancy” characters from the Unicode set. Ed] H, N, Z, Q, R, C and P are already part of the Unicode standard (although H and P are not given any semantics), so it seems sensible to continue to use double-struck characters for domain names. A double-struck zero, Ø, is such an obvious choice for the empty domain, that it has been retained but moved from the “empty set” codepoint (U+2205) to the Private Use Area, where it keeps company with 2, D, F, S, U and some other characters whose semantics we have yet to define.
The move is necessary because the Unicode Consortium have mandated (or, more exactly, appear to be about to mandate) that the empty set shall be represented by the circle-slash. This is a difficult area for Unicode, whose stated objective is to concentrate on semantics, not appearance. In practice, mathematic symbology works the other way, appropriating whatever symbols are available: attaching semantics to these symbols is a second, separate, stage. Maybe the Unicode consortium should just define the appearance, and leave the rest of us to interpret them as we choose? (Also, what do they mean by “slash”? Most slashes are inclined at an angle of about 24º, whereas every “empty set” symbol I have seen uses a diagonal slash, at an angle of 45º. And if I think the result looks ugly, and totally out of sympathy with the surrounding notation, that is best dismissed as just a prejudiced personal reaction.)What’s New
Glyphs (some better than others) are provided for all the Latin and Latin-based characters in Unicode 3.0 – that is the whole of Latin 1, Latin Extended A, Latin Extended B, and Latin Extended Additional. The resultant character set covers not only the modern European languages, as per our objective, but also covers many African, North American and Asian languages as well.- Bearing in mind that this is intended as a programmer’s font, an attempt has been made to better distinguish upper-case letter “O” from the digit zero, and lower-case letter “l” from the digit “1”.
- The double accents used in Vietnamese are not well done, particularly the circumflex-tilde combination on uppercase letters. This is the result of a wrong decision early in the design stage, and backing out of that mistake is going to be a long job, so the font is being released as is, warts and all.
- The presence of the long-tailed “r” and the addition of the Tironian Et, mean that most recent renderings of Irish are now possible, provided you can accept the “gaelic g” as simply an alternative glyph for the lower-case Latin letter “g”.
- The Latin character set may not yet be complete: I believe proposals for Sami1 are still outstanding, and these will be included in the font when they have been included in the Unicode standard.
- The characters necessary for Romanised Sanskrit and Pin-Yin Chinese are now all available within Unicode 3.0, and so some of the stuff previously in the Private Use Area has been shifted out.
There are a few additional characters in the Greek area – nothing major.
Coverage of Cyrillic is much improved. Coverage of Church Slavonic is incomplete, but (I believe that) all the glyphs currently in non-liturgical use are provided, giving 100% coverage of those languages formerly known as “Soviet minority languages”, with the exception of Komi2.
Unicode never promised to include pre-composed forms for all accented characters, so that they cannot really be criticised for most of the omissions listed3 at
http://www.eki.ee/letter/chardata.cgi?ucode=e000-f8ff
but some of these characters cannot be formed from the existing repertoire of base characters and non-spacing modifiers, so I have included the lot. This should give complete coverage for Tagalog4 and mediaeval Luxemburgish, for those who need it. If not, let me know what is missing.
The Unicode 3.2 proposals include extended (broken, multi-line) parentheses, brackets and braces in one of the Mathematical Symbols areas. Their proposals for multi-line braces look unworkable, so that the characters previously included in SImPL.ttf remain in the Private Use Area. Whether these characters connect vertically appears to vary from printer to printer (or does it vary from driver to driver?).
Using the Font
Apart from that unfortunate exception, the font appears to print OK at 12pt, 24pt and 36 pt, on lasers and inkjets.
The font is still based on the Adobe standard of an M-square of 1000 units. This has caused problems in the past, with early versions of Word 2000, but no problems have been encountered so far with either Word 2000 or Word 97, even though Microsoft still “strongly recommend” an M-square of 2048 units.
By maintaining one version of the font for development, and one for font generation, it has been possible to produce a font with a better screen display, which is particularly noticeable in those areas where NT’s screen rendering has always been a bit off-hand: in Font Preview, CharMap and Insert/Symbol. (Hinting is still done automatically.)
There are 101 small things to clear up, mostly in the area of Extended Cyrillic. As these things improve, they will be made available via the Vector website, but new releases will not be announced, unless something major changes.
Footnotes
Copyright © Phil Chastney 2000