How to find out which unicode codepoints are defined in a TTF file?

7,342

Solution 1

I found a python library, fonttools (pypi) that can be used to do it with a bit of python scripting.

Here is a simple script that lists all fonts that have specified glyph:

#!/usr/bin/env python3

from fontTools.ttLib import TTFont
import sys

char = int(sys.argv[1], base=0)

print("Looking for U+%X (%c)" % (char, chr(char)))

for arg in sys.argv[2:]:
    try:
        font = TTFont(arg)

        for cmap in font['cmap'].tables:
            if cmap.isUnicode():
                if char in cmap.cmap:
                    print("Found in", arg)
                    break
    except Exception as e:
        print("Failed to read", arg)
        print(e)

First argument is codepoint (decimal or hexa with 0x) and the rest is font files to look in.

I didn't bother trying to make it work for .ttc files (it requires some extra parameter somewhere).

Note: I first tried the otfinfo tool, but I only got basic multilingual plane characters (<= U+FFFF). The python script finds extended plane characters OK.

Solution 2

otfinfo looks promising:

-u, --unicode
  Print each Unicode code point supported by the font, followed by
  the glyph number representing that code point (and, if present,
  the name of the corresponding glyph).

For example DejaVuSans-Bold knows about the fl ligature(fl):

$ otfinfo -u /usr/share/fonts/TTF/DejaVuSans-Bold.ttf |grep ^uniFB02
uniFB02 4899 fl
Share:
7,342

Related videos on Youtube

Sanuuu
Author by

Sanuuu

Updated on September 18, 2022

Comments

  • Sanuuu
    Sanuuu almost 2 years

    I need to automate a process of verification which Unicode characters have actual glyphs defined for them in a True Type Font file. How do I go around doing that? I can't seem to find information on how to make sense of the numbers I seem to be getting when I open a .ttf file in a text editor.

  • Sanuuu
    Sanuuu over 8 years
    This tool is exactly what I need but it also doesn't seem to work with TrueType fonts, only OpenType ones.
  • michas
    michas over 8 years
    It works with ttf, too. See my example above. (According to wikipedia ttf is a special type of OpenType font.)
  • Sanuuu
    Sanuuu over 8 years
    Hmm... my version of otfinfo (2.92) doesn't seem to have the -u option at all. Which version are you using?
  • michas
    michas over 8 years
    I used "otfinfo (LCDF typetools) 2.104" from my texlive package.
  • Jan Hudec
    Jan Hudec over 8 years
    @Sanuuu, the -u option does not appear in --help, but still seems to exist. However (at least in Debian 2.105 build) it seems to only list basic plane (up to U+FFFF). The -g option knows about the extended planes, but that does not work for all fonts.
  • ccpizza
    ccpizza almost 4 years
    fyi: on mac otfinfo can be installed with brew install lcdf-typetools