PDF-files make internal use of glyph names. For example, the name of ≈ (U+2248; TeX \approx) appearing in a PDF-file might be approxequal.
One can find such names in a TeX-generated PDF-file by
- compiling the TeX code with
\pdfcompresslevel=0, - inspecting the resulting PDF-file as a text file, and
- looking for lines starting with
/CharSet.
(information taken from Ulrike Fischer's answer elsewhere, which provides more information).
Apparently the glyph names are font-dependent. So they are determined by the fonts? Do all font formats use such names? Which font formats use textual names? Do all glyphs in all PDF-files have such names?
How are the glyph names in PDF-files determined? Who determined the existing ones? What are they for? (Why doesn't PDF refer to the glyphs by number? Clearly some readers are relying on the glyph names (see link to question about hyperlink detection below), so the PDF format or some readers make some assumptions about these names. There must be a reason about why an intermediary of names is used. Perhaps this has to do with the age of Unicode in relation to PDF.) What else is there to know on this topic for a user of (La)TeX?
For me, the issue of PDF glyph names came up here:
- Manipulating the Unicode codepoints of glyphs in the resulting PDF-file requires knowledge of the glyph names. Notably,
glyphtounicode.texmaps from glyph names to Unicode codepoints, with lines such as\pdfglyphtounicode{approximatelyequal}{2245}: How to fix missing or incorrect mappings from glyphtounicode.tex - At least one PDF reader uses glyph names for a heuristic for HTTP URL detection: \input{glyphtounicode} with \pdfgentounicode=1 creates unwanted hyperlinks from link-like text
A similar question is How to find the proper glyph name required by \pdfglyphtounicode, but there is more ground that needs to be covered in this topic.