PDF ExplainedApril 2, 20264 min read

What Is Font Subsetting in PDF?

Font subsetting embeds only the glyphs used in a PDF, reducing file size by 70-95% compared to full font embedding. Learn how it works and when to use it.

Font subsetting is the process of embedding only the glyphs (individual characters) from a font that are actually used in the PDF document, rather than the entire font program. If your document uses only the letters A-Z, a-z, and digits 0-9 from a font that contains 10,000 glyphs (Latin + Greek + Cyrillic + CJK), subsetting embeds only those ~80 glyphs — reducing the font data from potentially several MB to a few KB.

How Subsetting Works

During PDF export, the application scans the document for all character codes used with each font. It then extracts only the glyph outlines, widths, and encoding information for those characters and packages them into a subset font program embedded in the PDF. The subset font gets a modified name with a 6-character random prefix (e.g., "ABCDEF+Helvetica") to distinguish it from the full font and signal that it's a subset. PDF viewers handle subset fonts identically to full fonts for rendering — the name difference is just a signal to tools.

File Size Impact

The impact of subsetting varies by font type and document content. For a typical business document using one or two Latin fonts, subsetting might reduce font data from 200-500 KB (full embedding) to 10-30 KB (subset). For documents using CJK fonts, the impact is dramatic: a full Chinese font program can be 10-20 MB; subsetting a document that uses 200 unique Chinese characters brings it down to under 1 MB. Font data often represents 20-60% of a PDF's total size, making subsetting one of the most effective size-reduction techniques.

Limitation: Post-Editing

The key limitation of font subsetting: if you try to edit the PDF and add new text using the embedded font, characters not in the original document aren't available. If you open a subset-embedded PDF in Acrobat and type a new letter that wasn't in the subset, Acrobat substitutes from the system font, potentially causing a font rendering mismatch. For final distribution PDFs, this is usually fine. For working documents that will be edited, use full font embedding.

Identifying Subset Fonts

In Adobe Acrobat's Font Properties panel (File → Properties → Fonts), subset fonts are labeled "Embedded Subset" rather than just "Embedded." In the raw PDF, subset font names appear with the 6-character uppercase prefix before the plus sign: /BCDFHI+TimesNewRoman. The random prefix is different in every PDF so font analysis tools can identify that fonts in two different documents are both subsets of the same typeface.

Try Compress PDF Now — Free

Browser-based, private, and instant. No account or software required.

Open Compress PDF
Report Bug
Send Feedback
Feature Request