Parse a Font

Parsing TrueType (TTF) Font Files — A Reference

A practical, implementation-agnostic guide to reading the TTF binary format. It is organized around the act of parsing: the spine of the document is 5. How to parse, which walks the file in order and links each step to that table’s byte layout in 6. Table reference. Read the parse flow top to bottom; click into a table’s layout only when you need the exact fields.

Type	Bytes	Meaning
`uint8`	1	unsigned byte
`int8`	1	signed byte
`uint16`	2	unsigned short
`int16`	2	signed short
`uint24`	3	unsigned 24-bit
`uint32`	4	unsigned long
`int32`	4	signed long
`Fixed`	4	16.16 signed fixed-point
`FWORD`	2	`int16` in font design units
`UFWORD`	2	`uint16` in font design units
`F2Dot14`	2	2.14 signed fixed-point (scales/variations)
`LONGDATETIME`	8	signed `int64`, seconds since 1904-01-01 00:00 UTC
`Tag`	4	four `uint8` ASCII characters, e.g. `glyf`
`Offset16`	2	`uint16` offset
`Offset32`	4	`uint32` offset
`Version16Dot16`	4	packed major/minor version

Type	Name	Notes
`uint32`	sfntVersion	`0x00010000` = TrueType outlines; `0x4F54544F` (`OTTO`) = CFF/PostScript; `0x74727565` (`true`) on Apple. Your “is this a TTF?” check.
`uint16`	numTables	number of table records that follow
`uint16`	searchRange	`(largest power of 2 ≤ numTables) × 16`
`uint16`	entrySelector	`log2(largest power of 2 ≤ numTables)`
`uint16`	rangeShift	`numTables × 16 − searchRange`

Type	Name	Notes
`Tag`	tableTag	4 ASCII bytes, e.g. `head`, `glyf`
`uint32`	checksum	table checksum
`Offset32`	offset	from the beginning of the file
`uint32`	length	the table’s actual length, excluding pad bytes

Table	Class	Why
Offset table	fixed	12 bytes, fixed fields
Table record	fixed	16 bytes, fixed fields
`head`	fixed	54 bytes, all fixed-width
`maxp`	fixed*	fixed per version (0.5 vs 1.0)
`hhea`	fixed	36 bytes
`OS/2`	fixed*	fixed per version (0–5)
`hmtx`	variable	array sized by `hhea.numberOfHMetrics` + trailing array
`loca`	variable	`numGlyphs + 1` entries; entry width set by `head`
`glyf`	variable	per-glyph variable length, two glyph formats
`cmap`	variable	sub-tables in several formats, offset-linked
`name`	variable	record array + string storage block
`post`	variable*	header fixed; v2.0 adds a glyph-name array
`kern`	variable	optional; multiple sub-table formats

Type	Name	Notes
`uint16`	majorVersion	1
`uint16`	minorVersion	0
`Fixed`	fontRevision	designer’s version; store raw if not interpreting
`uint32`	checksumAdjustment	whole-file checksum; see spec
`uint32`	magicNumber	always `0x5F0F3CF5` (validation hook)
`uint16`	flags	bit field
`uint16`	unitsPerEm	design grid; 16–16384, power of 2 for TrueType
`LONGDATETIME`	created	`int64`, seconds since 1904
`LONGDATETIME`	modified	`int64`
`int16`	xMin	font bounding box (signed!)
`int16`	yMin
`int16`	xMax
`int16`	yMax
`uint16`	macStyle	bit field
`uint16`	lowestRecPPEM	smallest readable size in pixels
`int16`	fontDirectionHint
`int16`	indexToLocFormat	0 = short `loca`, 1 = long `loca`
`int16`	glyphDataFormat	0

Type	Name	Notes
`uint32`	version	`0x00005000`
`uint16`	numGlyphs	the only field you need

Type	Name	Notes
`uint16`	maxPoints	max points in a non-composite glyph
`uint16`	maxContours	max contours in a non-composite glyph
`uint16`	maxCompositePoints	max points in a composite glyph
`uint16`	maxCompositeContours	max contours in a composite glyph
`uint16`	maxZones	1 = no twilight zone; 2 = twilight zone used
`uint16`	maxTwilightPoints	max points in the twilight zone
`uint16`	maxStorage	max storage area locations
`uint16`	maxFunctionDefs	max function definitions
`uint16`	maxInstructionDefs	max instruction definitions
`uint16`	maxStackElements	max stack depth
`uint16`	maxSizeOfInstructions	max byte count for glyph instructions
`uint16`	maxComponentElements	max number of components in a composite glyph
`uint16`	maxComponentDepth	max nesting depth of composites

Type	Name	Notes
`int16`	numberOfContours	≥ 0 → simple glyph; < 0 (use −1) → composite glyph
`int16`	xMin	glyph bounding box
`int16`	yMin
`int16`	xMax
`int16`	yMax

Type	Name	Notes
`uint16`	endPtsOfContours[numberOfContours]	last value + 1 = total point count
`uint16`	instructionLength	bytes of hinting that follow
`uint8`	instructions[instructionLength]	TrueType hinting bytecode (can pass through opaquely)
`uint8`	flags[…]	one logical flag per point, compressed (below)
—	xCoordinates[…]	delta-encoded, width per flags
—	yCoordinates[…]	delta-encoded, width per flags

Bit	Name	Meaning
`0x01`	ON_CURVE_POINT	point is on the curve (vs an off-curve Bézier control point)
`0x02`	X_SHORT_VECTOR	x delta is 1 byte (else 2 bytes or 0)
`0x04`	Y_SHORT_VECTOR	y delta is 1 byte
`0x08`	REPEAT_FLAG	the next byte is a repeat count; repeat this flag that many additional times
`0x10`	X_IS_SAME_OR_POSITIVE_X_SHORT_VECTOR	dual meaning (below)
`0x20`	Y_IS_SAME_OR_POSITIVE_Y_SHORT_VECTOR	dual meaning
`0x40`	OVERLAP_SIMPLE	contours may overlap
`0x80`	reserved	set to 0

Type	Name	Notes
`uint16`	majorVersion	1
`uint16`	minorVersion	0
`FWORD`	ascender	(`int16`)
`FWORD`	descender	(`int16`, usually negative)
`FWORD`	lineGap	(`int16`)
`UFWORD`	advanceWidthMax	(`uint16`)
`FWORD`	minLeftSideBearing	(`int16`)
`FWORD`	minRightSideBearing	(`int16`)
`FWORD`	xMaxExtent	(`int16`)
`int16`	caretSlopeRise	slope of the cursor (rise/run); 1 for vertical
`int16`	caretSlopeRun	0 for vertical
`int16`	caretOffset	shift for slanted highlight; 0 for non-slanted
`[4]int16`	(reserved)	set to 0
`int16`	metricDataFormat	0
`uint16`	numberOfHMetrics	sizes the `hmtx` table

Type	Name
`uint16`	advanceWidth
`int16`	lsb (left side bearing)

Type	Name	Notes
`uint16`	flags	component flags (below)
`uint16`	glyphIndex	the component glyph’s id
arg1, arg2	—	`int8`/`uint8` or `int16`/`uint16` per `ARG_1_AND_2_ARE_WORDS`; if `ARGS_ARE_XY_VALUES`, signed placement offsets, else point-matching indices
transform	—	0, 1, 2, or 4 × `F2Dot14` depending on the scale flags

Bit	Name	Meaning
`0x0001`	ARG_1_AND_2_ARE_WORDS	args are 16-bit (else 8-bit)
`0x0002`	ARGS_ARE_XY_VALUES	args are offsets (else point indices)
`0x0004`	ROUND_XY_TO_GRID
`0x0008`	WE_HAVE_A_SCALE	one `F2Dot14` uniform scale follows
`0x0020`	MORE_COMPONENTS	another component follows; loop
`0x0040`	WE_HAVE_AN_X_AND_Y_SCALE	two `F2Dot14` follow
`0x0080`	WE_HAVE_A_TWO_BY_TWO	four `F2Dot14` (2×2 matrix) follow
`0x0100`	WE_HAVE_INSTRUCTIONS	after the last component: `uint16` length + instruction bytes
`0x0200`	USE_MY_METRICS
`0x0400`	OVERLAP_COMPOUND

Type	Name	Notes
`uint16`	platformID	0=Unicode, 1=Mac, 3=Windows
`uint16`	encodingID	platform-specific
`Offset32`	subtableOffset	from the start of the `cmap` table

Type	Name	Notes
`uint16`	version	0 or 1
`uint16`	count	number of name records
`Offset16`	storageOffset	start of string storage, from the table start

Type	Name	Notes
`uint16`	version	0–5
`int16`	xAvgCharWidth	weighted average advance width of lower-case chars
`uint16`	usWeightClass	100–900 (matches CSS `font-weight`)
`uint16`	usWidthClass	1–9, condensed → expanded
`uint16`	fsType	embedding permission flags
`int16`	ySubscriptXSize
`int16`	ySubscriptYSize
`int16`	ySubscriptXOffset
`int16`	ySubscriptYOffset
`int16`	ySuperscriptXSize
`int16`	ySuperscriptYSize
`int16`	ySuperscriptXOffset
`int16`	ySuperscriptYOffset
`int16`	yStrikeoutSize
`int16`	yStrikeoutPosition
`int16`	sFamilyClass	IBM font-family classification
`uint8[10]`	panose	10-byte PANOSE classification
`uint32`	ulUnicodeRange1	Unicode block coverage bits 0–31
`uint32`	ulUnicodeRange2	bits 32–63
`uint32`	ulUnicodeRange3	bits 64–95
`uint32`	ulUnicodeRange4	bits 96–127
`Tag`	achVendID	4-char vendor identifier
`uint16`	fsSelection	style flags (italic, bold, regular, …)
`uint16`	usFirstCharIndex	lowest Unicode codepoint in the font
`uint16`	usLastCharIndex	highest Unicode codepoint in the font
`int16`	sTypoAscender	typographic ascender (FUnits)
`int16`	sTypoDescender	typographic descender (FUnits, usually negative)
`int16`	sTypoLineGap	typographic line gap (FUnits)
`uint16`	usWinAscent	Windows ascender metric
`uint16`	usWinDescent	Windows descender metric (positive value)

Type	Name	Notes
`uint32`	ulCodePageRange1	code-page coverage bits 0–31
`uint32`	ulCodePageRange2	bits 32–63

Type	Name	Notes
`int16`	sxHeight	height of lowercase ‘x’ (FUnits)
`int16`	sCapHeight	height of uppercase ‘H’ (FUnits)
`uint16`	usDefaultChar	glyph index for the default character
`uint16`	usBreakChar	glyph index for the word-break char
`uint16`	usMaxContext	max length of target glyph context

Type	Name	Notes
`uint16`	usLowerOpticalPointSize	lower optical size, ×20
`uint16`	usUpperOpticalPointSize	upper optical size, ×20

Type	Name	Notes
`uint16`	version	subtable version (0)
`uint16`	length	total subtable length in bytes (including this header)
`uint16`	coverage	high byte = format (0 or 2); low byte = flags (see below)

Type	Name	Notes
`uint16`	nPairs	number of kern pairs
`uint16`	searchRange	`(largest power of 2 ≤ nPairs) × 6`
`uint16`	entrySelector	`log2(largest power of 2 ≤ nPairs)`
`uint16`	rangeShift	`nPairs × 6 − searchRange`

Type	Name	Notes
`uint16`	left	left glyph index
`uint16`	right	right glyph index
`int16`	value	kern adjustment in FUnits

Table	Page	Table	Page
structure	`otff`	`cmap`	`cmap`
`head`	`head`	`name`	`name`
`maxp`	`maxp`	`post`	`post`
`hhea`	`hhea`	`OS/2`	`os2`
`hmtx`	`hmtx`	`kern`	`kern`
`loca`	`loca`	`glyf`	`glyf`

Parse a Font

Parsing TrueType (TTF) Font Files — A Reference

Contents

1. Introduction

2. Mental model

3. Conventions

Endianness

Data types

4. File structure

4.1 Offset table (sfnt header) — 12 bytes, at byte 0

4.2 Table directory — numTables records of 16 bytes each, at byte 12

5. How to parse

5.1 The parse sequence

5.2 Fixed- vs variable-layout tables — the decode strategy

6. Table reference

6.1 head — font header (54 bytes, fixed)

6.2 maxp — maximum profile

6.3 hhea — horizontal header (36 bytes, fixed)

6.4 hmtx — horizontal metrics (variable)

6.5 loca — index to location (variable)

6.6 glyf — glyph data (variable, the hard one)

Simple glyph (numberOfContours ≥ 0)

Composite glyph (numberOfContours < 0)

6.7 cmap — character-to-glyph mapping (variable)

6.8 name — human-readable strings (variable)

6.9 post — PostScript data (variable, version-dependent)

6.10 OS/2 — OS/2 and Windows metrics (fixed per version)

6.11 kern — kerning (optional, variable)

7. Sources

4.2 Table directory — `numTables` records of 16 bytes each, at byte 12

6.1 `head` — font header (54 bytes, fixed)

6.2 `maxp` — maximum profile

6.3 `hhea` — horizontal header (36 bytes, fixed)

6.4 `hmtx` — horizontal metrics (variable)

6.5 `loca` — index to location (variable)

6.6 `glyf` — glyph data (variable, the hard one)

6.7 `cmap` — character-to-glyph mapping (variable)

6.8 `name` — human-readable strings (variable)

6.9 `post` — PostScript data (variable, version-dependent)

6.10 `OS/2` — OS/2 and Windows metrics (fixed per version)

6.11 `kern` — kerning (optional, variable)