The Block Object
Detailed information about the Block object structure and types returned by the /parse
API endpoint.
Overview
A Block represents a distinct content element within a document, such as a paragraph of text, a heading, a table, or a figure. Blocks are the fundamental units that make up chunks in parsed documents.
Block Object Structure
The type of object. Always “block”.
A unique identifier for the block, deterministically generated as a hash of the block content.
The type of block. Possible values include:
text
: Regular text contentheading
: Section or document headingssection_heading
: Subsection headingstable
: Tabular data with rows and columnsfigure
: Images, charts, or diagrams
The textual content of the block, formatted according to the target format specified in the parse request.
Additional details specific to the block type. The structure varies depending on the block type.
Metadata about the block.
An array of points defining the polygon that bounds the block on the page.
Each point is an object with x
and y
coordinates.
A simplified rectangular bounding box for the block, derived from the polygon.