Our highest performance (in terms of accuracy and reliability) base extraction processor. Always stays up to date with the best foundation models available across all of our benchmarks.

Versions

VersionDateChangelog
4.0.0-beta
2025-02-10
- Rolls out a new foundation model that so far outperforms all our internal benchmarks. In Beta due to a few unsupported features (logprobs confidence scoring mainly) and remaining evaluations to be done on some key use cases as well as some stability testing.
- You can use in production now if desired, or simply wait for us to move it out of beta and evaluate it on your use cases in the meantime.
3.10.0
2025-02-09
Significantly improved figure parsing in our pre-processing to better segment out and transform sub-classes of figures (e.g. charts, logos, etc.) in markdown
3.9.0
2024-11-15
Add support for nested arrays and objects. See here for an example schema.
3.8.0
2024-11-04
Add support for nested enums in array and object fields. Make advanced multimodal enabled by default.
3.7.0
2024-10-10
Add support for an new, optimized enum field type. Updates to document pre-processing for handwritten text and dense/large tables.
3.6.0
2024-09-16
Add base support for advanced multimodal features, which can be enabled in the Processor Settings in Studio.
3.5.0
2024-08-27
Several changes to improve extraction performance, including minor model version upgrade, a new bounding box system, and model insights.
3.4.0
2024-08-16
Updates to our document pre-processing to better handle more complex document/table layouts.
3.3.0
2024-07-30
Improvements to signature extraction accuracy.
3.2.0
2024-07-14
Updates to our document pre-processing to better handle checkboxes.
3.1.0
2024-06-01
Updates to our document pre-processing to better handle more complex document/table layouts.
3.0.0
2024-05-14
Promoting a new foundation model to default - it’s faster and more accurate across all of our internal benchmarks for extraction.
2.0.0
2024-04-12
Promoting a new foundation model to default. Very minor increases in accuracy and speed.
1.0.0
2023-08-01
Initial (and now legacy) extraction model.